Shih-Fu Chang

Orcid: 0000-0003-1444-1205

Affiliations:
  • Columbia University, New York City, USA


According to our database1, Shih-Fu Chang authored at least 557 papers between 1991 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2017, "For contributions to large-scale multimedia content recognition and multimedia information retrieval".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images.
CoRR, 2024

WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization.
CoRR, 2024

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models.
CoRR, 2024

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models.
CoRR, 2024

Detecting Multimodal Situations with Insufficient Context and Abstaining from Baseless Predictions.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Ferret: Refer and Ground Anything Anywhere at Any Granularity.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Personalized Video Comment Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

VIEWS: Entity-Aware News Video Captioning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Training-free Deep Concept Injection Enables Language Models for Video Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

MoDE: CLIP Data Experts via Clustering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What, When, and Where? Self-Supervised Spatio- Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Beyond Grounding: Extracting Fine-Grained Event Hierarchies across Modalities.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Video Summarization: Towards Entity-Aware Captions.
CoRR, 2023

Characterizing Video Question Answering with Sparsified Inputs.
CoRR, 2023

Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs.
CoRR, 2023

PreViTS: Contrastive Pretraining with Video Tracking Supervision.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

TempCLR: Temporal Alignment Representation with Contrastive Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

In Defense of Structural Symbolic Representation for Video Event-Relation Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Supervised Masked Knowledge Distillation for Few-Shot Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Fast Adaptation of Pretrained Contrastive Models for Multi-channel Video-Language Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Non-Sequential Graph Script Induction via Multimedia Grounding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Enhanced Chart Understanding via Visual Language Pre-training on Plot Table Pairs.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Learning from Children: Improving Image-Caption Pretraining via Curriculum.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Video Event Extraction via Tracking Visual States of Arguments.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Beyond Triplet Loss: Meta Prototypical N-Tuple Loss for Person Re-identification.
IEEE Trans. Multim., 2022

Augmentation Invariant and Instance Spreading Feature for Softmax Embedding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multimodal Event Graphs: Towards Event Centric Understanding of Multimodal World.
CoRR, 2022

Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks.
CoRR, 2022

Multimodal Few-Shot Object Detection with Meta-Learning Based Cross-Modal Prompting.
CoRR, 2022

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks.
CoRR, 2022

Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Few-Shot Gaze Estimation with Model Offset Predictors.
Proceedings of the IEEE International Conference on Acoustics, 2022

Asd-Transformer: Efficient Active Speaker Detection Using Self And Multimodal Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2022

Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Weakly-Supervised Temporal Article Grounding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-Grained Visual Entailment.
Proceedings of the Computer Vision - ECCV 2022, 2022

Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding Across Heads.
Proceedings of the Computer Vision - ECCV 2022, 2022

Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

CLIP-Event: Connecting Text and Images with Event Structures.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task-Adaptive Negative Envision for Few-Shot Open-Set Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Few-Shot Object Detection with Fully Cross-Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

One-Stage Object Referring with Gaze Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Learning To Recognize Procedural Activities with Distant Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Uncertainty-Aware Few-Shot Image Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Partner-Assisted Learning for Few-Shot Image Classification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Joint Multimedia Event Extraction from Video and Article.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Open-Vocabulary Object Detection Using Captions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Discovering Image Manipulation History by Pairwise Relation and Forensics Tools.
IEEE J. Sel. Top. Signal Process., 2020

Task-Adaptive Negative Class Envision for Few-Shot Open-Set Recognition.
CoRR, 2020

Neuro-Symbolic Representations for Video Captioning: A Case for Leveraging Inductive Biases for Vision and Language.
CoRR, 2020

Weakly-supervised VisualBERT: Pre-training without Parallel Images and Captions.
CoRR, 2020

Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding.
CoRR, 2020

Analogical Reasoning for Visually Grounded Language Acquisition.
CoRR, 2020

COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation.
CoRR, 2020

Rethinking Classification Loss Designs for Person Re-identification with a Unified View.
CoRR, 2020

Deep Learning Guided Building Reconstruction from Satellite Imagery-derived Point Clouds.
CoRR, 2020

Unifying Specialist Image Embedding into Universal Image Embedding.
CoRR, 2020

Training with Streaming Annotation.
CoRR, 2020

FATE/MM 20: 2nd International Workshop on Fairness, Accountability, Transparency and Ethics in MultiMedia.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Cross-lingual Structure Transfer for Zero-resource Event Extraction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Learning Visual Commonsense for Robust Scene Graph Generation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Bridging Knowledge Graphs to Generate Scene Graphs.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Learn Words from Visual Scenes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Context-Gated Convolution.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly Supervised Visual Semantic Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cross-media Structured Common Space for Multimedia Event Extraction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

GAIA: A Fine-grained Multimedia Knowledge Extraction System.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

General Partial Label Learning via Dual Bipartite Graph Autoencoder.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Special Section on Multimodal Understanding of Social, Affective, and Subjective Attributes.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation.
IEEE Trans. Image Process., 2019

Automatic visual pattern mining from categorical image dataset.
Int. J. Multim. Inf. Retr., 2019

Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition.
CoRR, 2019

Learning to Learn Words from Narrated Video.
CoRR, 2019

LPAT: Learning to Predict Adaptive Threshold for Weakly-supervised Temporal Action Localization.
CoRR, 2019

Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019

CDSA: Cross-Dimensional Self-Attention for Multivariate, Geo-tagged Time Series Imputation.
CoRR, 2019

Detecting and Simulating Artifacts in GAN Fake Images.
Proceedings of the IEEE International Workshop on Information Forensics and Security, 2019


PANEL: Challenges for Multimedia/Multimodal Research in the Next Decade.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

FAT/MM'19: 1st International Workshop on Fairness, Accountability, and Transparency in MultiMedia.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Unsupervised Rank-Preserving Hashing for Large-Scale Image Retrieval.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

One-Shot Learning for Function-Specific Region Segmentation in Mouse Brain.
Proceedings of the 16th IEEE International Symposium on Biomedical Imaging, 2019

Multimodal Social Media Analysis for Gang Violence Prevention.
Proceedings of the Thirteenth International Conference on Web and Social Media, 2019

Counterfactual Critic Multi-Agent Training for Scene Graph Generation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cross-lingual Structure Transfer for Relation and Event Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unsupervised Embedding Learning via Invariant and Spreading Instance Feature.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-Granularity Generator for Temporal Action Proposal.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Urban Semantic 3D Reconstruction From Multiview Satellite Imagery.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Editorial IEEE Transactions on Multimedia Special Section on Video Analytics: Challenges, Algorithms, and Applications.
IEEE Trans. Multim., 2018

Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.
IEEE Trans. Multim., 2018

Model-Driven Feedforward Prediction for Manipulation of Deformable Objects.
IEEE Trans Autom. Sci. Eng., 2018

Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Guest Editorial.
Comput. Vis. Image Underst., 2018

Scene Dynamics: Counterfactual Critic Multi-Agent Training for Scene Graph Generation.
CoRR, 2018

Heated-Up Softmax Embedding.
CoRR, 2018

AutoLoc: Weakly-supervised Temporal Action Localization.
CoRR, 2018

Online Action Detection in Untrimmed, Streaming Videos - Modeling and Evaluation.
CoRR, 2018

Ask not what your postdoc can do for you ...
Commun. ACM, 2018


Low-shot Learning via Covariance-Preserving Adversarial Augmentation Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

PatternNet: Visual Pattern Mining with Deep Neural Network.
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Incorporating Background Knowledge into Video Description Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Entity-aware Image Caption Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Online Detection of Action Start in Untrimmed, Streaming Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

AutoLoc: Weakly-Supervised Temporal Action Localization in Untrimmed Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Grounding Referring Expressions in Images by Variational Context.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Hash Bit Selection for Nearest Neighbor Search.
IEEE Trans. Image Process., 2017

On Binary Embedding using Circulant Matrices.
J. Mach. Learn. Res., 2017

Guest editorial: Multimodal sentiment analysis and mining in the wild.
Image Vis. Comput., 2017

A survey of multimodal sentiment analysis.
Image Vis. Comput., 2017

Multilingual visual sentiment concept clustering and analysis.
Int. J. Multim. Inf. Retr., 2017

Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network.
CoRR, 2017

ConvNet Architecture Search for Spatiotemporal Feature Learning.
CoRR, 2017

Deep Image Set Hashing.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Improving Event Extraction via Multimodal Integration.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

LSVC2017: Large-Scale Video Classification Challenge.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

More Cat than Cute?: Interpretable Prediction of Adjective-Noun Pairs.
Proceedings of the Workshop on Multimodal Understanding of Social, 2017

MUSA2: First ACM Workshop on Multimodal Understanding of Social, Affective and Subjective Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Spread-Out Local Feature Descriptors.
Proceedings of the IEEE International Conference on Computer Vision, 2017

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Discriminative and Transformation Covariant Local Feature Detectors.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Visual Translation Embedding Network for Visual Relation Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Localizing Actions from Video Labels and Pseudo-Annotations.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Learning to Hash for Indexing Big Data - A Survey.
Proc. IEEE, 2016

EventNet Version 1.1 Technical Report.
CoRR, 2016

Generic Instance Search and Re-identification from One Example via Attributes and Categories.
CoRR, 2016

Action Temporal Localization in Untrimmed Videos via Multi-stage CNNs.
CoRR, 2016

Model-Driven Feed-Forward Prediction for Manipulation of Deformable Objects.
CoRR, 2016

Event Specific Multimodal Pattern Mining with Image-Caption Pairs.
CoRR, 2016

Going Deeper for Multilingual Visual Sentiment Detection.
CoRR, 2016

Columbia MVSO Image Sentiment Dataset.
CoRR, 2016

3D shape retrieval using a single depth image from low-cost sensors.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016


Event Specific Multimodal Pattern Mining for Knowledge Base Construction.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Deep Cross Residual Learning for Multitask Visual Recognition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Tamp: A Library for Compact Deep Neural Networks with Structured Matrices.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Placing Broadcast News Videos in their Social Media Context Using Hashtags.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Watching What and How Politicians Discuss Various Topics: A Large-Scale Video Analytics UI.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Multilingual Visual Sentiment Concept Matching.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Complura: Exploring and Leveraging a Large-scale Multilingual Visual Sentiment Ontology.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

SentiCart: Cartography and Geo-contextualization for Multilingual Visual Sentiment.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

New Frontiers of Large Scale Multimedia Information Retrieval.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

PanoSwarm: Collaborative and Synchronized Multi-Device Panoramic Photography.
Proceedings of the 21st International Conference on Intelligent User Interfaces, 2016

Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Interactive Segmentation on RGBD Images via Cue Selection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Multi-media Approach to Cross-lingual Entity Knowledge Transfer.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Uploader Intent for Online Video: Typology, Inference, and Applications.
IEEE Trans. Multim., 2015

Super Fast Event Recognition in Internet Videos.
IEEE Trans. Multim., 2015

Learning Sample Specific Weights for Late Fusion.
IEEE Trans. Image Process., 2015

Assistive Image Comment Robot - A Novel Mid-Level Concept-Based Representation.
IEEE Trans. Affect. Comput., 2015

Spherical Hashing: Binary Code Embedding with Hyperspheres.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Special issue on concept detection with big data.
Int. J. Multim. Inf. Retr., 2015

Deep Transfer Network: Unsupervised Domain Adaptation.
CoRR, 2015

Compact Nonlinear Maps and Circulant Extensions.
CoRR, 2015

CamSwarm: Instantaneous Smartphone Camera Arrays for Collaborative Photography.
CoRR, 2015

Fast Neural Networks with Circulant Projections.
CoRR, 2015

Using a Knowledge Graph to Combat Human Trafficking.
Proceedings of the ISWC 2015 Posters & Demonstrations Track co-located with the 14th International Semantic Web Conference (ISWC-2015), 2015


EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Large Video Event Ontology Browsing, Search and Tagging (EventNet Demo).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

ASM'15: The 1st International Workshop on Affect and Sentiment in Multimedia.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Image Popularity Prediction in Social Media Using Sentiment and Context Features.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Opportunities and Challenges of Industry-Academic Collaborations in Multimedia Research.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Words and Pictures: Crowdsource Discovery beyond Image Semantics.
Proceedings of the Fourth International Workshop on Crowdsourcing for Multimedia, 2015

2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication (HCMC2015).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Encoding Concept Prototypes for Video Event Detection and Summarization.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Regrasping and unfolding of garments using predictive thin shell modeling.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Fast Orthogonal Projection Based on Kronecker Product.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

An Exploration of Parameter Redundancy in Deep Networks with Circulant Projections.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Cross-document Event Coreference Resolution based on Cross-media Features.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

New insights into Laplacian similarity search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Attributes and categories for generic instance search from one example.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015


Low-Rank Similarity Metric Learning in High Dimensions.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Mixed image-keyword query adaptive hashing over multilabel images.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Discovering joint audio-visual codewords for video event detection.
Mach. Vis. Appl., 2014

On Learning with Label Proportions.
CoRR, 2014

Building A Large Concept Bank for Representing Events in Video.
CoRR, 2014

DeepSentiBank: Visual Sentiment Concept Classification with Deep Convolutional Neural Networks.
CoRR, 2014

BBN VISER TRECVID 2014 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Discrete Graph Hashing.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Scalable Visual Instance Mining with Threads of Features.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Modeling Attributes from Category-Attribute Proportions.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Predicting Viewer Perceived Emotions in Animated GIFs.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Object-Based Visual Sentiment Concept Analysis and Application.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Event-Driven Semantic Concept Discovery by Exploiting Weakly Tagged Internet Images.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Predicting Viewer Affective Comments Based on Image Content in Social Media.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Minimally Needed Evidence for Complex Event Recognition in Unconstrained Videos.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Predicting Evoked Emotions in Video.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Real-time pose estimation of deformable objects using a volumetric approach.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Circulant Binary Embedding.
Proceedings of the 31th International Conference on Machine Learning, 2014

Why We Watch the News: A Dataset for Exploring Sentiment in Broadcast Video News.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

From Low-Cost Depth Sensors to CAD: Cross-Domain 3D Shape Retrieval via Regression Tree Fields.
Proceedings of the Computer Vision - ECCV 2014, 2014

Discriminative Indexing for Probabilistic Image Patch Priors.
Proceedings of the Computer Vision - ECCV 2014, 2014

Recognizing Complex Events in Videos by Learning Key Static-Dynamic Evidences.
Proceedings of the Computer Vision - ECCV 2014, 2014

Hash-SVM: Scalable Kernel Machines for Large-Scale Visual Classification.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Video Event Detection by Inferring Temporal Instance Labels.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Locally Linear Hashing for Extracting Non-linear Manifolds.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
How far we've come: Impact of 20 years of multimedia information retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Query-Adaptive Image Search With Hash Codes.
IEEE Trans. Multim., 2013

Semi-supervised learning using greedy max-cut.
J. Mach. Learn. Res., 2013

High-level event recognition in unconstrained videos.
Int. J. Multim. Inf. Retr., 2013

Divide-and-Conquer Subspace Segmentation
CoRR, 2013

BBN VISER TRECVID 2013 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Analyzing the Harmonic Structure in Graph-Based Learning.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

News rover: exploring topical structures and serendipity in heterogeneous multimedia news.
Proceedings of the ACM Multimedia Conference, 2013

Structured exploration of who, what, when, and where in heterogeneous multimedia news sources.
Proceedings of the ACM Multimedia Conference, 2013

Large-scale visual sentiment ontology and detectors using adjective noun pairs.
Proceedings of the ACM Multimedia Conference, 2013

SentiBank: large-scale ontology and classifiers for detecting sentiment and emotions in visual content.
Proceedings of the ACM Multimedia Conference, 2013

Towards a comprehensive computational model foraesthetic assessment of videos.
Proceedings of the ACM Multimedia Conference, 2013

\(\propto\)SVM for Learning with Label Proportions.
Proceedings of the 30th International Conference on Machine Learning, 2013

Large-Scale Video Hashing via Structure Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Distributed Low-Rank Subspace Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Designing Category-Level Attributes for Discriminative Visual Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Label Propagation from ImageNet to 3D Point Clouds.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Sample-Specific Late Fusion for Visual Category Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Hash Bit Selection: A Unified Solution for Selection Problems in Hashing.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

A Bayesian Approach to Multimodal Visual Dictionary Learning.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Robust Object Co-detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Active query sensing: Suggesting the best query view for mobile visual search.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation.
IEEE Trans. Image Process., 2012

Robust and Scalable Graph-Based Semisupervised Learning.
Proc. IEEE, 2012

Web-Scale Multimedia Processing and Applications [Scanning the Issue].
Proc. IEEE, 2012

Semi-Supervised Hashing for Large-Scale Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

BBNVISER : BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Learning with Partially Absorbing Random Walks.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Hybrid social media network.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Submodular video hashing: a unified framework towards video pooling and indexing.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Joint audio-visual bi-modal codewords for video event detection.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Compact hashing for mixed image-keyword query over multi-label images.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Compact Hyperplane Hashing with Bilinear Functions.
Proceedings of the 29th International Conference on Machine Learning, 2012

On the Difficulty of Nearest Neighbor Search.
Proceedings of the 29th International Conference on Machine Learning, 2012

Accelerated Large Scale Optimization by Concomitant Hashing.
Proceedings of the Computer Vision - ECCV 2012, 2012

Scene Aligned Pooling for Complex Video Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Weak attributes for large-scale image retrieval.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Robust late fusion with rank minimization.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Supervised hashing with kernels.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Segmentation using superpixels: A bipartite graph partitioning approach.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Robust visual domain adaptation with low-rank reconstruction.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Spherical hashing.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Mobile product search with Bag of Hash Bits and boundary reranking.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Exploiting web images for event recognition in consumer videos: A multiple source domain adaptation approach.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Modeling Scene and Object Contexts for Human Action Retrieval With Few Examples.
IEEE Trans. Circuits Syst. Video Technol., 2011

Media Search in Mobile Devices [From the Guest Editors].
IEEE Signal Process. Mag., 2011

Anomaly detection in information streams without prior domain knowledge.
IBM J. Res. Dev., 2011


IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

A mobile location search system with active query sensing.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Active query sensing for mobile location search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards low bit rate mobile visual search with multiple-channel coding.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Mobile product search with bag of hash bits.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Content based multimedia retrieval: lessons learned from two decades of research.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Consumer video understanding: a benchmark database and an evaluation of human and machine performance.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Lost in binarization: query-adaptive ranking for similar image search with compact codes.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Hashing with Graphs.
Proceedings of the 28th International Conference on Machine Learning, 2011

Towards Optimal Discriminating Order for Multiclass Classification.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Learning component-level sparse representation using histogram information for image classification.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Noise resistant graph ranking for improved web image search.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Compact hashing with joint optimization of search accuracy and time.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Cortically-Coupled Computer Vision.
Proceedings of the Brain-Computer Interfaces, 2010

Audio-visual atoms for generic video concept classification.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Semi-supervised distance metric learning for collaborative image retrieval and clustering.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Camera Response Functions for Image Forensics: An Automatic Algorithm for Splicing Detection.
IEEE Trans. Inf. Forensics Secur., 2010

Near Duplicate Identification With Spatially Aligned Pyramid Matching.
IEEE Trans. Circuits Syst. Video Technol., 2010

In a Blink of an Eye and a Switch of a Transistor: Cortically Coupled Computer Vision.
Proc. IEEE, 2010

Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Scalable similarity search with optimized kernel hashing.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Sequential Projection Learning for Hashing with Compact Codes.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Large Graph Construction for Scalable Semi-Supervised Learning.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Single-view recaptured image detection based on physics-based features.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Semi-supervised hashing for scalable image retrieval.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Identifying and prefiltering images.
IEEE Signal Process. Mag., 2009

Enhancing Bilinear Subspace Learning by Element Rearrangement.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

An image score inference system for RNAi genome-wide screening based on fuzzy mixture regression modeling.
J. Biomed. Informatics, 2009

VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Brain state decoding for rapid image retrieval.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Semantic context transfer across heterogeneous sources for domain adaptive video search.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Short-term audio-visual atoms for generic video concept classification.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Mobile media search: has media search finally found its perfect platform? part II.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Graph construction and <i>b</i>-matching for semi-supervised learning.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Muti-scale temporal segmentation and outlier detection in sensor networks.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Domain adaptive semantic diffusion for large scale context-based video annotation.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Visual saliency with side information.
Proceedings of the IEEE International Conference on Acoustics, 2009

Label diagnosis through self tuning forweb image search.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Robust multi-class transductive learning with graphs.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
An Introduction to the Special Issue on Event Analysis in Videos.
IEEE Trans. Circuits Syst. Video Technol., 2008

Computers, Robotics, and the Human Brain [From the Editor].
IEEE Signal Process. Mag., 2008

Quality-Optimized and Secure End-to-End Authentication for Media Delivery.
Proc. IEEE, 2008

Query-Adaptive Fusion for Multimodal Search.
Proc. IEEE, 2008

Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

SIFT-Bag kernel for video event analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Internet image archaeology: automatically tracing the manipulation history of photographs on the web.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

CuZero: embracing the frontier of interactive visual search for informed users.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Graph transduction via alternating minimization.
Proceedings of the Machine Learning, 2008

Cross-domain learning methods for high-level visual concept classification.
Proceedings of the International Conference on Image Processing, 2008

Semantic Concept Classification by Joint Semi-supervised Learning of Feature Subspaces and Support Vector Machines.
Proceedings of the Computer Vision, 2008

Near duplicate image identification with patially Aligned Pyramid Matching.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Active microscopic cellular image annotation by superposable graph transduction with imbalanced labels.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Semi-supervised distance metric learning for Collaborative Image Retrieval.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Fast kernel learning for spatial pyramid matching.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Future directions in multimedia retrieval: : impact of new technology.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Visual islands: intuitive browsing of visual search results.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Columbia University's semantic video search engine 2008.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Relevance aggregation projections for image retrieval.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Statistical fusion of multiple cues for image tampering detection.
Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers, 2008

2007
Utility-Based Video Adaptation for Universal Multimedia Access (UMA) and Content-Based Utility Function Prediction for Real-Time Video Transcoding.
IEEE Trans. Multim., 2007

Learn how to learn [From the Editor].
IEEE Signal Process. Mag., 2007

Enabling MPEG-7 structural and semantic descriptions in retrieval applications.
J. Assoc. Inf. Sci. Technol., 2007

Reranking Methods for Visual Search.
IEEE Multim., 2007

Recent Advances and Open Issues of Digital Image/Video Search.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Columbia University TRECVID 2007 High-Level Feature Extraction.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Video search reranking through random walk over document-level context graph.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Kodak's consumer video benchmark data set: concept definition and annotation.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

New challenges in multimedia research for the increasingly connected and fast growing digital society.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Large-scale multimodal semantic concept detection for consumer video.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Blind Passive Media Forensics: Motivation and Opportunity.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Dynamic Local Tracing for 3d Axon Curvilinear Structure Detection from Microscopic Image Stack.
Proceedings of the 2007 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2007

Image Splicing Detection using Camera Response Function Consistency and Automatic Segmentation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Context-Based Concept Fusion with Boosted Conditional Random Fields.
Proceedings of the IEEE International Conference on Acoustics, 2007

Recent Advances and Challenges of Semantic Image/Video Search.
Proceedings of the IEEE International Conference on Acoustics, 2007

Element Rearrangement for Tensor-Based Subspace Learning.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Visual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Using Geometry Invariants for Camera Response Function Estimation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Kernel Sharing With Joint Boosting For Multi-Class Concept Detection.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A reranking approach for context-based concept fusion in video indexing and retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Columbia University's semantic video search engine.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
New semi-fragile image authentication watermarking techniques using random bias and nonuniform quantization.
IEEE Trans. Multim., 2006

Making luck happen [from the Editor].
IEEE Signal Process. Mag., 2006

You have been reading...so what do you think? [from The Editor].
IEEE Signal Process. Mag., 2006

"First who ... Then what" [from the Editor].
IEEE Signal Process. Mag., 2006

Challenges in the new year [from The Editor].
IEEE Signal Process. Mag., 2006

Large-Scale Concept Ontology for Multimedia.
IEEE Multim., 2006

Columbia University TRECVID-2006 Video Search and High-Level Feature Extraction.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

An online system for classifying computer graphics images from natural photographs.
Proceedings of the Security, Steganography, and Watermarking of Multimedia Contents VIII, 2006

Video search reranking via information bottleneck principle.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Concept-based electronic health records: opportunities and challenges.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

To search or to label?: predicting the performance of search-based automatic image classifiers.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

Pattern Mining in Visual Concept Streams.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Detecting Image Splicing using Geometry Invariants and Camera Characteristics Consistency.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Visual Event Detection using Multi-Dimensional Concept Dynamics.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Active Context-Based Concept Fusionwith Partial User Labels.
Proceedings of the International Conference on Image Processing, 2006

Topic Tracking Across Broadcast News Videos with Visual Duplicates and Semantic Concepts.
Proceedings of the International Conference on Image Processing, 2006

Complexity Adaptive H.264 Encoding for Light Weight Streams.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Modeling the Activity Pattern of the Constellation of Cardiac Chambers in Echocardiogram Videos.
Proceedings of the Computer Vision Approaches to Medical Image Analysis, 2006

A Generative-Discriminative Hybrid Method for Multi-View Object Detection.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Exploring the Dynamics of Visual Events in the Multi-dimensional Semantic Concept Space.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Exploring Text and Image Features to Classify Images in Bioscience Literature.
Proceedings of the Workshop on Linking Natural Language and Biology, 2006

2005
A Robust and Secure Media Signature Scheme for JPEG Images.
J. VLSI Signal Process., 2005

A secure and robust digital signature scheme for JPEG2000 image authentication.
IEEE Trans. Multim., 2005

Classification-based multidimensional adaptation prediction for scalable video coding using subjective quality evaluation.
IEEE Trans. Circuits Syst. Video Technol., 2005

Video Adaptation: Concepts, Technologies, and Open Issues.
Proc. IEEE, 2005

A Crypto Signature Scheme For Image Authentication Over Wireless Channel.
Int. J. Image Graph., 2005

Columbia University TRECVID-2005 Video Search and High-Level Feature Extraction.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

A Framework for Sub-Window Shot Detection.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Physics-motivated features for distinguishing photographic images and computer graphics.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Automatic discovery of query-class-dependent models for multimodal search.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications].
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Commercial Detection in Heterogeneous Video Streams Using Fused Multi-Modal and Temporal Features.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Combining text and audio-visual features in video indexing.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Visual Cue Cluster Construction via Information Bottleneck Principle and Kernel Density Estimation.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

2004
Structure analysis of soccer video with domain knowledge and hidden Markov models.
Pattern Recognit. Lett., 2004

Real-time view recognition and event detection for sports video.
J. Vis. Commun. Image Represent., 2004

Multimedia database management systems.
J. Vis. Commun. Image Represent., 2004

Predicting optimal operation of MC-3DSBC multidimensional scalable video coding using subjective quality measurement.
Proceedings of the Visual Communications and Image Processing 2004, 2004


Discovery and fusion of salient multimodal features toward news story segmentation.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Detecting image near-duplicate by stochastic attributed relational graph matching with learning.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Story boundary detection in large broadcast news video archives: techniques, experience and trends.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Blind detection of photomontage using higher order statistics.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Semantic video clustering across sources using bipartite spectral clustering.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Color-mood analysis of films based on syntactic and psychological models.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Subjective preference of spatio-temporal rate in video adaptation using multi-dimensional scalable coding.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Understanding and modeling user interests in consumer videos.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Discovering meaningful multimedia patterns with audio-visual concepts and associated text.
Proceedings of the 2004 International Conference on Image Processing, 2004

A model for image splicing.
Proceedings of the 2004 International Conference on Image Processing, 2004

Video mining: pattern discovery versus pattern recognition.
Proceedings of the 2004 International Conference on Image Processing, 2004

News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Learning to Detect Scene Text Using a Higher-Order MRF with Belief Propagation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Automatic View Recognition in Echocardiogram Videos Using Parts-Based Representation.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Pattern Mining in Large-Scale Image and Video Sources p.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

2003
Survey of compressed-domain features used in audio-visual indexing and analysis.
J. Vis. Commun. Image Represent., 2003

Discovery and Fusion of Salient Multi-modal Features Towards News Story Segmentation.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003

IBM Research TRECVID-2003 Video Retrieval System.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003

Mining spatio-temporal patterns and knowledge structures in multimedia collection.
Proceedings of the First ACM International Workshop on Multimedia Databases, 2003

Unsupervised discovery of multilevel statistical video structures using hierarchical hidden Markov models.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Content-adaptive utility-based video adaptation.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A statistical framework for fusing mid-level perceptual features in news story segmentation.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Feature selection for unsupervised discovery of statistical temporal structures in video.
Proceedings of the 2003 International Conference on Image Processing, 2003

Content-based utility function prediction for real-time MPEG-4 video transcoding.
Proceedings of the 2003 International Conference on Image Processing, 2003

Image classification using multimedia knowledge networks.
Proceedings of the 2003 International Conference on Image Processing, 2003

Content-based video summarization and adaptation for ubiquitous media access.
Proceedings of the 12th International Conference on Image Analysis and Processing (ICIAP 2003), 2003

A Bayesian Framework for Fusing Multiple Word Knowledge Models in Videotext Recognition.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

2002
Computable scenes and structures in films.
IEEE Trans. Multim., 2002

Special issue on multimedia adaptation.
Signal Process. Image Commun., 2002

The Holy Grail of Content-Based.
IEEE Multim., 2002

New semifragile image authentication watermarking techniques using random bias and nonuniform quantization.
Proceedings of the Security and Watermarking of Multimedia Contents IV, 2002

Event detection in baseball video using superimposed caption recognition.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

A utility framework for the automatic generation of audio-visual skims.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Duplicate detection in consumer photography and news video.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Multimedia Knowledge Integration, Summarization And Evaluation.
Proceedings of the Third International Workshop on Multimedia Data Mining, 2002

FGS+: optimizing the joint SNR-temporal video quality in MPEG-4 fine grained scalable coding.
Proceedings of the 2002 International Symposium on Circuits and Systems, 2002

Semantic knowledge construction from annotated image collections.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Perceptual knowledge construction from annotated image collections.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

General and domain-specific techniques for detecting and recognizing superimposed text in video.
Proceedings of the 2002 International Conference on Image Processing, 2002

Video skims: taxonomies and an optimal generation framework.
Proceedings of the 2002 International Conference on Image Processing, 2002

A quantitative semi-fragile JPEG2000 image authentication system.
Proceedings of the 2002 International Conference on Image Processing, 2002

Semi-fragile image authentication using generic wavelet domain features and ECC.
Proceedings of the 2002 International Conference on Image Processing, 2002

Echocardiogram videos: summarization, temporal segmentation and browsing.
Proceedings of the 2002 International Conference on Image Processing, 2002

Structure analysis of soccer video with hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 2002

A robust and secure media signature scheme for JPEG images.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001
A robust image authentication method distinguishing JPEG compression from malicious manipulation.
IEEE Trans. Circuits Syst. Video Technol., 2001

Overview of the MPEG-7 standard.
IEEE Trans. Circuits Syst. Video Technol., 2001

Introduction to the special issue on MPEG-7.
IEEE Trans. Circuits Syst. Video Technol., 2001

A conceptual framework and empirical research for classifying visual descriptors.
J. Assoc. Inf. Sci. Technol., 2001

Learning Structured Visual Detectors from User Input at Multiple Levels.
Int. J. Image Graph., 2001

Real-time personalized sports video filtering and summarization.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

SARI: self-authentication-and-recovery image watermarking system.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

IMKA: a multimedia organization system combining perceptual and semantic knowledge.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

PERSIVAL, a system for personalized search and summarization over multimedia healthcare information.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

View segmentation and static/dynamic summary generation for echocardiogram videos.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

Watermarking Capacity of Digital Images Based on Domain-Specific Masking Effects.
Proceedings of the 2001 International Symposium on Information Technology (ITCC 2001), 2001

Structure Analysis of Sports Video Using Domain Models.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Algorithms And System For Segmentation And Structure Analysis In Soccer Video.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Condensing Computable Scenes Using Visual Complexity And Film Syntax Analysis.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Long-term moving object segmentation and tracking using spatio-temporal consistency.
Proceedings of the 2001 International Conference on Image Processing, 2001

Zero-error information hiding capacity of digital images.
Proceedings of the 2001 International Conference on Image Processing, 2001

VISMap: an interactive image/video retrieval system using visualization and concept maps.
Proceedings of the 2001 International Conference on Image Processing, 2001

Using human observer eye movements in automatic image classifiers.
Proceedings of the Human Vision and Electronic Imaging VI, 2001

MPEG-7 MDS Content Description Tools and Applications.
Proceedings of the Computer Analysis of Images and Patterns, 9th International Conference, 2001

2000
Video-server retrieval scheduling and resource reservation for variable bit rate scalable video.
IEEE Trans. Circuits Syst. Video Technol., 2000

A practical methodology for guaranteeing quality of service for video-on-demand.
IEEE Trans. Circuits Syst. Video Technol., 2000

Object-based multimedia content description schemes and applications for MPEG-7.
Signal Process. Image Commun., 2000

Error-resilient transcoding for video over wireless channels.
IEEE J. Sel. Areas Commun., 2000

VQ-based digital signature scheme for multimedia content authentication.
Proceedings of the Security and Watermarking of Multimedia Contents II, 2000

Semifragile watermarking for authenticating JPEG visual content.
Proceedings of the Security and Watermarking of Multimedia Contents II, 2000

Automatic selection of visual features and classifiers.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

Motion trajectory matching of video objects.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

Determining computable scenes in films and their structures using audio-visual memory models.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Principles and applications of content-aware video communication.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2000

Video Scene Segmentation using Video and Audio Features.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Image Retrieval with Sketches and Compositions.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

A Knowledge Engineering Approach for Image Classification based on Probabilistic Reasoning Systems.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Generating Semantic Visual Templates for Video Databases.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Structural and Semantic Analysis of Video.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Experiments in Constructing Belief Networks for Image Classification Systems.
Proceedings of the 2000 International Conference on Image Processing, 2000

Discovering Recurrent Visual Semantics in Consumer Photographs.
Proceedings of the 2000 International Conference on Image Processing, 2000

Audio scene segmentation using multiple features, models and time scales.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
An integrated approach for content-based video object segmentation and retrieval.
IEEE Trans. Circuits Syst. Video Technol., 1999

Introduction to the special issue on object-based video coding and description.
IEEE Trans. Circuits Syst. Video Technol., 1999

Objective and subjective quality of service performance of video-on-demand in ATM-WAN.
Signal Process. Image Commun., 1999

Integrated Spatial and Feature Image Query.
Multim. Syst., 1999

Searching and Editing MPEG-Compressed Video in a Distributed Online Environment.
Multim. Syst., 1999

Editorial: Content-Processing for Video Browsing, Retrieval, and Editing.
Multim. Syst., 1999

Image Retrieval: Current Techniques, Promising Directions, and Open Issues.
J. Vis. Commun. Image Represent., 1999

GUEST EDITORS' INTRODUCTION: Content-Based Access of Image and Video Libraries.
Comput. Vis. Image Underst., 1999

Issues and solutions for authenticating MPEG video.
Proceedings of the Security and Watermarking of Multimedia Contents, 1999

Efficient video sequence retrieval in large repositories.
Proceedings of the Storage and Retrieval for Image and Video Databases VII, 1999

Model-based classification of visual information for content-based retrieval.
Proceedings of the Storage and Retrieval for Image and Video Databases VII, 1999

Multimedia access and retrieval: the state of the art and future directions (panel session).
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Region Feature Based Similarity Searching of Semantic Video Objects.
Proceedings of the 1999 International Conference on Image Processing, 1999

1998
A fully automated content-based video search engine supporting spatiotemporal queries.
IEEE Trans. Circuits Syst. Video Technol., 1998

Effective algorithms for video transmission over wireless channels.
Signal Process. Image Commun., 1998

Database Research at Columbia University.
SIGMOD Rec., 1998

Next-generation content representation, creation, and searching for new-media applications in education.
Proc. IEEE, 1998

Special Issue on Image Technology for World Wide Web Applications: Guest Editors' Comments.
J. Vis. Commun. Image Represent., 1998

Using Relevance Feedback in Content-Based Image Metasearch.
IEEE Internet Comput., 1998

Generating Multimedia Briefings: Coordinating Language and Illustration.
Artif. Intell., 1998

High Performance Digital Video Servers: Storage and Retrieval of Compressed Scalable Video.
Adv. Comput., 1998

VideoQ: a fully automated video retrieval system using motion sketches.
Proceedings of the Proceedings Fourth IEEE Workshop on Applications of Computer Vision, 1998

Robust Image Authentication Method Surviving JPEG Lossy Compression.
Proceedings of the Storage and Retrieval for Image and Video Databases VI, 1998

MetaSEEk: A Content-Based Metasearch Engine for Images.
Proceedings of the Storage and Retrieval for Image and Video Databases VI, 1998

AMOS: An Active System for MPEG-4 Video Object Segmentation.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Video Transcoding for Resilience in Wireless Channels.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Embedding Visible Video Watermarks in the Compressed Domain.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Semantic Visual Templates: Linking Visual Features to Semantics.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Digital image/video library and MPEG-7: standardization and research issues.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
A highly efficient system for automatic face region detection in MPEG video.
IEEE Trans. Circuits Syst. Video Technol., 1997

Content-based indexing and retrieval of visual information.
IEEE Signal Process. Mag., 1997

Hybrid object-based/block-based coding in video compression at very low bit-rate.
Signal Process. Image Commun., 1997

DAVIC and Interoperability Experiments.
Multim. Tools Appl., 1997

Columbia's VoD and Multimedia Research Testbed with Heterogeneous Network Support.
Multim. Tools Appl., 1997

Guest Editorial.
Multim. Tools Appl., 1997

Columbia Digital News Project: An Environment for Briefing and Search over Multimedia Information.
Int. J. Digit. Libr., 1997

Visually Searching the Web for Content.
IEEE Multim., 1997

Finding Images/Video in Large Archives: Columbia's Content-Based Visual Query Project.
D Lib Mag., 1997

Visual Information Retrieval from Large Distributed Online Repositories.
Commun. ACM, 1997

Image and Video Search Engine for the World Wide Web.
Proceedings of the Storage and Retrieval for Image and Video Databases V, 1997

Enhancing image search engines in visual information environments.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

SaFe: a general framework for integrated spatial and feature image search.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

WebClip: a WWW video editing/browsing system.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

A distributed system for editing and browsing compressed video over the network.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

VideoQ: An Automated Content Based Video Search System Using Visual Cues.
Proceedings of the Fifth ACM International Conference on Multimedia '97, 1997

Spatio-Temporal Video Search Using the Object-Based Video Representation.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

Joint adaptive space and frequency basis selection.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

Exploring Image Functionalities in WWW Applications- Development of Image/Video Search and Editing Engines.
Proceedings of the Proceedings 1997 International Conference on Image Processing, 1997

Columbia Digital News System An Environment for Briefing and Search over Multimedia Information.
Proceedings of the 4th International Forum on Research and Technology Advances in Digital Libraries (ADL '97), 1997

1996
Development of Columbia's video on demand testbed.
Signal Process. Image Commun., 1996

Clustering Methods for Video Browsing and Annotation.
Proceedings of the Storage and Retrieval for Still Image and Video Databases IV, 1996

Tools and Techniques for Color Image Retrieval.
Proceedings of the Storage and Retrieval for Still Image and Video Databases IV, 1996

Tools for Compressed-Domain Video Indexing and Editing.
Proceedings of the Storage and Retrieval for Still Image and Video Databases IV, 1996

VisualSEEk: A Fully Automated Content-Based Image Query System.
Proceedings of the Forth ACM International Conference on Multimedia '96, 1996

CVEPS - A Compressed Video Editing and Parsing System.
Proceedings of the Forth ACM International Conference on Multimedia '96, 1996

Video Server Retrieval Scheduling for Variable Bit Rate Scalable Video.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1996

Local color and texture extraction and spatial query.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

A robust content based digital signature for image authentication.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

A content based video traffic model using camera operations.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

Automated binary texture feature sets for image retrieval.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Efficient Techniques for Feature-Based Image/Video Access and Manipulation.
Proceedings of the Data Processing Clinic: Digital Image Access & Retrieval, 1996

Hybrid Block-Based/Segment-Based Video Compression at very Low Bitrate.
Proceedings of the 6th Data Compression Conference (DCC '96), Snowbird, Utah, USA, March 31, 1996

1995
Manipulation and Compositing of MC-DCT Compressed Video.
IEEE J. Sel. Areas Commun., 1995

Exploring Functionalities in the Compressed Image/Video Domain.
ACM Comput. Surv., 1995

Scalable MPEG2 Video Servers with Heterogeneous QoS on Parallel Disk Arrays.
Proceedings of the Network and Operating System Support for Digital Audio and Video, 1995

Single color extraction and image query.
Proceedings of the Proceedings 1995 International Conference on Image Processing, 1995

Compressed-domain techniques for image/video indexing and manipulation.
Proceedings of the Proceedings 1995 International Conference on Image Processing, 1995

Frequency and spatially adaptive wavelet packets.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Quad-Tree Segmentation for Texture-Based Image Query.
Proceedings of the Second ACM International Conference on Multimedia '94, 1994

Error Accumulation of Repetitive Image Coding.
Proceedings of the 1994 IEEE International Symposium on Circuits and Systems, ISCAS 1994, London, England, UK, May 30, 1994

Transform Features for Texture Classification and Discrimination in Large Image Databases.
Proceedings of the Proceedings 1994 International Conference on Image Processing, 1994

1993
Transform Coding of Arbitrarily-Shaped Image Segments.
Proceedings of the First ACM International Conference on Multimedia '93, 1993

A new approach to decoding and compositing motion-compensated DCT-based images.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Designing high-throughput VLC decoder. I. Concurrent VLSI architectures.
IEEE Trans. Circuits Syst. Video Technol., 1992

Compositing motion-compensated video within the network.
Comput. Commun. Rev., 1992

1991
VLSI Designs for High-Speed Huffman Decoder.
Proceedings of the Proceedings 1991 IEEE International Conference on Computer Design: VLSI in Computer & Processors, 1991


  Loading...