Chong-Wah Ngo

Orcid: 0000-0003-4182-8261

Affiliations:
  • Singapore Management University, School of Computing and Information Systems, Singapore
  • City University of Hong Kong, Department of Computer Science, Hong Kong
  • Hong Kong University of Science and Technology, Hong Kong (PhD 2000)


According to our database, Chong-Wah Ngo authored at least 290 papers between 1996 and 2024.

Bibliography

2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition.
Int. J. Comput. Vis., December, 2024

(Un)likelihood Training for Interpretable Embedding.
ACM Trans. Inf. Syst., May, 2024

FoodMask: Real-time food instance counting, segmentation and recognition.
Pattern Recognit., February, 2024

Learning Temporal Dynamics in Videos With Image Transformer.
IEEE Trans. Multim., 2024

The ACM Web Conference 2024 Report.
SIGWEB Newsl., 2024

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024

Towards Multimodal Emotional Support Conversation Systems.
CoRR, 2024

RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models.
CoRR, 2024

LLM-based query paraphrasing for video search.
CoRR, 2024

Leveraging LLMs and Generative Models for Interactive Known-Item Video Search.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Navigating Weight Prediction with Diet Diary.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Improving Interpretable Embeddings for Ad-hoc Video Search with Generative Captions and Multi-word Concept Bank.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
FoodLMM: A Versatile Food Assistant using Large Multi-modal Model.
CoRR, 2023

Incremental Learning on Food Instance Segmentation.
CoRR, 2023

GroundNLQ @ Ego4D Natural Language Queries Challenge 2023.
CoRR, 2023

Cross-domain Food Image-to-Recipe Retrieval by Weighted Adversarial Learning.
CoRR, 2023

Reinforcement Learning Enhanced PicHunter for Interactive Search.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Adaptive Split-Fusion Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Deeply Activated Salient Region for Instance Search.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance.
IEEE Trans. Multim., 2022

Approximate k-NN Graph Construction: A Generic Online Approach.
IEEE Trans. Multim., 2022

Mixed Dish Recognition With Contextual Relation and Domain Alignment.
IEEE Trans. Multim., 2022

On the Merge of k-NN Graph.
IEEE Trans. Big Data, 2022

SibNet: Food instance counting and segmentation.
Pattern Recognit., 2022

An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022.
CoRR, 2022

Adaptive Split-Fusion Transformer.
CoRR, 2022

Reinforcement Learning-Based Interactive Video Search.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Long-term Leap Attention, Short-term Periodic Shift for Video Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Interactive Video Corpus Moment Retrieval using Reinforcement Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Cross-lingual Adaptation for Recipe Retrieval with Mixup.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamic Temporal Filtering in Video Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Group Contextualization for Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Interactive Search vs. Automatic Search: An Extensive Study on Video Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2021

A Hybrid Approach for Detecting Prerequisite Relations in Multi-Modal Food Recipes.
IEEE Trans. Multim., 2021

Learning to Match Anchor-Target Video Pairs With Dual Attentional Holographic Networks.
IEEE Trans. Image Process., 2021

A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition.
IEEE Trans. Image Process., 2021

Pyramid Fusion Dark Channel Prior for Single Image Dehazing.
CoRR, 2021

SQL-Like Interpretable Interactive Video Search.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Token Shift Transformer for Video Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

AIxFood'21: 3rd Workshop on AIxFood.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Optimization Planning for 3D ConvNets.
Proceedings of the 38th International Conference on Machine Learning, 2021

Condensing a Sequence to One Informative Frame for Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Boosting Video Representation Learning With Multi-Faceted Integration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

k-sums Clustering: A Stochastic Optimization Approach.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Terrace-based Food Counting and Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking.
IEEE Trans. Multim., 2020

k-sums: another side of k-means.
CoRR, 2020

Deeply Activated Salient Region for Instance Search.
CoRR, 2020

VIREO @ TRECVid 2020: Ad-hoc Video Search.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

VIREO @ Video Browser Showdown 2020.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Cross-domain Cross-modal Food Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Interpretable Embedding for Ad-Hoc Video Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-modal Cooking Workflow Construction for Food Recipes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Person-level Action Recognition in Complex Events via TSD-TSM Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

CookGAN: Causality Based Text-to-Image Synthesis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Transferring and Regularizing Prediction for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Hyperbolic Visual Embedding Learning for Zero-Shot Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Zero-Shot Ingredient Recognition by Multi-Relational Graph Convolutional Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Fine Granularity Object-Level Representation for Event Detection and Recounting.
IEEE Trans. Multim., 2019

Special issue on multimedia recommendation and multi-modal data analysis.
Multim. Syst., 2019

vireoJD-MM at Activity Detection in Extended Videos.
CoRR, 2019

VIREO-EURECOM @ TRECVID 2019: Ad-hoc Video Search (AVS).
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

VireoJD-MM @ TRECVid 2019: Activities in Extended Video (ActEV).
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

EURECOM at TRECVid AVS 2019.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

VIREO @ Video Browser Showdown 2019.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

DietLens-Eout: Large Scale Restaurant Food Photo Recognition.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Fusion of Multimodal Embeddings for Ad-Hoc Video Search.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

R2GAN: Cross-Modal Recipe Retrieval With Generative Adversarial Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Spatio-Temporal Representation With Local and Global Diffusion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Transferrable Prototypical Networks for Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploring Object Relation in Mean Teacher for Cross-Domain Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Mixed Dish Recognition through Multi-Label Learning.
Proceedings of the 11th Workshop on Multimedia for Cooking and Eating Activities, 2019

2018
Video Summarization.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Video Shot Detection.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

PageSense: Toward Stylewise Contextual Advertising via Visual Analysis of Web Pages.
IEEE Trans. Circuits Syst. Video Technol., 2018

Cross-modal recipe retrieval with stacked attention model.
Multim. Tools Appl., 2018

k-means: A revisit.
Neurocomputing, 2018

The VIREO KIS at VBS 2018.
CoRR, 2018

Enhanced VIREO KIS at VBS 2018.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Food Photo Recognition for Dietary Tracking: System and Experiment.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Interpretable Multimodal Retrieval for Fashion Products.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
Semantic Reasoning in Zero Example Video Event Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2017

VIREO @ TRECVID 2017: Video-to-Text, Ad-hoc Video Search, and Video hyperlinking.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Deep Learning for Food Recognition.
Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017

Concept-Based Interactive Search System.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Cross-Modal Recipe Retrieval: How to Cook this Dish?
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Cross-modal Recipe Retrieval with Rich Food Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

PIC2DISH: A Customized Cooking Assistant System.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

On the Selection of Anchors and Targets for Video Hyperlinking.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Video Indexing, Search, Detection, and Description with Focus on TRECVID.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Color-Sketch Simulator: A Guide for Color-Based Visual Known-Item Search.
Proceedings of the Advanced Data Mining and Applications - 13th International Conference, 2017

2016
Opinion Question Answering by Sentiment Clip Localization.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Fast Covariant VLAD for Image Search.
IEEE Trans. Multim., 2016

Hierarchical Visualization of Video Search Results for Topic-Based Browsing.
IEEE Trans. Multim., 2016

Automatic Hookworm Detection in Wireless Capsule Endoscopy Images.
IEEE Trans. Medical Imaging, 2016

Hyperlink-Aware Object Retrieval.
IEEE Trans. Image Process., 2016

Detection of bird nests in overhead catenary system images for high-speed rail.
Pattern Recognit., 2016

On the use of commonsense ontology for multimedia event recounting.
Int. J. Multim. Inf. Retr., 2016

Blind late fusion in multimedia event retrieval.
Int. J. Multim. Inf. Retr., 2016

Boost K-Means.
CoRR, 2016

VIREO @ TRECVID 2016: Multimedia Event Detection, Ad-hoc Video Search, Video to Text Description.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Deep-based Ingredient Recognition for Cooking Recipe Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Serendipity-driven Celebrity Video Hyperlinking.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

2015
Topological Spatial Verification for Instance Search.
IEEE Trans. Multim., 2015

Deep Multimodal Learning for Affective Analysis and Retrieval.
IEEE Trans. Multim., 2015

Unsupervised Celebrity Face Naming in Web Videos.
IEEE Trans. Multim., 2015

Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling.
IEEE Trans. Image Process., 2015

Click-boosting multi-modality graph-based reranking for image search.
Multim. Syst., 2015

VIREO-TNO @ TRECVID 2015: Multimedia Event Detection and Video Hyperlinking.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Multimedia COMMONS - Community-Organized Multimodal Mining: Opportunities for Novel Solutions (MMCommons Workshop 2015).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multimodal Learning with Deep Boltzmann Machine for Emotion Prediction in User Generated Videos.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Improving Automatic Name-Face Association using Celebrity Images on the Web.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Learning Query and Image Similarities with Ranking Canonical Correlation Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Semi-supervised Domain Adaptation with Subspace Learning for visual recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
A Hamming Embedding Kernel with Informative Bag-of-Visual Words for Video Semantic Indexing.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Placing Videos on a Semantic Hierarchy for Search Result Navigation.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Video Event Detection Using Motion Relativity and Feature Selection.
IEEE Trans. Multim., 2014

Visual Typo Correction by Collocative Optimization: A Case Study on Merchandize Images.
IEEE Trans. Image Process., 2014

Multimedia modeling.
Multim. Tools Appl., 2014

Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.
J. Comput. Sci. Technol., 2014

Collaborative error reduction for hierarchical classification.
Comput. Vis. Image Underst., 2014

VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER).
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

VIREO @ TRECVID 2014: Instance Search and Semantic Indexing.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Click-through-based cross-view learning for image search.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Scalable Visual Instance Mining with Threads of Features.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Click-through-based Subspace Learning for Image Search.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

CeleLabel: an interactive system for annotating celebrities in web videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

CeleBrowser: An example of browsing big data on small device.
Proceedings of the International Conference on Multimedia Retrieval, 2014

2013
Flip-Invariant SIFT for Copy and Object Detection.
IEEE Trans. Image Process., 2013

Circular Reranking for Visual Search.
IEEE Trans. Image Process., 2013

Guest editorial: selected papers from ICIMCS 2011.
Multim. Syst., 2013

Web-Scale Near-Duplicate Search: Techniques and Applications.
IEEE Multim., 2013

Near-duplicate video retrieval: Current research and future trends.
ACM Comput. Surv., 2013

Unified entity search in social media community.
Proceedings of the 22nd International World Wide Web Conference, 2013

VIREO/ECNU @ TRECVID 2013: A Video Dance of Detection, Recounting and Search with Motion Relativity and Concept Learning from Wild.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Error recovered hierarchical classification.
Proceedings of the ACM Multimedia Conference, 2013

Annotation for free: video tagging by mining user search behavior.
Proceedings of the ACM Multimedia Conference, 2013

Image search by graph-based label propagation with image representation from DNN.
Proceedings of the ACM Multimedia Conference, 2013

Searching visual instances with topology checking and context modeling.
Proceedings of the International Conference on Multimedia Retrieval, 2013

The Vireo Team at MediaEval 2013: Violent Scenes Detection by Mid-level Concepts Learnt from Youtube.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Video concept detection by learning from web images: A case study on cross domain learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Click-boosting random walk for image search reranking.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

2012
Boosting web video categorization with contextual information from social web.
World Wide Web, 2012

Sampling and Ontologically Pooling Web Images for Visual Concept Learning.
IEEE Trans. Multim., 2012

Summarizing Rushes Videos by Motion, Object, and Event Understanding.
IEEE Trans. Multim., 2012

Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation.
IEEE Trans. Image Process., 2012

Semantic Indexing and Multimedia Event Detection: ECNU at TRECVID 2012.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

VIREO @ TRECVID 2012: Searching with Topology, Recounting with Small Concepts, Learning with Free Examples.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

FashionAsk: pushing community answers to your fingertips.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Snap-and-ask: answering multimodal question by naming visual instance.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Predicting domain adaptivity: redo or recycle?
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Video hyperlinking: libraries and tools for threading and visualizing large video collection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Community as a connector: associating faces with celebrity names in web videos.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

The Shanghai-Hongkong Team at MediaEval2012: Violent Scene Detection Using Trajectory-based Features.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Trajectory-Based Modeling of Human Actions with Motion Reference Points.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Beyond search: Event-driven summarization for web videos.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Concept-Driven Multi-Modality Fusion for Video Search.
IEEE Trans. Circuits Syst. Video Technol., 2011

Tracking Web Video Topics: Discovery, Visualization, and Monitoring.
IEEE Trans. Circuits Syst. Video Technol., 2011

Mining Event Structures from Web Videos.
IEEE Multim., 2011

VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

On the pooling of positive examples with ontology for visual concept learning.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Context-based friend suggestion in online photo-sharing community.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Cross media hyperlinking for search topic browsing.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards textually describing complex video contents with audio-visual concept classifiers.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Galaxy browser: exploratory search of web videos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Fusing heterogeneous modalities for video and image re-ranking.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

2010
On the Annotation of Web Videos by Efficient Near-Duplicate Search.
IEEE Trans. Multim., 2010

Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study.
IEEE Trans. Multim., 2010

Efficient Mining of Multiple Partial Near-Duplicate Alignments by Temporal Network.
IEEE Trans. Circuits Syst. Video Technol., 2010

Data-Driven Approaches to Community-Contributed Video Applications.
IEEE Multim., 2010

PageSense: style-wise web page advertising.
Proceedings of the 19th International Conference on World Wide Web, 2010

VIREO at TRECVID 2010: Semantic Indexing, Known-Item Search, and Content-Based Copy Detection.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Topical summarization of web videos by visual-text time-dependent alignment.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Trajectory-based visualization of web video topics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Semantic context modeling with maximal margin Conditional Random Fields for automatic image annotation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Automatic Generation of Semantic Fields for Annotating Web Images.
Proceedings of the COLING 2010, 2010

On the sampling of web images for learning visual concept classifiers.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Co-reranking by mutual reinforcement for image search.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Coherent bag-of audio words model for efficient large-scale video copy detection.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009
Video Summarization.
Proceedings of the Encyclopedia of Database Systems, 2009

Video Shot Detection.
Proceedings of the Encyclopedia of Database Systems, 2009

Real-Time Near-Duplicate Elimination for Web Video Search With Content and Context.
IEEE Trans. Multim., 2009

Scale-Rotation Invariant Pattern Entropy for Keypoint-Based Near-Duplicate Detection.
IEEE Trans. Image Process., 2009

Localized matching using Earth Mover's Distance towards discovery of common patterns from small image samples.
Image Vis. Comput., 2009

Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval.
Comput. Vis. Image Underst., 2009

VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Towards google challenge: combining contextual and social information for web video categorization.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Localizing volumetric motion for action recognition in realistic videos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Scalable detection of partial near-duplicate videos by visual-temporal consistency.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Semantic context transfer across heterogeneous sources for domain adaptive video search.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event driven summarization for web videos.
Proceedings of the first SIGMM workshop on Social media, 2009

Distribution-based concept selection for concept-based video retrieval.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Large-scale near-duplicate web video search: Challenge and opportunity.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Domain adaptive semantic diffusion for large scale context-based video annotation.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

A revisit of Generative Model for Automatic Image Annotation using Markov Random Fields.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Exploring inter-concept relationship with context space for semantic video indexing.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008
Video Databases.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

Multimodal News Story Clustering With Pairwise Visual Near-Duplicate Constraint.
IEEE Trans. Multim., 2008

Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces.
IEEE Trans. Multim., 2008

Simulating a Smartboard by Real-Time Gesture Detection in Lecture Videos.
IEEE Trans. Multim., 2008

Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis.
Pattern Recognit., 2008

Measuring novelty and redundancy with multiple modalities in cross-lingual broadcast news.
Comput. Vis. Image Underst., 2008

Beyond Semantic Search: What You Observe May Not Be What You Think.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Bag-of-visual-words expansion using visual relatedness for video indexing.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Fusing semantics, observability, reliability and diversity of concept detectors for video search.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Video event detection using motion relativity and visual relatedness.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Accelerating near-duplicate video matching by combining visual similarity and alignment distortion.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Modeling video hyperlinks with hypergraph for web video reranking.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Ontology-based visual word matching for near-duplicate retrieval.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning.
IEEE Trans. Multim., 2007

Lecture Video Enhancement and Editing by Integrating Posture, Gesture, and Text.
IEEE Trans. Multim., 2007

Moving-Object Detection, Association, and Selection in Home Videos.
IEEE Trans. Multim., 2007

OM-based video shot retrieval by one-to-one matching.
Multim. Tools Appl., 2007

Introduction: special issue for the selected papers in the fourth international conference on Intelligent Multimedia Computing and Networking (IMMCN) 2005.
Multim. Tools Appl., 2007

Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Mining Multiple Visual Appearances of Semantics for Image Annotation.
Proceedings of the Advances in Multimedia Modeling, 2007

Practical elimination of near-duplicates from web video search.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Novelty detection for cross-lingual news stories with visual duplicates and speech transcripts.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Ontology-enriched semantic space for video search.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Rushes video summarization by object and event understanding.
Proceedings of the 1st ACM Workshop on Video Summarization, 2007

Evaluating bag-of-visual-words representations in scene classification.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Efficient Near-Duplicate Keyframe Retrieval with Visual Language Models.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Near-duplicate keyframe retrieval with visual keywords and semantic context.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Towards optimal bag-of-features for object categorization and semantic video retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Clip-based similarity measure for query-dependent clip retrieval and video summarization.
IEEE Trans. Circuits Syst. Video Technol., 2006

Threading and autodocumenting news videos: a promising solution to rapidly browse news topics.
IEEE Signal Process. Mag., 2006

Gestalt-based feature similarity measure in trademark database.
Pattern Recognit., 2006

Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Audio similarity measure by graph modeling and matching.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Prediction-Based Gesture Detection in Lecture Videos by Combining Visual, Speech and Electronic Slides.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Hierarchical Hidden Markov Model for Rushes Structuring and Indexing.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

2005
Video summarization and scene detection by graph modeling.
IEEE Trans. Circuits Syst. Video Technol., 2005

Selective object stabilization for home video consumers.
IEEE Trans. Consumer Electron., 2005

Video text detection and segmentation for optical character recognition.
Multim. Syst., 2005

Motion Driven Approaches to Shot Boundary Detection, Low-Level Feature Extraction and BBC Rushes Characterization at TRECVID 2005.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Exploiting self-adaptive posture-based focus estimation for lecture video editing.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Co-Clustering of Time-Evolving News Story with Transcript and Keyframe.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Common Pattern Discovery Using Earth Mover's Distance and Local Flow Maximization.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Hot Event Detection and Summarization by Graph Modeling and Matching.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

EMD-Based Video Clip Retrieval by Many-to-Many Matching.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

Multibiometrics Based on Palmprint and Handgeometry.
Proceedings of the 4th Annual ACIS International Conference on Computer and Information Science (ICIS 2005), 2005

2004
Indexing and matching of polyphonic songs for query-by-singing system.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Clip-based similarity measure for hierarchical video retrieval.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Structuring home video by snippet detection and pattern parsing.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Gesture Tracking and Recognition for Lecture Video Editing.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Deformable Geometry Model Matching by Topological and Geometric Signatures.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Novel Seed Selection for Multiple Objects Detection and Tracking.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

ICA-FX Features for Classification of Singing Voice and Instrumental Sound.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Graph Based Image Matching.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A robust method for recovering geometric proxy from multiple panoramic images.
Proceedings of the 2004 International Conference on Image Processing, 2004

Deformable Object Model Matching by Topological and Geometric Similarity.
Proceedings of the 2004 Computer Graphics International (CGI 2004), 16-19 June 2004, 2004

2003
Motion analysis and segmentation through spatio-temporal slices processing.
IEEE Trans. Image Process., 2003

Synchronization of lecture videos and electronic slides by video text analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A robust dissolve detector by support vector machine.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Trifocal Morphing.
Proceedings of the Seventh International Conference on Information Visualization, 2003

Structuring lecture videos for distance learning applications.
Proceedings of the Fifth International Symposium on Multimedia Software Engineering, 2003

Video clip retrieval by maximal matching and optimal matching in graph theory.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Automatic Video Summarization by Graph Modeling.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Detection of Documentary Scene Changes by Audio-Visual Fusion.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003

2002
On clustering and retrieval of video shots through temporal slices analysis.
IEEE Trans. Multim., 2002

Motion-Based Video Representation for Scene Change Detection.
Int. J. Comput. Vis., 2002

Motion Retrieval by Temporal Slices Analysis.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Detection of slide transition for topic indexing.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

2001
Video partitioning by temporal slice coherency.
IEEE Trans. Circuits Syst. Video Technol., 2001

Exploiting image indexing techniques in DCT domain.
Pattern Recognit., 2001

Recent Advances in Content-Based Video Analysis.
Int. J. Image Graph., 2001

On clustering and retrieval of video shots.
Proceedings of the 9th ACM International Conference on Multimedia 2001, Ottawa, Ontario, Canada, September 30, 2001

Integrating color and spatial features for content-based video retrieval.
Proceedings of the 2001 International Conference on Image Processing, 2001

2000
Motion-Based Video Representation for Scene Change Detection.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

1999
Motion tracking of human mouth by generalized deformable models.
Pattern Recognit. Lett., 1999

Camera Break Detection by Partitioning of 2D Spatio-Temporal Images in MPEG Domain.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Detection of Gradual Transitions through Temporal Slice Analysis.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

1996
Experiments on Routing, Filtering and Chinese Text Retrieval in TREC-5.
Proceedings of The Fifth Text REtrieval Conference, 1996

Tracking of deformable contours by synthesis and match.
Proceedings of the 13th International Conference on Pattern Recognition, 1996
