Shuqiang Jiang

Proceedings of the 40th International Conference on Data Engineering, ICDE 2024, 2024

Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

An Interactive Navigation Method with Effect-oriented Affordance.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Category Agnostic Model for Visual Rearrangment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Large Scale Visual Food Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Vision-based fruit recognition via multi-scale attention CNN.

[BibT_eX]

[DOI]

Comput. Electron. Agric., July, 2023

Focus and Align: Learning Tube Tokens for Video-Language Pre-Training.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Multi-Object Navigation Using Potential Target Position Policy Function.

[BibT_eX]

[DOI]

Haitao Zeng

IEEE Trans. Image Process., 2023

MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Composite Object Relation Modeling for Few-Shot Scene Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Ingredient Prediction via Context Learning Network With Class-Adaptive Asymmetric Loss.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Dataset Bias in Few-Shot Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2023

From Plate to Production: Artificial Intelligence in Modern Consumer-Driven Food Systems.

[BibT_eX]

[DOI]

CoRR, 2023

CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Generating Explanations for Embodied Action Decision from Visual Observation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Long-Short Term Policy for Visual Object Navigation.

[BibT_eX]

[DOI]

IROS, 2023

A Cross-direction Task Decoupling Network for Small Logo Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

GridMM: Grid Memory Map for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Layout-based Causal Inference for Object Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi-Level Meta-Learning for Few-Shot Domain Generalization.

[BibT_eX]

[DOI]

Xiaorong Qin

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

LogoDet-3K: A Large-scale Image Dataset for Logo Detection.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

Amorphous Region Context Modeling for Scene Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Applications of knowledge graphs for food science and industry.

[BibT_eX]

[DOI]

Patterns, 2022

A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Generative Meta-Adversarial Network for Unseen Object Navigation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Attribute-Guided Feature Learning for Few-Shot Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

Plant Disease Recognition: A Large-Scale Benchmark Dataset and a Visual Region and Loss Reweighting Approach.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Vision-Based Food Analysis for Automatic Dietary Assessment.

[BibT_eX]

[DOI]

CoRR, 2021

Towards Building a Food Knowledge Graph for Internet of Food.

[BibT_eX]

[DOI]

Chunlin Liu

CoRR, 2021

Rethinking Ranking-based Loss Functions: Only Penalizing Negative Instances before Positive Ones is Enough.

[BibT_eX]

[DOI]

CoRR, 2021

See More for Scene: Pairwise Consistency Learning for Scene Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ION: Instance-level Object Navigation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

AIxFood'21: 3rd Workshop on AIxFood.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Hierarchical Object-to-Zone Graph for Object Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Few-shot Food Recognition via Multi-view Representation Learning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

Learning Scene Attribute for Scene Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

A Two-Stage Triplet Network Training Framework for Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2020

Food Recommendation: Framework, Existing Solutions, and Challenges.

[BibT_eX]

[DOI]

Ramesh C. Jain

IEEE Trans. Multim., 2020

Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Multi-Task Deep Relative Attribute Learning for Visual Urban Perception.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Scene Recognition With Prototype-Agnostic Scene Layout.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Multifaceted Analysis of Fine-Tuning in a Deep Model for Visual Recognition.

[BibT_eX]

[DOI]

Trans. Data Sci., 2020

融合三元卷积神经网络与关系网络的小样本食品图像识别 (Few-shot Food Recognition Combining Triplet Convolutional Neural Network with Relation Network).

[BibT_eX]

[DOI]

计算机科学, 2020

Deep neural networks for emerging multimedia computing and applications.

[BibT_eX]

[DOI]

Neurocomputing, 2020

An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Food Computing for Multimedia.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Expressional Region Retrieval.

[BibT_eX]

[DOI]

Xiaoqian Guo

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-attention Meta Learning for Few-shot Fine-grained Image Recognition.

[BibT_eX]

[DOI]

Chenlong Liu

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Logo-2K+: A Large-Scale Logo Dataset for Scalable Logo Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Deep Patch Representations with Shared Codebook for Scene Classification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2019

Know More Say Less: Image Captioning Based on Scene Graphs.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Hierarchy-Dependent Cross-Platform Multi-View Feature Learning for Venue Category Prediction.

[BibT_eX]

[DOI]

Shuhuan Mei

IEEE Trans. Multim., 2019

Learning Effective RGB-D Representations for Scene Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Class Agnostic Image Common Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Instance-level object retrieval via deep region CNN.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

Hybrid incremental learning of new data and new classes for hand-held object recognition.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2019

A Survey on Food Computing.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2019

Attention-based Densely Connected LSTM for Video Captioning.

[BibT_eX]

[DOI]

Yongqing Zhu

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Aberrance-aware Gradient-sensitive Attentions for Scene Recognition with RGB-D Videos.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

MUCH: Mutual Coupling Enhancement of Scene Recognition and Dense Captioning.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Real-Time Scene Recognition System Based on RGB-D Video Streams.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimodal Interaction, 2019

Learning Object Context for Dense Captioning.

[BibT_eX]

[DOI]

Jungong Han

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Bundled Object Context for Referring Expressions.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Food recognition and recipe analysis: integrating visual content, context and external knowledge.

[BibT_eX]

[DOI]

CoRR, 2018

Focal Loss for Region Proposal Network.

[BibT_eX]

[DOI]

Chengpeng Chen

Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

Session details: Grand Challenge-2.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Grand Challenge-1.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring).

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Novel Approach of Multiple Objects Segmentation Based on Graph Cut.

[BibT_eX]

[DOI]

Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

One-shot learning for RGB-D hand-held object recognition.

[BibT_eX]

[DOI]

Jian Sun

Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Deep Structured Learning for Visual Relationship Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2017

Modeling Restaurant Context for Food Recognition.

[BibT_eX]

[DOI]

Ruihan Xu

IEEE Trans. Multim., 2017

Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

Guest Editorial: Knowledge-Based Multimedia Computing.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2017

A survey on context-aware mobile visual recognition.

[BibT_eX]

[DOI]

Multim. Syst., 2017

Guest editorial: mobile visual tagging with mobile context.

[BibT_eX]

[DOI]

Multim. Syst., 2017

RGB-D Scene Recognition with Object-to-Object Relation.

[BibT_eX]

[DOI]

Chengpeng Chen

Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Combining Models from Multiple Sources for RGB-D Scene Recognition.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Dual Track Multimodal Automatic Learning through Human-Robot Interaction.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Visual relationship detection with object spatial distribution.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Keyword-driven image captioning via Context-dependent Bilateral LSTM.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Indoor RGB-D Object Detection with the Guidance of Hand-Held Objects.

[BibT_eX]

[DOI]

Proceedings of the Internet Multimedia Computing and Service, 2017

ISIA at the ImageCLEF 2017 Image Caption Task.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2017, 2017

Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Category co-occurrence modeling for large scale scene recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2016

Scalable storyboards in handheld devices: applications and evaluation metrics.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Guest Editorial: Image Analysis and Processing Leveraging Additional Information.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Online web video topic detection and tracking with semi-supervised learning.

[BibT_eX]

[DOI]

Multim. Syst., 2016

Where and What to Eat: Simultaneous Restaurant and Dish Recognition from Food Image.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Learning to Recognize Hand-Held Objects from Scratch.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Heterogeneous Convolutional Neural Networks for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Image Captioning with both Object and Scene Information.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

RGB-D scene classification via heterogeneous model fusion.

[BibT_eX]

[DOI]

Xinda Liu

Xueming Wang

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

RGB-D Object Recognition from Hand-Held Object Teaching.

[BibT_eX]

[DOI]

Leixian Qiao

Xue Li

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Scene Recognition with CNNs: Objects, Scales and Dataset Bias.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Joint Learning of CNN and LSTM for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of CLEF 2016, 2016

2015

INSTRE: A New Benchmark for Instance-Level Object Retrieval and Recognition.

[BibT_eX]

[DOI]

Shuang Wang

ACM Trans. Multim. Comput. Commun. Appl., 2015

Geolocalized Modeling for Dish Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Polysemious visual representation based on feature aggregation for large scale image applications.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2015

LSH-based semantic dictionary learning for large scale image understanding.

[BibT_eX]

[DOI]

Chenggang Clarence Yan

J. Vis. Commun. Image Represent., 2015

RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2015

Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval.

[BibT_eX]

[DOI]

Neurocomputing, 2015

Online learning affinity measure with CovBoost for multi-target tracking.

[BibT_eX]

[DOI]

Neurocomputing, 2015

The Face Object based HEVC System for Video Call.

[BibT_eX]

[DOI]

EAI Endorsed Trans. Future Intell. Educ. Environ., 2015

Rich Image Description Based on Regions.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Hand-Object Sense: A Hand-held Object Recognition System Based on RGB-D Information.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

A probabilistic model for food image recognition in restaurants.

[BibT_eX]

[DOI]

Ruihan Xu

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Joint multi-feature spatial context for scene recognition in the semantic manifold.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Preface: Internet multimedia computing and service.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2014

Relative image similarity learning with contextual information for Internet cross-media retrieval.

[BibT_eX]

[DOI]

Multim. Syst., 2014

Fusing multi-cues description for partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Chenggang Clarence Yan

J. Vis. Commun. Image Represent., 2014

Multipath Convolutional-Recursive Neural Networks for Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Information Processing VII, 2014

Cross media topic analytics based on synergetic content and user behavior modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Graph-Density-based visual word vocabulary for image retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Multiple Feature Fusion Based Hand-held Object Recognition with RGB-D data.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Region annotations in hashing based image retrieval.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Semantic Features for Food Image Recognition with Geo-Constraints.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Large scale image understanding with non-convex multi-task learning.

[BibT_eX]

[DOI]

Proceedings of the 2014 5th International Conference on Game Theory for Networks, 2014

Accuracy and Specificity Trade-off in k -nearest Neighbors Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2013

SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2013

Weighted visual vocabulary to balance the descriptive ability on general dataset.

[BibT_eX]

[DOI]

Yi Xie

Neurocomputing, 2013

Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching.

[BibT_eX]

[DOI]

IEEE Multim., 2013

Block-Based Face Recognition Using WLD.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Cross Concept Local Fisher Discriminant Analysis for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Flexible navigation in smartphones and tablets using scalable storyboards.

[BibT_eX]

[DOI]

Shuai Zheng

Proceedings of the International Conference on Multimedia Retrieval, 2013

ObjectSense: a scalable multi-objects recognition system based on partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Multimedia Retrieval, 2013

Cross-media topic detection associated with hot search queries.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

MIAR ICT Participation at Robot Vision 2013.

[BibT_eX]

[DOI]

Proceedings of the Working Notes for CLEF 2013 Conference , 2013

2012

S<sup>3</sup>MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

@ICT: attention-based virtual content insertion.

[BibT_eX]

[DOI]

Multim. Syst., 2012

Online selection of the best k-feature subset for object tracking.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2012

Nearest-neighbor method using multiple neighborhood similarities for social media data mining.

[BibT_eX]

[DOI]

Neurocomputing, 2012

Improving Image Distance Metric Learning by Embedding Semantic Relations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Effective Comic-Like Representations with Embedded Regions of Interest.

[BibT_eX]

[DOI]

Huiying Liu

Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Novel Framework for Web Video Thumbnail Generation.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Color Maximal-Dissimilarity Pattern for pedestrian detection.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Pattern Recognition, 2012

Cross community news event summary generation based on collaborative ranking.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Multi-feature metric learning with knowledge transfer among semantics and social tagging.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2011

Special edition on semi-supervised learning for visual content analysis and understanding.

[BibT_eX]

[DOI]

Pattern Recognit., 2011

Human group activity analysis with fusion of motion and appearance information.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Detection and location of near-duplicate video sub-clips by finding dense subgraphs.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

News video story sentiment classification and ranking.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Matching Content-based Saliency Regions for partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Human tracking by structured body parts.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Query sensitive dynamic web video thumbnail generation.

[BibT_eX]

[DOI]

Chunxi Liu

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Online Vicept learning for web-scale image understanding.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Fast common visual pattern detection via radiate geometric model.

[BibT_eX]

[DOI]

Lingyang Chu

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Content-based intelligent video recorder with its implementation on sports video.

[BibT_eX]

[DOI]

Zhao Zhao

Proceedings of the ICIMCS 2011, 2011

Treat samples differently: Object tracking with semi-supervised online CovBoost.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2011

Learning image Vicept description via mixed-norm regularization for large scale semantic image search.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Efficient l<sub>p</sub>-norm multiple feature metric learning for image categorization.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010

Affective Visualization and Retrieval for Music Video.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2010

Building contextual visual vocabulary for large-scale image applications.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Memory matrix: a novel user experience for home video.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Vicept: link visual features to concepts for large-scale image understanding.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

S3MKL: scalable semi-supervised multiple kernel learning for image data mining.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Nearest-neighbor classification using unlabeled data for real world image application.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

The third eye: mining the visual cognition across multi-language communities.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multiple Kernel Learning with High Order Kernels.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Action Recognition Using Spatial-Temporal Context.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Group Activity Recognition by Gaussian Processes Estimation.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Bridging the gap between objective score and subjective preference in video quality assessment.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Event based news video people classification and ranking using multimodality features.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Fast copy detection based on Slice Entropy Scattergraph.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

A close-up detection method for movies.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2010

Multi-description of local interest point for partial-duplicate image retrieval.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2010

Novel observation model for probabilistic object tracking.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Event Tactic Analysis Based on Broadcast Sports Video.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2009

A framework for flexible summarization of racquet sports video using multiple modalities.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2009

Video Shrinking by Auditory and Visual Cues.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2009

Friend recommendation according to appearances on photos.

[BibT_eX]

[DOI]

Zhipeng Wu

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Near-duplicate video matching with transformation recognition.

[BibT_eX]

[DOI]

Zhipeng Wu

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Visual ContextRank for web image re-ranking.

[BibT_eX]

[DOI]

Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Robust copy detection by mining temporal self-similarities.

[BibT_eX]

[DOI]

Zhipeng Wu

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Spatial-temporal video browsing for mobile environment based on visual attention analysis.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Advertise gently - in-image advertising with low intrusiveness.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2009

Personalized online video recommendation by neighborhood score propagation based global ranking.

[BibT_eX]

[DOI]

Chunxi Liu

Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Transfer pedestrian detector towards view-adaptiveness and efficiency.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Content-Based Video Semantic Analysis.

[BibT_eX]

[DOI]

Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

Semantic Classification and Annotation of Images.

[BibT_eX]

[DOI]

Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

2008

Unsupervised texture classification: Automatically discover and classify texture patterns.

[BibT_eX]

[DOI]

Image Vis. Comput., 2008

Personalized MTV Affective Analysis Using User Profile.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2008

Detecting Violent Scenes in Movies by Auditory and Visual Cues.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing, 2008

A Two-Stage Approach to Highlight Extraction in Sports Video by Using AdaBoost and Multi-modal.

[BibT_eX]

[DOI]

Shaojie Cai

Proceedings of the Advances in Multimedia Information Processing, 2008

i.MTV: an integrated system for mtv affective analysis.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

A generic virtual content insertion system based on visual attention analysis.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Naming faces in broadcast news video by image google.

[BibT_eX]

[DOI]

Chunxi Liu

Proceedings of the 16th International Conference on Multimedia 2008, 2008

Matching images more efficiently with local descriptors.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Effective scene matching with local feature representatives.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Human reappearance detection based on on-line learning.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Affective MTV analysis based on arousal and valence features.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

A pixel-wise local information-based background subtraction approach.

[BibT_eX]

[DOI]

Zhong Wei

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Spatial-temporal attention analysis for home video.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Coarse-to-fine video text detection.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Lower attentive region detection for virtual content insertion in broadcast video.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Shot classification for action movies based on motion characteristics.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

Pedestrian detection via logistic multiple instance boosting.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

Fast and effective text detection.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

Object tracking using incremental 2D-LDA learning and Bayes inference.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

People re-detection using Adaboost with sift and color correlogram.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2008

Visual-aural attention modeling for talk show video highlight detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection.

[BibT_eX]

[DOI]

Junbiao Pang

Proceedings of the Computer Vision, 2008

2007

Trajectory based event tactics analysis in broadcast sports video.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Multimedia 2007, 2007

Region-based visual attention analysis with its application in image browsing on small displays.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Multimedia 2007, 2007

Story Unit Segmentation with Friendly Acoustic Perception.

[BibT_eX]

[DOI]

Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Highlight Ranking for Racquet Sports Video in User Attention Subspaces Based on Relevance Feedback.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

An Effective Local Invariant Descriptor Combining Luminance and Color Information.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

The Demo: A Real-Time Score Detection and Recognition Approach in Broadcast Basketball Sports Video.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis.

[BibT_eX]

[DOI]

Wen Gao

Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Monocular Tracking 3D People By Gaussian Process Spatio-Temporal Variable Model.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2007

Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2007

2006

An effective method to detect and categorize digitized traditional Chinese paintings.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2006

JDL at TRECVID 2006 Shot Boundary Detection.

[BibT_eX]

[DOI]

Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Highlight Summarization in Sports Video Based on Replay Detection.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

2005

Visual Ontology Construction for Digitized Art Image Retrieval.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2005

Exciting event detection in broadcast soccer video with mid-level description and incremental learning.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video2Cartoon: generating 3D cartoon from broadcast soccer video.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM International Conference on Multimedia, 2005

Playfield Detection Using Adaptive GMM and Its Application.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

An Ontology-based Approach to Retrieve Digitized Art Images.

[BibT_eX]

[DOI]

Tiejun Huang

Wen Gao

Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2004), 2004

Categorizing Traditional Chinese Painting Images.

[BibT_eX]

[DOI]