Shuqiang Jiang

Orcid: 0000-0002-1596-4326

According to our database, Shuqiang Jiang authored at least 261 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Lightweight Food Recognition via Aggregation Block and Feature Encoding.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024

Multi-state Ingredient Recognition via Adaptive Multi-centric Network.
IEEE Trans. Ind. Informatics, April, 2024

Deep Learning for Logo Detection: A Survey.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024

Towards Food Image Retrieval via Generalization-Oriented Sampling and Loss Function Design.
ACM Trans. Multim. Comput. Commun. Appl., January, 2024

Synthesizing Knowledge-Enhanced Features for Real-World Zero-Shot Food Detection.
IEEE Trans. Image Process., 2024

Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition.
IEEE Trans. Image Process., 2024

Preface to the Special Issue on Multimodal Collaborative Perception and Fusion Technology.
Int. J. Softw. Informatics, 2024

FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination.
CoRR, 2024

Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation.
CoRR, 2024

DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model.
CoRR, 2024

Editorial to the Special Issue on Image Content Understanding.
Comput. Informatics, 2024

Deep learning model for the automated detection and classification of central canal and neural foraminal stenosis upon cervical spine magnetic resonance imaging.
BMC Medical Imaging, 2024

Food Computing for Nutrition and Health.
Proceedings of the 40th International Conference on Data Engineering, ICDE 2024, 2024

Imagine Before Go: Self-Supervised Generative Map for Object Goal Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

An Interactive Navigation Method with Effect-oriented Affordance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Category Agnostic Model for Visual Rearrangement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Large Scale Visual Food Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Vision-based fruit recognition via multi-scale attention CNN.
Comput. Electron. Agric., July, 2023

Focus and Align: Learning Tube Tokens for Video-Language Pre-Training.
IEEE Trans. Multim., 2023

Multi-Object Navigation Using Potential Target Position Policy Function.
IEEE Trans. Image Process., 2023

MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge.
IEEE Trans. Image Process., 2023

Composite Object Relation Modeling for Few-Shot Scene Recognition.
IEEE Trans. Image Process., 2023

Ingredient Prediction via Context Learning Network With Class-Adaptive Asymmetric Loss.
IEEE Trans. Image Process., 2023

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection.
IEEE Trans. Image Process., 2023

Dataset Bias in Few-Shot Image Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

From Plate to Production: Artificial Intelligence in Modern Consumer-Driven Food Systems.
CoRR, 2023

CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Generating Explanations for Embodied Action Decision from Visual Observation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Long-Short Term Policy for Visual Object Navigation.
IROS, 2023

A Cross-direction Task Decoupling Network for Small Logo Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

GridMM: Grid Memory Map for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Layout-based Causal Inference for Object Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bi-Level Meta-Learning for Few-Shot Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
LogoDet-3K: A Large-scale Image Dataset for Logo Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Amorphous Region Context Modeling for Scene Recognition.
IEEE Trans. Multim., 2022

Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction.
IEEE Trans. Image Process., 2022

Applications of knowledge graphs for food science and industry.
Patterns, 2022

A Noise-robust Locality Transformer for Fine-grained Food Image Retrieval.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Generative Meta-Adversarial Network for Unseen Object Navigation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Attribute-Guided Feature Learning for Few-Shot Image Recognition.
IEEE Trans. Multim., 2021

Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction.
IEEE Trans. Multim., 2021

Plant Disease Recognition: A Large-Scale Benchmark Dataset and a Visual Region and Loss Reweighting Approach.
IEEE Trans. Image Process., 2021

Vision-Based Food Analysis for Automatic Dietary Assessment.
CoRR, 2021

Towards Building a Food Knowledge Graph for Internet of Food.
CoRR, 2021

Rethinking Ranking-based Loss Functions: Only Penalizing Negative Instances before Positive Ones is Enough.
CoRR, 2021

See More for Scene: Pairwise Consistency Learning for Scene Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

ION: Instance-level Object Navigation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

AIxFood'21: 3rd Workshop on AIxFood.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

What If We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Hierarchical Object-to-Zone Graph for Object Navigation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Few-shot Food Recognition via Multi-view Representation Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Learning Scene Attribute for Scene Recognition.
IEEE Trans. Multim., 2020

A Two-Stage Triplet Network Training Framework for Image Retrieval.
IEEE Trans. Multim., 2020

Food Recommendation: Framework, Existing Solutions, and Challenges.
IEEE Trans. Multim., 2020

Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition.
IEEE Trans. Image Process., 2020

Multi-Task Deep Relative Attribute Learning for Visual Urban Perception.
IEEE Trans. Image Process., 2020

Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition.
IEEE Trans. Image Process., 2020

Scene Recognition With Prototype-Agnostic Scene Layout.
IEEE Trans. Image Process., 2020

Multifaceted Analysis of Fine-Tuning in a Deep Model for Visual Recognition.
Trans. Data Sci., 2020

Few-shot Food Recognition Combining Triplet Convolutional Neural Network with Relation Network.
Computer Science (计算机科学), 2020

Deep neural networks for emerging multimedia computing and applications.
Neurocomputing, 2020

An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Food Computing for Multimedia.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Expressional Region Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-attention Meta Learning for Few-shot Fine-grained Image Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Logo-2K+: A Large-Scale Logo Dataset for Scalable Logo Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Deep Patch Representations with Shared Codebook for Scene Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Know More Say Less: Image Captioning Based on Scene Graphs.
IEEE Trans. Multim., 2019

Hierarchy-Dependent Cross-Platform Multi-View Feature Learning for Venue Category Prediction.
IEEE Trans. Multim., 2019

Learning Effective RGB-D Representations for Scene Recognition.
IEEE Trans. Image Process., 2019

Class Agnostic Image Common Object Detection.
IEEE Trans. Image Process., 2019

Instance-level object retrieval via deep region CNN.
Multim. Tools Appl., 2019

Hybrid incremental learning of new data and new classes for hand-held object recognition.
J. Vis. Commun. Image Represent., 2019

A Survey on Food Computing.
ACM Comput. Surv., 2019

Attention-based Densely Connected LSTM for Video Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Aberrance-aware Gradient-sensitive Attentions for Scene Recognition with RGB-D Videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

MUCH: Mutual Coupling Enhancement of Scene Recognition and Dense Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Ingredient-Guided Cascaded Multi-Attention Network for Food Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

A Real-Time Scene Recognition System Based on RGB-D Video Streams.
Proceedings of the International Conference on Multimodal Interaction, 2019

Learning Object Context for Dense Captioning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
You Are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis.
IEEE Trans. Multim., 2018

Bundled Object Context for Referring Expressions.
IEEE Trans. Multim., 2018

Food recognition and recipe analysis: integrating visual content, context and external knowledge.
CoRR, 2018

Focal Loss for Region Proposal Network.
Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

Session details: Grand Challenge-2.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Grand Challenge-1.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring).
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Novel Approach of Multiple Objects Segmentation Based on Graph Cut.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

One-shot learning for RGB-D hand-held object recognition.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Deep Structured Learning for Visual Relationship Detection.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration.
IEEE Trans. Multim., 2017

Modeling Restaurant Context for Food Recognition.
IEEE Trans. Multim., 2017

Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold.
IEEE Trans. Image Process., 2017

Modality-specific and hierarchical feature learning for RGB-D hand-held object recognition.
Multim. Tools Appl., 2017

Guest Editorial: Knowledge-Based Multimedia Computing.
Multim. Tools Appl., 2017

A survey on context-aware mobile visual recognition.
Multim. Syst., 2017

Guest editorial: mobile visual tagging with mobile context.
Multim. Syst., 2017

RGB-D Scene Recognition with Object-to-Object Relation.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Combining Models from Multiple Sources for RGB-D Scene Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Dual Track Multimodal Automatic Learning through Human-Robot Interaction.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Visual relationship detection with object spatial distribution.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Keyword-driven image captioning via Context-dependent Bilateral LSTM.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Indoor RGB-D Object Detection with the Guidance of Hand-Held Objects.
Proceedings of the Internet Multimedia Computing and Service, 2017

ISIA at the ImageCLEF 2017 Image Caption Task.
Proceedings of the Working Notes of CLEF 2017, 2017

Depth CNNs for RGB-D Scene Recognition: Learning from Scratch Better than Transferring from RGB-CNNs.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Category co-occurrence modeling for large scale scene recognition.
Pattern Recognit., 2016

Scalable storyboards in handheld devices: applications and evaluation metrics.
Multim. Tools Appl., 2016

Guest Editorial: Image Analysis and Processing Leveraging Additional Information.
Multim. Tools Appl., 2016

Online web video topic detection and tracking with semi-supervised learning.
Multim. Syst., 2016

Where and What to Eat: Simultaneous Restaurant and Dish Recognition from Food Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Learning to Recognize Hand-Held Objects from Scratch.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Heterogeneous Convolutional Neural Networks for Visual Recognition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Image Captioning with both Object and Scene Information.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

RGB-D scene classification via heterogeneous model fusion.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

RGB-D Object Recognition from Hand-Held Object Teaching.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Scene Recognition with CNNs: Objects, Scales and Dataset Bias.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Joint Learning of CNN and LSTM for Image Captioning.
Proceedings of the Working Notes of CLEF 2016, 2016

2015
INSTRE: A New Benchmark for Instance-Level Object Retrieval and Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Geolocalized Modeling for Dish Recognition.
IEEE Trans. Multim., 2015

Polysemious visual representation based on feature aggregation for large scale image applications.
Multim. Tools Appl., 2015

LSH-based semantic dictionary learning for large scale image understanding.
J. Vis. Commun. Image Represent., 2015

RGB-D Hand-Held Object Recognition Based on Heterogeneous Feature Fusion.
J. Comput. Sci. Technol., 2015

Cluster-sensitive Structured Correlation Analysis for Web cross-modal retrieval.
Neurocomputing, 2015

Online learning affinity measure with CovBoost for multi-target tracking.
Neurocomputing, 2015

The Face Object based HEVC System for Video Call.
EAI Endorsed Trans. Future Intell. Educ. Environ., 2015

Rich Image Description Based on Regions.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Hand-Object Sense: A Hand-held Object Recognition System Based on RGB-D Information.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

A probabilistic model for food image recognition in restaurants.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Joint multi-feature spatial context for scene recognition in the semantic manifold.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Preface: Internet multimedia computing and service.
Multim. Tools Appl., 2014

Relative image similarity learning with contextual information for Internet cross-media retrieval.
Multim. Syst., 2014

Fusing multi-cues description for partial-duplicate image retrieval.
J. Vis. Commun. Image Represent., 2014

Multipath Convolutional-Recursive Neural Networks for Object Recognition.
Proceedings of the Intelligent Information Processing VII, 2014

Cross media topic analytics based on synergetic content and user behavior modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Graph-Density-based visual word vocabulary for image retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Multiple Feature Fusion Based Hand-held Object Recognition with RGB-D data.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Region annotations in hashing based image retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Semantic Features for Food Image Recognition with Geo-Constraints.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Large scale image understanding with non-convex multi-task learning.
Proceedings of the 2014 5th International Conference on Game Theory for Networks, 2014

Accuracy and Specificity Trade-off in k-nearest Neighbors Classification.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Robust Spatial Consistency Graph Model for Partial Duplicate Image Retrieval.
IEEE Trans. Multim., 2013

SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently.
IEEE Trans. Circuits Syst. Video Technol., 2013

Weighted visual vocabulary to balance the descriptive ability on general dataset.
Neurocomputing, 2013

Partial-Duplicate Image Retrieval via Saliency-Guided Visual Matching.
IEEE Multim., 2013

Block-Based Face Recognition Using WLD.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Cross Concept Local Fisher Discriminant Analysis for Image Classification.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Flexible navigation in smartphones and tablets using scalable storyboards.
Proceedings of the International Conference on Multimedia Retrieval, 2013

ObjectSense: a scalable multi-objects recognition system based on partial-duplicate image retrieval.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Cross-media topic detection associated with hot search queries.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

MIAR ICT Participation at Robot Vision 2013.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

2012
S3MKL: Scalable Semi-Supervised Multiple Kernel Learning for Real-World Image Applications.
IEEE Trans. Multim., 2012

Learning Hierarchical Semantic Description Via Mixed-Norm Regularization for Image Understanding.
IEEE Trans. Multim., 2012

@ICT: attention-based virtual content insertion.
Multim. Syst., 2012

Online selection of the best k-feature subset for object tracking.
J. Vis. Commun. Image Represent., 2012

Nearest-neighbor method using multiple neighborhood similarities for social media data mining.
Neurocomputing, 2012

Improving Image Distance Metric Learning by Embedding Semantic Relations.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Effective Comic-Like Representations with Embedded Regions of Interest.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Novel Framework for Web Video Thumbnail Generation.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Color Maximal-Dissimilarity Pattern for pedestrian detection.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Cross community news event summary generation based on collaborative ranking.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Multi-feature metric learning with knowledge transfer among semantics and social tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Transferring Boosted Detectors Towards Viewpoint and Scene Adaptiveness.
IEEE Trans. Image Process., 2011

Special edition on semi-supervised learning for visual content analysis and understanding.
Pattern Recognit., 2011

Human group activity analysis with fusion of motion and appearance information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Detection and location of near-duplicate video sub-clips by finding dense subgraphs.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

News video story sentiment classification and ranking.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Matching Content-based Saliency Regions for partial-duplicate image retrieval.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Human tracking by structured body parts.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Query sensitive dynamic web video thumbnail generation.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Online Vicept learning for web-scale image understanding.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Fast common visual pattern detection via radiate geometric model.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Content-based intelligent video recorder with its implementation on sports video.
Proceedings of the ICIMCS 2011, 2011

Treat samples differently: Object tracking with semi-supervised online CovBoost.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Learning image Vicept description via mixed-norm regularization for large scale semantic image search.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Efficient l_p-norm multiple feature metric learning for image categorization.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Affective Visualization and Retrieval for Music Video.
IEEE Trans. Multim., 2010

Building contextual visual vocabulary for large-scale image applications.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Memory matrix: a novel user experience for home video.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Vicept: link visual features to concepts for large-scale image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

S3MKL: scalable semi-supervised multiple kernel learning for image data mining.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Nearest-neighbor classification using unlabeled data for real world image application.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

The third eye: mining the visual cognition across multi-language communities.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Adding Affine Invariant Geometric Constraint for Partial-Duplicate Image Retrieval.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multiple Kernel Learning with High Order Kernels.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Action Recognition Using Spatial-Temporal Context.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Group Activity Recognition by Gaussian Processes Estimation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Bridging the gap between objective score and subjective preference in video quality assessment.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Event based news video people classification and ranking using multimodality features.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Fast copy detection based on Slice Entropy Scattergraph.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

A close-up detection method for movies.
Proceedings of the International Conference on Image Processing, 2010

Multi-description of local interest point for partial-duplicate image retrieval.
Proceedings of the International Conference on Image Processing, 2010

Novel observation model for probabilistic object tracking.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Event Tactic Analysis Based on Broadcast Sports Video.
IEEE Trans. Multim., 2009

A framework for flexible summarization of racquet sports video using multiple modalities.
Comput. Vis. Image Underst., 2009

Video Shrinking by Auditory and Visual Cues.
Proceedings of the Advances in Multimedia Information Processing, 2009

Friend recommendation according to appearances on photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Near-duplicate video matching with transformation recognition.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Visual ContextRank for web image re-ranking.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Robust copy detection by mining temporal self-similarities.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Spatial-temporal video browsing for mobile environment based on visual attention analysis.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Advertise gently - in-image advertising with low intrusiveness.
Proceedings of the International Conference on Image Processing, 2009

Personalized online video recommendation by neighborhood score propagation based global ranking.
Proceedings of the First International Conference on Internet Multimedia Computing and Service, 2009

Transfer pedestrian detector towards view-adaptiveness and efficiency.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

Content-Based Video Semantic Analysis.
Proceedings of the Semantic Mining Technologies for Multimedia Databases, 2009

Semantic Classification and Annotation of Images.
Proceedings of the Semantic Mining Technologies for Multimedia Databases, 2009

2008
Unsupervised texture classification: Automatically discover and classify texture patterns.
Image Vis. Comput., 2008

Personalized MTV Affective Analysis Using User Profile.
Proceedings of the Advances in Multimedia Information Processing, 2008

Detecting Violent Scenes in Movies by Auditory and Visual Cues.
Proceedings of the Advances in Multimedia Information Processing, 2008

A Two-Stage Approach to Highlight Extraction in Sports Video by Using AdaBoost and Multi-modal.
Proceedings of the Advances in Multimedia Information Processing, 2008

i.MTV: an integrated system for mtv affective analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

A generic virtual content insertion system based on visual attention analysis.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Naming faces in broadcast news video by image google.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Matching images more efficiently with local descriptors.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Effective scene matching with local feature representatives.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Human reappearance detection based on on-line learning.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Affective MTV analysis based on arousal and valence features.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

A pixel-wise local information-based background subtraction approach.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Spatial-temporal attention analysis for home video.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Coarse-to-fine video text detection.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Lower attentive region detection for virtual content insertion in broadcast video.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Shot classification for action movies based on motion characteristics.
Proceedings of the International Conference on Image Processing, 2008

Pedestrian detection via logistic multiple instance boosting.
Proceedings of the International Conference on Image Processing, 2008

Fast and effective text detection.
Proceedings of the International Conference on Image Processing, 2008

Object tracking using incremental 2D-LDA learning and Bayes inference.
Proceedings of the International Conference on Image Processing, 2008

People re-detection using Adaboost with sift and color correlogram.
Proceedings of the International Conference on Image Processing, 2008

Visual-aural attention modeling for talk show video highlight detection.
Proceedings of the IEEE International Conference on Acoustics, 2008

Multiple Instance Boost Using Graph Embedding Based Decision Stump for Pedestrian Detection.
Proceedings of the Computer Vision, 2008

2007
Trajectory based event tactics analysis in broadcast sports video.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Region-based visual attention analysis with its application in image browsing on small displays.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Story Unit Segmentation with Friendly Acoustic Perception.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Highlight Ranking for Racquet Sports Video in User Attention Subspaces Based on Relevance Feedback.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

An Effective Local Invariant Descriptor Combining Luminance and Color Information.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

The Demo: A Real-Time Score Detection and Recognition Approach in Broadcast Basketball Sports Video.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Real-Time Score Detection and Recognition Approach for Broadcast Basketball Video.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Generating Video Sequence from Photo Image for Mobile Screens by Content Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Monocular Tracking 3D People By Gaussian Process Spatio-Temporal Variable Model.
Proceedings of the International Conference on Image Processing, 2007

Mean-Shift Blob Tracking with Adaptive Feature Selection and Scale Adaptation.
Proceedings of the International Conference on Image Processing, 2007

2006
An effective method to detect and categorize digitized traditional Chinese paintings.
Pattern Recognit. Lett., 2006

JDL at TRECVID 2006 Shot Boundary Detection.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Highlight Summarization in Sports Video Based on Replay Detection.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Extracting Story Units in Sports Video Based on Unsupervised Video Scene Clustering.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

2005
Visual Ontology Construction for Digitized Art Image Retrieval.
J. Comput. Sci. Technol., 2005

Exciting event detection in broadcast soccer video with mid-level description and incremental learning.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video2Cartoon: generating 3D cartoon from broadcast soccer video.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Playfield Detection Using Adaptive GMM and Its Application.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
An Ontology-based Approach to Retrieve Digitized Art Images.
Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2004), 2004

Categorizing Traditional Chinese Painting Images.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A new method to segment playfield and its applications in match analysis in sports video.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

