Meng Wang

ORCID: 0000-0002-3094-7735

Affiliations:
  • Hefei University of Technology, China
  • National University of Singapore, School of Computing, Singapore (former)
  • AKiiRA Media Systems Inc., Palo Alto, CA, USA (former)
  • Microsoft Research Asia, Beijing, China (former)
  • University of Science and Technology of China, Hefei, China (PhD 2008)


According to our database, Meng Wang authored at least 785 papers between 2005 and 2025.

Bibliography

2025
Optimizing low-rank adaptation with decomposed matrices and adaptive rank allocation.
Frontiers Comput. Sci., May, 2025

PSVMA+: Exploring Multi-Granularity Semantic-Visual Adaption for Generalized Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

Cut-and-Paste: Subject-driven video editing with attention control.
Neural Networks, 2025

2024
Disentangled Cascaded Graph Convolution Networks for Multi-Behavior Recommendation.
Trans. Recomm. Syst., December, 2024

Hyperbolic Graph Learning for Social Recommendation.
IEEE Trans. Knowl. Data Eng., December, 2024

Say No to Freeloader: Protecting Intellectual Property of Your Deep Model.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition.
Int. J. Comput. Vis., December, 2024

Depth Matters: Spatial Proximity-Based Gaze Cone Generation for Gaze Following in Wild.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Cross-Lingual Cross-Modal Retrieval With Noise-Robust Fine-Tuning.
IEEE Trans. Knowl. Data Eng., November, 2024

Advancing Weakly-Supervised Audio-Visual Video Parsing via Segment-Wise Pseudo Labeling.
Int. J. Comput. Vis., November, 2024

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition.
Int. J. Comput. Vis., November, 2024

Description-Enhanced Label Embedding Contrastive Learning for Text Classification.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Average User-Side Counterfactual Fairness for Collaborative Filtering.
ACM Trans. Inf. Syst., September, 2024

Image Manipulation Detection With Cascade Hierarchical Graph Representation.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

Unpacking the Gap Box Against Data-Free Knowledge Distillation.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Seeking False Hard Negatives for Graph Contrastive Learning.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Channel-Wise Interactive Learning for Remote Heart Rate Estimation From Facial Video.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

AST-GCN: Augmented Spatial Temporal Graph Convolutional Neural Network for Gait Emotion Recognition.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Dual-Path TokenLearner for Remote Photoplethysmography-Based Physiological Measurement With Facial Videos.
IEEE Trans. Comput. Soc. Syst., June, 2024

Neighborhood-Enhanced Supervised Contrastive Learning for Collaborative Filtering.
IEEE Trans. Knowl. Data Eng., May, 2024

Graph-based Text Classification by Contrastive Learning with Text-level Graph Augmentation.
ACM Trans. Knowl. Discov. Data, May, 2024

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Graph Pooling Inference Network for Text-based VQA.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Visual-linguistic-stylistic Triple Reward for Cross-lingual Image Captioning.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Exploring Resolution Fields for Scalable Image Compression With Uncertainty Guidance.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

A Semantic Perception and CNN-Transformer Hybrid Network for Occluded Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Collaborative-Enhanced Prediction of Spending on Newly Downloaded Mobile Games under Consumption Uncertainty.
Dataset, March, 2024

Dynamic Emotional Transition Sampling and Emotional Guidance of Individuals Based on Conversation.
IEEE Trans. Comput. Soc. Syst., February, 2024

Detecting Depression With Heterogeneous Graph Neural Network in Clinical Interview Transcript.
IEEE Trans. Comput. Soc. Syst., February, 2024

A Multi-Scale Fusion and Transformer Based Registration Guided Speckle Noise Reduction for OCT Images.
IEEE Trans. Medical Imaging, January, 2024

SSD-MonoDETR: Supervised Scale-Aware Deformable Transformer for Monocular 3D Object Detection.
IEEE Trans. Intell. Veh., January, 2024

Progressive Stereo Image Dehazing Network via Cross-View Region Interaction.
IEEE Trans. Multim., 2024

Two-Step Discrete Hashing for Cross-Modal Retrieval.
IEEE Trans. Multim., 2024

Partial-Tuning Based Mixed-Modal Prototypes for Few-Shot Classification.
IEEE Trans. Multim., 2024

Embedded Heterogeneous Attention Transformer for Cross-Lingual Image Captioning.
IEEE Trans. Multim., 2024

FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification.
IEEE Trans. Multim., 2024

Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval.
IEEE Trans. Image Process., 2024

Emotional Video Captioning With Vision-Based Emotion Interpretation Network.
IEEE Trans. Image Process., 2024

DeMPAA: Deployable Multi-Mini-Patch Adversarial Attack for Remote Sensing Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2024

Understanding the vulnerability of skeleton-based Human Activity Recognition via black-box attack.
Pattern Recognit., 2024

Confidence-aware multi-modality learning for eye disease screening.
Medical Image Anal., 2024

Enhancing ID-based Recommendation with Large Language Models.
CoRR, 2024

TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt.
CoRR, 2024

Scene-Text Grounding for Text-Based Video Question Answering.
CoRR, 2024

UniLearn: Enhancing Dynamic Facial Expression Recognition through Unified Pre-Training and Fine-Tuning on Images and Videos.
CoRR, 2024

TASAR: Transferable Attack on Skeletal Action Recognition.
CoRR, 2024

MedSAM-U: Uncertainty-Guided Auto Multi-Prompt Adaptation for Reliable MedSAM.
CoRR, 2024

SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement.
CoRR, 2024

Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks.
CoRR, 2024

Rethinking Visual Content Refinement in Low-Shot CLIP Adaptation.
CoRR, 2024

PsycoLLM: Enhancing LLM for Psychological Understanding and Evaluation.
CoRR, 2024

MMAD: Multi-label Micro-Action Detection in Videos.
CoRR, 2024

A Whole-Process Certifiably Robust Aggregation Method Against Backdoor Attacks in Federated Learning.
CoRR, 2024

Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images.
CoRR, 2024

Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases.
CoRR, 2024

Instructing Prompt-to-Prompt Generation for Zero-Shot Learning.
CoRR, 2024

Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption.
CoRR, 2024

Gradually Vanishing Gap in Prototypical Network for Unsupervised Domain Adaptation.
CoRR, 2024

Dual-State Personalized Knowledge Tracing with Emotional Incorporation.
CoRR, 2024

MedRG: Medical Report Grounding with Multi-modal Large Language Model.
CoRR, 2024

Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding.
CoRR, 2024

BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution.
CoRR, 2024

Improving Cognitive Diagnosis Models with Adaptive Relational Graph Neural Networks.
CoRR, 2024

Benchmarking Micro-action Recognition: Dataset, Methods, and Applications.
CoRR, 2024

Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining.
CoRR, 2024

Training-free image style alignment for self-adapting domain shift on handheld ultrasound devices.
CoRR, 2024

Beyond Imitation: Generating Human Mobility from Context-aware Reasoning with Large Language Models.
CoRR, 2024

Label-aware debiased causal reasoning for Natural Language Inference.
AI Open, 2024

Collaborative-Enhanced Prediction of Spending on Newly Downloaded Mobile Games under Consumption Uncertainty.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Mixed Attention Network for Cross-domain Sequential Recommendation.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Lightweight Embeddings for Graph Collaborative Filtering.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Multimodality Invariant Learning for Multimedia-Based New Item Recommendation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement.
Proceedings of the 3rd Vision-based Remote Physiological Signal Sensing Challenge & Workshop (RePSS 2024) co-located with the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

AMTN: Attention-Enhanced Multimodal Temporal Network for Humor Detection.
Proceedings of the 5th on Multimodal Sentiment Analysis Challenge and Workshop: Social Perception and Humor, 2024

Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

A Coarse to Fine Detection Method for Prohibited Object in X-ray Images Based on Progressive Transformer Decoder.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Maskable Retentive Network for Video Moment Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MAC 2024: Micro-Action Analysis Grand Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Pseudo Content Hallucination for Unpaired Image Captioning.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Prototype Learning for Micro-gesture Classification.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Enhancing Large Foundation Models to Identify Fundus Diseases Based on Contrastive Enhanced Low-Rank Adaptation Prompt.
Proceedings of the Ophthalmic Medical Image Analysis - 11th International Workshop, 2024

Path-Specific Causal Reasoning for Fairness-aware Cognitive Diagnosis.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Graph Bottlenecked Social Recommendation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Diffusion-Based Cloud-Edge-Device Collaborative Learning for Next POI Recommendations.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Popularity-Aware Alignment and Contrast for Mitigating Popularity Bias.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

AISA-DG: Automatic Implicit Style-Augmented Domain Generalization on Optic Disc/Cup Segmentation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

OSIC: A New One-Stage Image Captioner Coined.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

DURRNET: Deep Unfolded Single Image Reflection Removal Network with Joint Prior.
Proceedings of the IEEE International Conference on Acoustics, 2024

Label-Anticipated Event Disentanglement for Audio-Visual Video Parsing.
Proceedings of the Computer Vision - ECCV 2024, 2024

Training A Small Emotional Vision Language Model for Visual Art Comprehension.
Proceedings of the Computer Vision - ECCV 2024, 2024

Region-Native Visual Tokenization.
Proceedings of the Computer Vision - ECCV 2024, 2024

Frequency Decoupling for Motion Magnification Via Multi-Level Isomorphic Architecture.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Data-Free Quantization via Pseudo-label Filtering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
GCRec: Graph-Augmented Capsule Network for Next-Item Recommendation.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Additive Feature Attribution Explainable Methods to Craft Adversarial Attacks for Text Classification and Text Regression.
IEEE Trans. Knowl. Data Eng., December, 2023

Verbal-Person Nets: Pose-Guided Multi-Granularity Language-to-Person Generation.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Graph Attention U-Net for Retinal Layer Surface Detection and Choroid Neovascularization Segmentation in OCT Images.
IEEE Trans. Medical Imaging, November, 2023

Data-Driven single image deraining: A Comprehensive review and new perspectives.
Pattern Recognit., November, 2023

Temporal-Relational hypergraph tri-Attention networks for stock trend prediction.
Pattern Recognit., November, 2023

Information-Enhanced Hierarchical Self-Attention Network for Multiturn Dialog Generation.
IEEE Trans. Comput. Soc. Syst., October, 2023

ViGT: proposal-free video grounding with a learnable token in the transformer.
Sci. China Inf. Sci., October, 2023

Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modeling.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Spatiotemporal contrastive modeling for video moment retrieval.
World Wide Web (WWW), July, 2023

TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization.
IEEE Trans. Knowl. Data Eng., July, 2023

Self-Guided Optimization Semi-Supervised Method for Joint Segmentation of Macular Hole and Cystoid Macular Edema in Retinal OCT Images.
IEEE Trans. Biomed. Eng., July, 2023

Contrastive Positive Sample Propagation Along the Audio-Visual Event Line.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

A Survey on Accuracy-Oriented Neural Recommendation: From Collaborative Filtering to Information-Rich Recommendation.
IEEE Trans. Knowl. Data Eng., May, 2023

User-based Hierarchical Network of Sina Weibo Emotion Analysis.
ACM Trans. Asian Low Resour. Lang. Inf. Process., May, 2023

Anti-jamming channel access in 5G ultra-dense networks: a game-theoretic learning approach.
Digit. Commun. Networks, April, 2023

Robust and fast low-rank deep convolutional feature recovery: toward information retention and accelerated convergence.
Knowl. Inf. Syst., March, 2023

LadRa-Net: Locally Aware Dynamic Reread Attention Net for Sentence Semantic Matching.
IEEE Trans. Neural Networks Learn. Syst., February, 2023

Boosting Hyperspectral Image Classification with Dual Hierarchical Learning.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Bias and Debias in Recommender System: A Survey and Future Directions.
ACM Trans. Inf. Syst., 2023

A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

A Text-Guided Generation and Refinement Model for Image Captioning.
IEEE Trans. Multim., 2023

Contextual Attention Network for Emotional Video Captioning.
IEEE Trans. Multim., 2023

Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA.
IEEE Trans. Image Process., 2023

Spherical Centralized Quantization for Fast Image Retrieval.
IEEE Trans. Image Process., 2023

Macroscopic-and-Microscopic Rain Streaks Disentanglement Network for Single-Image Deraining.
IEEE Trans. Image Process., 2023

A Multitemporal Scale and Spatial-Temporal Transformer Network for Temporal Action Localization.
IEEE Trans. Hum. Mach. Syst., 2023

Memorial GAN With Joint Semantic Optimization for Unpaired Image Captioning.
IEEE Trans. Cybern., 2023

Fast data-free model compression via dictionary-pair reconstruction.
Knowl. Inf. Syst., 2023

BIM and ANN-based rapid prediction approach for natural daylighting inside library spaces.
J. Intell. Fuzzy Syst., 2023

A Survey on Video Moment Localization.
ACM Comput. Surv., 2023

From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos.
CoRR, 2023

Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement.
CoRR, 2023

Dual-Path Temporal Map Optimization for Make-up Temporal Video Grounding.
CoRR, 2023

Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos.
CoRR, 2023

A Multi-view Impartial Decision Network for Frontotemporal Dementia Diagnosis.
CoRR, 2023

You Can Mask More For Extremely Low-Bitrate Image Compression.
CoRR, 2023

SSD-MonoDTR: Supervised Scale-constrained Deformable Transformer for Monocular 3D Object Detection.
CoRR, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
CoRR, 2023

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification.
CoRR, 2023

Uncertainty-inspired Open Set Learning for Retinal Anomaly Identification.
CoRR, 2023

Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers.
CoRR, 2023

Facial Affect Recognition based on Transformer Encoder and Audiovisual Fusion for the ABAW5 Challenge.
CoRR, 2023

Improving Audio-Visual Video Parsing with Pseudo Visual Labels.
CoRR, 2023

A Review of Uncertainty Estimation and its Application in Medical Imaging.
CoRR, 2023

Audio-Visual Segmentation with Semantics.
CoRR, 2023

TrFedDis: Trusted Federated Disentangling Network for Non-IID Domain Feature.
CoRR, 2023

EvidenceCap: Towards trustworthy medical image segmentation via evidential identity cap.
CoRR, 2023

Improving Recommendation Fairness via Data Augmentation.
Proceedings of the ACM Web Conference 2023, 2023

Generative-Contrastive Graph Learning for Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Topic-enhanced Graph Neural Networks for Extraction-based Explainable Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learning Fine-grained User Interests for Micro-video Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

ASM: Adaptive Sample Mining for In-The-Wild Facial Expression Recognition.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Low-Light Image Enhancement Based on Mutual Guidance Between Enhancing Strength and Image Appearance.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Attention and Relative Distance Alignment for Low-Resolution Facial Expression Recognition.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Disentangling Cognitive Diagnosis with Limited Exercise Labels.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

JTMA: Joint Multimodal Feature Fusion and Temporal Multi-head Attention for Humor Detection.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Temporal-aware Multimodal Feature Fusion for Sentiment Analysis.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Exploiting Diverse Feature for Multimodal Sentiment Analysis.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Emotion-Prior Awareness Network for Emotional Video Captioning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Data Augmentation for Human Behavior Analysis in Multi-Person Conversations.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unlearnable Examples Give a False Sense of Security: Piercing through Unexploitable Data with Learnable Examples.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

A Novel Temporal Channel Enhancement and Contextual Excavation Network for Temporal Action Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

GoRec: A Generative Cold-start Recommendation Framework.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification.
Proceedings of IJCAI-2023 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2023) co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Federated Uncertainty-Aware Aggregation for Fundus Diabetic Retinopathy Staging.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Uncertainty-Informed Mutual Learning for Joint Medical Image Classification and Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Category-Independent Visual Explanation for Medical Deep Network Understanding.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

SAM-U: Multi-box Prompts Triggered Uncertainty Estimation for Reliable SAM in Medical Image.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023 Workshops, 2023

EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Hierarchical Context Modeling Network for Landmark Recognition.
Proceedings of the IEEE International Conference on Data Mining, 2023

Enhancing Factorization Machines with Generalized Metric Learning (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Cross-Modal Contrastive Learning for Event Extraction.
Proceedings of the Database Systems for Advanced Applications, 2023

ABAW5 Challenge: A Facial Affect Recognition Approach Utilizing Transformer Encoder and Audiovisual Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

LP-DIF: Learning Local Pattern-Specific Deep Implicit Function for 3D Objects and Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fine-grained Audible Video Description.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Prompting Large Language Models with Answer Heuristics for Knowledge-Based Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Data-Free Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Domain Generalized Stereo Matching via Hierarchical Visual Transformation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fair Representation Learning for Recommendation: A Mutual Information Perspective.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

MCL: Multi-Granularity Contrastive Learning Framework for Chinese NER.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Rethinking Data-Free Quantization as a Zero-Sum Game.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Energy-Efficient Transmit Probability-Power Control for Covert D2D Communications With Age of Information Constraints.
IEEE Trans. Veh. Technol., 2022

Introduction to the Special Section on Learning Representations, Similarity, and Associations in Dynamic Multimedia Environments.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Decoupled Low-Light Image Enhancement.
ACM Trans. Multim. Comput. Commun. Appl., 2022

An Unsupervised Aspect-Aware Recommendation Model with Explanation Text Generation.
ACM Trans. Inf. Syst., 2022

Transductive Relation-Propagation With Decoupling Training for Few-Shot Learning.
IEEE Trans. Neural Networks Learn. Syst., 2022

Graph-Based Multimodal Sequential Embedding for Sign Language Translation.
IEEE Trans. Multim., 2022

Unpaired Image Captioning With Semantic-Constrained Self-Learning.
IEEE Trans. Multim., 2022

MsTGANet: Automatic Drusen Segmentation From Retinal OCT Images.
IEEE Trans. Medical Imaging, 2022

DiffNet++: A Neural Influence and Interest Diffusion Network for Social Recommendation.
IEEE Trans. Knowl. Data Eng., 2022

A Survey on Large-Scale Machine Learning.
IEEE Trans. Knowl. Data Eng., 2022

Enhancing Factorization Machines With Generalized Metric Learning.
IEEE Trans. Knowl. Data Eng., 2022

Improving Image Similarity Learning by Adding External Memory.
IEEE Trans. Knowl. Data Eng., 2022

Dual-MGAN: An Efficient Approach for Semi-supervised Outlier Detection with Few Identified Anomalies.
ACM Trans. Knowl. Discov. Data, 2022

Speckle Noise Reduction for OCT Images Based on Image Style Transfer and Conditional GAN.
IEEE J. Biomed. Health Informatics, 2022

A Temporal-Aware Relation and Attention Network for Temporal Action Localization.
IEEE Trans. Image Process., 2022

Discriminative Style Learning for Cross-Domain Image Captioning.
IEEE Trans. Image Process., 2022

Video Moment Retrieval With Cross-Modal Neural Architecture Search.
IEEE Trans. Image Process., 2022

Personality Assessment Based on Multimodal Attention Network Learning With Category-Based Mean Square Error.
IEEE Trans. Image Process., 2022

Continual Referring Expression Comprehension via Dual Modular Memorization.
IEEE Trans. Image Process., 2022

Two-Branch Attention Network via Efficient Semantic Coupling for One-Shot Learning.
IEEE Trans. Image Process., 2022

Spatio-Temporal Collaborative Module for Efficient Action Recognition.
IEEE Trans. Image Process., 2022

Hierarchical Representation Network With Auxiliary Tasks for Video Captioning and Video Question Answering.
IEEE Trans. Image Process., 2022

Secure User Authentication Leveraging Keystroke Dynamics via Wi-Fi Sensing.
IEEE Trans. Ind. Informatics, 2022

WiGRUNT: WiFi-Enabled Gesture Recognition Using Dual-Attention Network.
IEEE Trans. Hum. Mach. Syst., 2022

DNA: Deeply Supervised Nonlinear Aggregation for Salient Object Detection.
IEEE Trans. Cybern., 2022

KTN: Knowledge Transfer Network for Learning Multiperson 2D-3D Correspondences.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multi-Person Pose Estimation With Accurate Heatmap Regression and Greedy Association.
IEEE Trans. Circuits Syst. Video Technol., 2022

Adaptive Feature Aggregation in Deep Multi-Task Convolutional Neural Networks.
IEEE Trans. Circuits Syst. Video Technol., 2022

Emotional Conversation Generation With Bilingual Interactive Decoding.
IEEE Trans. Comput. Soc. Syst., 2022

Joint Segmentation of Multi-Class Hyper-Reflective Foci in Retinal Optical Coherence Tomography Images.
IEEE Trans. Biomed. Eng., 2022

FLAG: Faster Learning on Anchor Graph with Label Predictor Optimization.
IEEE Trans. Big Data, 2022

MAN: Mining Ambiguity and Noise for Facial Expression Recognition in the Wild.
Pattern Recognit. Lett., 2022

Multi-stage and multi-branch network with similar expressions label distribution learning for facial expression recognition.
Pattern Recognit. Lett., 2022

Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Pose-Guided Representation Learning for Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Context-Aware Graph Inference With Knowledge Distillation for Visual Dialog.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Differentiated Explanation of Deep Neural Networks With Skewed Distributions.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Dual Encoding for Video Retrieval by Text.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A Highly Efficient Model to Study the Semantics of Salient Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

AutoESD: a web tool for automatic editing sequence design for genetic manipulation of microorganisms.
Nucleic Acids Res., 2022

Editorial: Deep dictionary learning: Algorithm, theory and application.
Neurocomputing, 2022

Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods.
Int. J. Multim. Inf. Retr., 2022

Guest Editorial: Intelligent information processing and services in media convergence.
Int. J. Intell. Syst., 2022

A butterfly-like slot UWB antenna with WLAN band-notch characteristics for MIMO applications.
IEICE Electron. Express, 2022

Dual Expression Fusion: A Universal Microexpression Recognition Framework.
IEEE Multim., 2022

Long-Range Zero-Shot Generative Deep Network Quantization.
CoRR, 2022

Hybrid Multimodal Fusion for Humor Detection.
CoRR, 2022

Unsupervised Domain Adaptation via Style-Aware Self-intermediate Domain.
CoRR, 2022

T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up.
CoRR, 2022

Temporal Action Localization with Multi-temporal Scales.
CoRR, 2022

A Semantic-aware Attention and Visual Shielding Network for Cloth-changing Person Re-identification.
CoRR, 2022

KTN: Knowledge Transfer Network for Learning Multi-person 2D-3D Correspondences.
CoRR, 2022

ClusterQ: Semantic Feature Distribution Alignment for Data-Free Quantization.
CoRR, 2022

Semi-DRDNet: Semi-supervised Detail-recovery Image Deraining Network via Unpaired Contrastive Learning.
CoRR, 2022

EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition.
CoRR, 2022

Compact Bidirectional Transformer for Image Captioning.
CoRR, 2022

Progressive learning with multi-scale attention network for cross-domain vehicle re-identification.
Sci. China Inf. Sci., 2022

FairCF: fairness-aware collaborative filtering.
Sci. China Inf. Sci., 2022

Bayesian feature interaction selection for factorization machines.
Artif. Intell., 2022

A Review-aware Graph Contrastive Learning Framework for Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Thinking inside The Box: Learning Hypercube Representations for Group Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Deep Multi-Resolution Mutual Learning for Image Inpainting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CRNet: Unsupervised Color Retention Network for Blind Motion Deblurring.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hybrid Multimodal Fusion for Humor Detection.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SGINet: Toward Sufficient Interaction Between Single Image Deraining and Semantic Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Differentiable Cross-modal Hashing via Multimodal Transformers.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Robust Low-Rank Convolution Network for Image Denoising.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Delving Globally into Texture and Structure for Image Inpainting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Multimodal Temporal Attention in Sentiment Analysis.
Proceedings of the MuSe@MM 2022: Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge, 2022

Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Multi-view-based automatic method for multiple diseases screening in retinal OCT images.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

Acute branch retinal artery occlusion segmentation based on Bayes posterior probability and deep learning.
Proceedings of the Medical Imaging 2022: Image Processing, 2022

TBraTS: Trusted Brain Tumor Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Facing Annotation Redundancy: OCT Layer Segmentation with only 10 Annotated Pixels per Layer.
Proceedings of the Resource-Efficient Medical Image Analysis - First MICCAI Workshop, 2022

Tiny-Lesion Segmentation in OCT via Multi-scale Wavelet Enhanced Transformer.
Proceedings of the Ophthalmic Medical Image Analysis - 9th International Workshop, 2022

SpiroFi: Contactless Pulmonary Function Monitoring using WiFi Signal.
Proceedings of the 30th IEEE/ACM International Symposium on Quality of Service, 2022

Audio-Visual Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Switchable Online Knowledge Distillation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Deep Color Consistent Network for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Collaborative Neural Social Recommendation.
IEEE Trans. Syst. Man Cybern. Syst., 2021

Adaptive Multi-Task Dual-Structured Learning with Its Application on Alzheimer's Disease Study.
ACM Trans. Internet Techn., 2021

Deep Coattention-Based Comparator for Relative Representation Learning in Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., 2021

RMoR-Aion: Robust Multioutput Regression by Simultaneously Alleviating Input and Output Noises.
IEEE Trans. Neural Networks Learn. Syst., 2021

Twin-Incoherent Self-Expressive Locality-Adaptive Latent Dictionary Pair Learning for Classification.
IEEE Trans. Neural Networks Learn. Syst., 2021

Boosting Temporal Binary Coding for Large-Scale Video Search.
IEEE Trans. Multim., 2021

Kernelized Multiview Subspace Analysis By Self-Weighted Learning.
IEEE Trans. Multim., 2021

Learning Low-Rank Sparse Representations With Robust Relationship Inference for Image Memorability Prediction.
IEEE Trans. Multim., 2021

Semi-Supervised Capsule cGAN for Speckle Noise Reduction in Retinal OCT Images.
IEEE Trans. Medical Imaging, 2021

Automatic Staging for Retinopathy of Prematurity With Deep Feature Fusion and Ordinal Classification Strategy.
IEEE Trans. Medical Imaging, 2021

Flexible Auto-Weighted Local-Coordinate Concept Factorization: A Robust Framework for Unsupervised Clustering.
IEEE Trans. Knowl. Data Eng., 2021

DerainCycleGAN: Rain Attentive CycleGAN for Single Image Deraining and Rainmaking.
IEEE Trans. Image Process., 2021

Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition.
IEEE Trans. Image Process., 2021

Toward Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs.
IEEE Trans. Cybern., 2021

Generalized Incomplete Multiview Clustering With Flexible Locality Structure Diffusion.
IEEE Trans. Cybern., 2021

WiONE: One-Shot Learning for Environment-Robust Device-Free User Authentication via Commodity Wi-Fi in Man-Machine System.
IEEE Trans. Comput. Soc. Syst., 2021

Adversarial co-distillation learning for image recognition.
Pattern Recognit., 2021

Meta-learning based relation and representation learning networks for single-image deraining.
Pattern Recognit., 2021

Matrix Completion with Deterministic Sampling: Theories and Methods.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Dense Residual Network: Enhancing global dense feature flow for character recognition.
Neural Networks, 2021

Sequential image encoding for vision-to-language problems.
Multim. Tools Appl., 2021

Topic extraction from extremely short texts with variational manifold regularization.
Mach. Learn., 2021

Blind image quality assessment with channel attention based deep residual network and extended LargeVis dimensionality reduction.
J. Vis. Commun. Image Represent., 2021

Editorial: IMAVIS special issue on deep cross-media neural model for generating image descriptions.
Image Vis. Comput., 2021

Dual-Constrained Deep Semi-Supervised Coupled Factorization Network with Enriched Prior.
Int. J. Comput. Vis., 2021

LadRa-Net: Locally-Aware Dynamic Re-read Attention Net for Sentence Semantic Matching.
CoRR, 2021

Few-shot Learning with Global Relatedness Decoupled-Distillation.
CoRR, 2021

MCGNet: Partial Multi-view Few-shot Learning via Meta-alignment and Context Gated-aggregation.
CoRR, 2021

A Survey on Neural Recommendation: From Collaborative Filtering to Content and Context Enriched Recommendation.
CoRR, 2021

Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modelling.
CoRR, 2021

Revisiting Deep Local Descriptor for Improved Few-Shot Classification.
CoRR, 2021

Learning Fair Representations for Bipartite Graph based Recommendation.
CoRR, 2021

Deep Adversarial Inconsistent Cognitive Sampling for Multi-view Progressive Subspace Clustering.
CoRR, 2021

Deep Subspace Mutual Learning for cancer subtypes prediction.
Bioinform., 2021

Learning Fair Representations for Recommendation: A Graph-based Perspective.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Realize Ultra-Reliability and Low-Latency in Haptic Communication through Prediction.
Proceedings of the 13th International Conference on Wireless Communications and Signal Processing, 2021

Enhanced Graph Learning for Collaborative Filtering via Mutual Information Maximization.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Deconfounded Video Moment Retrieval with Causal Intervention.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Privileged Graph Distillation for Cold Start Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

GCM-Net: Towards Effective Global Context Modeling for Image Inpainting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Pairwise VLAD Interaction Network for Video Question Answering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal Emotion Recognition and Sentiment Analysis via Attention Enhanced Recurrent Model.
Proceedings of the MuSe '21: Proceedings of the 2nd on Multimodal Sentiment Analysis Challenge, 2021

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

SANet: a self-adaptive network for hyperreflective foci segmentation in retinal OCT images.
Proceedings of the Medical Imaging 2021: Image Processing, Online, February 15-19, 2021

High-Resolution Hierarchical Adversarial Learning for OCT Speckle Noise Reduction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Learning Elastic Embeddings for Customizing On-Device Recommenders.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Cu-Segnet: Corneal Ulcer Segmentation Network.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Cycle Adaptive Multi-Target Weighting Network For Automated Diabetic Retinopathy Segmentation.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Binary MOGWO Based On Competition and Teaching for Computationally Complex Engineering Applications.
Proceedings of the International Joint Conference on Neural Networks, 2021

Embedding Extra Knowledge and A Dependency Tree Based on A Graph Attention Network for Aspect-based Sentiment Analysis.
Proceedings of the International Joint Conference on Neural Networks, 2021

Semi-Deraingan: A New Semi-Supervised Single Image Deraining.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Triplet Deep Subspace Clustering via Self-Supervised Data Augmentation.
Proceedings of the IEEE International Conference on Data Mining, 2021

Discriminative Additive Scale Loss for Deep Imbalanced Classification and Embedding.
Proceedings of the IEEE International Conference on Data Mining, 2021

Robust Low-rank Deep Feature Recovery in CNNs: Toward Low Information Loss and Fast Convergence.
Proceedings of the IEEE International Conference on Data Mining, 2021

Dictionary Pair-based Data-Free Fast Deep Neural Network Compression.
Proceedings of the IEEE International Conference on Data Mining, 2021

Semi-Autoregressive Transformer for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Motion Prediction using Trajectory Cues.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Positive Sample Propagation Along the Audio-Visual Event Line.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DGA-Net: Dynamic Gaussian Attention Network for Sentence Semantic Matching.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

Meta-learned ID Embeddings for Online Inductive Recommendation.
Proceedings of the Information Retrieval - 27th China Conference, 2021

Partial-Label and Structure-constrained Deep Coupled Factorization Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Making the Relation Matters: Relation of Relation Learning Network for Sentence Semantic Matching.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Proposal-Free Video Grounding with Contextual Pyramid Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
OpenMRE: A Numerical Platform for MRE Study.
IEEE Trans. Syst. Man Cybern. Syst., 2020

Proposal Complementary Action Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Video Retrieval with Similarity-Preserving Deep Temporal Hashing.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Large-Scale Question Tagging via Joint Question-Topic Embedding Learning.
ACM Trans. Inf. Syst., 2020

Robust Triple-Matrix-Recovery-Based Auto-Weighted Label Propagation for Classification.
IEEE Trans. Neural Networks Learn. Syst., 2020

Discriminative Local Sparse Representation by Robust Adaptive Dictionary Pair Learning.
IEEE Trans. Neural Networks Learn. Syst., 2020

Low-Light Image Enhancement With Semi-Decoupled Decomposition.
IEEE Trans. Multim., 2020

CPFNet: Context Pyramid Fusion Network for Medical Image Segmentation.
IEEE Trans. Medical Imaging, 2020

Joint Label Prediction Based Semi-Supervised Adaptive Concept Factorization for Robust Data Representation.
IEEE Trans. Knowl. Data Eng., 2020

A Hierarchical Attention Model for Social Contextual Image Recommendation.
IEEE Trans. Knowl. Data Eng., 2020

Generative Adversarial Active Learning for Unsupervised Outlier Detection.
IEEE Trans. Knowl. Data Eng., 2020

Cross-Domain Sentiment Encoding through Stochastic Word Embedding.
IEEE Trans. Knowl. Data Eng., 2020

Deep Neighborhood Component Analysis for Visual Similarity Modeling.
ACM Trans. Intell. Syst. Technol., 2020

Light Field Saliency Detection With Deep Convolutional Networks.
IEEE Trans. Image Process., 2020

Few-Shot Deep Adversarial Learning for Video-Based Person Re-Identification.
IEEE Trans. Image Process., 2020

Learning Hybrid Representation by Robust Dictionary Learning in Factorized Compressed Space.
IEEE Trans. Image Process., 2020

Online Multi-Expert Learning for Visual Tracking.
IEEE Trans. Image Process., 2020

Learning Symmetry Consistent Deep CNNs for Face Completion.
IEEE Trans. Image Process., 2020

Hierarchical Recurrent Deep Fusion Using Adaptive Clip Summarization for Sign Language Translation.
IEEE Trans. Image Process., 2020

Textual-Visual Reference-Aware Attention Network for Visual Dialog.
IEEE Trans. Image Process., 2020

An Interpretable Deep Architecture for Similarity Learning Built Upon Hierarchical Concepts.
IEEE Trans. Image Process., 2020

Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Cross-Entropy Adversarial View Adaptation for Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Introduction to the Special Section on Contextual Object Analysis in Complex Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2020

Multilevel fusion of multimodal deep features for porn streamer recognition in live video.
Pattern Recognit. Lett., 2020

Person re-identification based on multi-scale constraint network.
Pattern Recognit. Lett., 2020

Gated CNN: Integrating multi-scale feature layers for object detection.
Pattern Recognit., 2020

Self-attention driven adversarial similarity learning network.
Pattern Recognit., 2020

Vocabulary-Informed Zero-Shot and Open-Set Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding.
Neural Process. Lett., 2020

DenseNet with Up-Sampling block for recognizing texts in images.
Neural Comput. Appl., 2020

Mutual-manifold regularized robust fast latent LRR for subspace recovery and learning.
Neural Comput. Appl., 2020

Social-Aware Collaborative Caching Based on User Preferences for D2D Content Sharing.
KSII Trans. Internet Inf. Syst., 2020

Personalized image quality assessment with Social-Sensed aesthetic preference.
Inf. Sci., 2020

Coarse-to-fine object detection in unmanned aerial vehicle imagery using lightweight convolutional neural network and deep motion saliency.
Neurocomputing, 2020

Toward Sensing Emotions With Deep Visual Analysis: A Long-Term Psychological Modeling Approach.
IEEE Multim., 2020

R²-Net: Relation of Relation Learning Network for Sentence Semantic Matching.
CoRR, 2020

SimpleMKKM: Simple Multiple Kernel K-means.
CoRR, 2020

Person Re-Identification via Active Hard Sample Mining.
CoRR, 2020

RCC-Dual-GAN: An Efficient Approach for Outlier Detection with Few Identified Anomalies.
CoRR, 2020

Semi-DerainGAN: A New Semi-supervised Single Image Deraining Network.
CoRR, 2020

Reinforced Negative Sampling over Knowledge Graph for Recommendation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

Dual Learning for Explainable Recommendation: Towards Unifying User Preference Prediction and Review Generation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

How to Retrain Recommender System?: A Sequential Meta-Learning Method.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Joint Item Recommendation and Attribute Inference: An Adaptive Graph Convolutional Network Approach.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Learning to Transfer Graph Embeddings for Inductive Graph based Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Try This Instead: Personalized and Interpretable Substitute Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Deep Self-representative Concept Factorization Network for Representation Learning.
Proceedings of the 2020 SIAM International Conference on Data Mining, 2020

ASTA-Net: Adaptive Spatio-Temporal Attention Network for Person Re-Identification in Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Deep Multimodal Neural Architecture Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly-Supervised Video Object Grounding by Exploring Spatio-Temporal Contexts.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Context-Aware Refinement Network for Person Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Memory-Augmented Relation Network for Few-Shot Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Texture Semantically Aligned with Visibility-aware for Partial Person Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Learning to Discretely Compose Reasoning Module Networks for Video Captioning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Unsupervised Vehicle Re-identification with Progressive Adaptation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Recurrent Relational Memory Network for Unsupervised Image Captioning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Multi-Scale Spatial-Temporal Integration Convolutional Tube for Human Action Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

MDPL-net: Multi-layer Dictionary Learning Network with Added Skip Dense Connections.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Feature Pyramid Transformer.
Proceedings of the Computer Vision - ECCV 2020, 2020

Large-Scale Few-Shot Learning via Multi-modal Knowledge Discovery.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fully-Convolutional Intensive Feature Flow Neural Network for Text Recognition.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020

Convolutional Dictionary Pair Learning Network for Image Representation Learning.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020

Multilayer Collaborative Low-Rank Coding Network for Robust Deep Subspace Discovery.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020

More Grounded Image Captioning by Distilling Image-Text Matching Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Enhanced Blind Face Restoration With Multi-Exemplar Images and Adaptive Spatial Feature Fusion.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Iterative Context-Aware Graph Inference for Visual Dialog.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Detail-recovery Image Deraining via Context Aggregation Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning to Match on Graph for Fashion Compatibility Modeling.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Revisiting Graph Based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cross-Modality Retrieval by Joint Correlation Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Cross-Modality Feature Learning via Convolutional Autoencoder.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Person Reidentification via Structural Deep Metric Learning.
IEEE Trans. Neural Networks Learn. Syst., 2019

3-D PersonVLAD: Learning Deep Global Representations for Video-Based Person Reidentification.
IEEE Trans. Neural Networks Learn. Syst., 2019

Geometry and Topology Preserving Hashing for SIFT Feature.
IEEE Trans. Multim., 2019

Quality-Aware Unpaired Image-to-Image Translation.
IEEE Trans. Multim., 2019

Unsupervised Nonnegative Adaptive Feature Extraction for Data Representation.
IEEE Trans. Knowl. Data Eng., 2019

Motion-Aware Compression and Transmission of Mesh Animation Sequences.
ACM Trans. Intell. Syst. Technol., 2019

A Framework of Joint Low-Rank and Sparse Regression for Image Memorability Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2019

Kernel-Induced Label Propagation by Mapping for Semi-Supervised Classification.
IEEE Trans. Big Data, 2019

A Calibration Method for the Errors of Ring Laser Gyro in Rate-Biased Mode.
Sensors, 2019

Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Robust auto-weighted projective low-rank and sparse recovery for visual representation.
Neural Networks, 2019

Personalized training through Kinect-based games for physical education.
J. Vis. Commun. Image Represent., 2019

MDS Coded Caching for Device-to-Device Content Sharing Against Eavesdropping.
KSII Trans. Internet Inf. Syst., 2019

A Stackelberg Game-Theoretic Solution to Win-Win Situation: A Presale Mechanism in Spectrum Market.
IEICE Trans. Inf. Syst., 2019

Support Vector Machine for Analyzing Contributions of Brain Regions During Task-State fMRI.
Frontiers Neuroinformatics, 2019

Fast DenseNet: Towards Efficient and Accurate Text Recognition with Fast Dense Networks.
CoRR, 2019

Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition.
CoRR, 2019

Kernelized Multiview Subspace Analysis by Self-weighted Learning.
CoRR, 2019

Personality-Aware Probabilistic Map for Trajectory Prediction of Pedestrians.
CoRR, 2019

DRD-Net: Detail-recovery Image Deraining via Context Aggregation Networks.
CoRR, 2019

DeepIlluminance: Contextual Illuminance Estimation via Deep Neural Networks.
CoRR, 2019

Few-Shot Deep Adversarial Learning for Video-based Person Re-identification.
CoRR, 2019

Fog-Computing-Enabled Cognitive Network Function Virtualization for an Information-Centric Future Internet.
IEEE Commun. Mag., 2019

In Vitro Fertilization (IVF) Cumulative Pregnancy Rate Prediction From Basic Patient Characteristics.
IEEE Access, 2019

Trust Evaluation and Covert Communication-Based Secure Content Delivery for D2D Networks: A Hierarchical Matching Approach.
IEEE Access, 2019

Pattern-Aware Intelligent Anti-Jamming Communication: A Sequential Deep Reinforcement Learning Approach.
IEEE Access, 2019

Distributed Multichannel Access in High-Frequency Diversity Networks: A Multi-Agent Learning Approach With Correlated Equilibrium.
IEEE Access, 2019

Power Control in Relay-Assisted Anti-Jamming Systems: A Bayesian Three-Layer Stackelberg Game Approach.
IEEE Access, 2019

Fast Multi-Objective Optimization of Multi-Parameter Antenna Structures Based on Improved BPNN Surrogate Model.
IEEE Access, 2019

Covert Communication with Power Uncertainty for D2D Content Sharing.
Proceedings of the 11th International Conference on Wireless Communications and Signal Processing, 2019

UMC4M: A Verification Tool via Program Execution.
Proceedings of the Structured Object-Oriented Formal Language and Method, 2019

Interpretable Fashion Matching with Rich Attributes.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

A Neural Influence Diffusion Model for Social Recommendation.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Neural Graph Collaborative Filtering.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Efficient Graph Based Multi-view Learning.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Robust Subspace Discovery by Block-diagonal Adaptive Locality-constrained Representation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Question-Aware Tube-Switch Network for Video Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Single-shot Semantic Image Inpainting with Densely Connected Generative Networks.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multimodal Dialog System: Generating Responses via Adaptive Decoders.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Personalized Multimedia Item and Key Frame Recommendation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dual Visual Attention Network for Visual Dialog.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dense Temporal Convolution Network for Sign Language Translation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Connectionist Temporal Modeling of Video and Language: a Joint Model for Translation and Sign Labeling.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Approximate Optimal Transport for Continuous Densities with Copulas.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Parallel Temporal Encoder For Sign Language Translation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Learning Structured Twin-Incoherent Twin-Projective Latent Dictionary Pairs for Classification.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

A Coarse-to-Fine Multi-stream Hybrid Deraining Network for Single Image Deraining.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Adaptive Structure-Constrained Robust Latent Low-Rank Coding for Image Recovery.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Robust Unsupervised Flexible Auto-weighted Local-coordinate Concept Factorization for Image Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adaptive Transfer Network for Cross-Domain Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Graphonomy: Universal Human Parsing via Graph Transfer Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

TransNFCM: Translation-Based Neural Fashion Compatibility Modeling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Video Content Structure.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Reversed Spectral Hashing.
IEEE Trans. Neural Networks Learn. Syst., 2018

Object Detection and Tracking Under Occlusion for Object-Level RGB-D Video Segmentation.
IEEE Trans. Multim., 2018

Product Adoption Rate Prediction in a Competitive Market.
IEEE Trans. Knowl. Data Eng., 2018

Low-Rank Multi-View Embedding Learning for Micro-Video Popularity Prediction.
IEEE Trans. Knowl. Data Eng., 2018

Person Re-Identification With Metric Learning Using Privileged Information.
IEEE Trans. Image Process., 2018

Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.
IEEE Trans. Image Process., 2018

Zero-Shot Learning via Attribute Regression and Class Prototype Rectification.
IEEE Trans. Image Process., 2018

First-Person Daily Activity Recognition With Manipulated Object Proposals and Non-Linear Feature Fusion.
IEEE Trans. Circuits Syst. Video Technol., 2018

Topic driven multimodal similarity learning with multi-view voted convolutional features.
Pattern Recognit., 2018

Adaptive non-negative projective semi-supervised learning for inductive classification.
Neural Networks, 2018

Multimedia analysis with collective intelligence.
J. Vis. Commun. Image Represent., 2018

Attention driven multi-modal similarity learning.
Inf. Sci., 2018

Annotation modification for fine-grained visual recognition.
Neurocomputing, 2018

Evolutionary nonnegative matrix factorization with adaptive control of cluster quality.
Neurocomputing, 2018

pDisVPL: Probabilistic Discriminative Visual Part Learning for Image Classification.
IEEE Multim., 2018

3D PersonVLAD: Learning Deep Global Representations for Video-based Person Re-identification.
CoRR, 2018

Learning Symmetry Consistent Deep CNNs for Face Completion.
CoRR, 2018

SocialGCN: An Efficient Graph Convolutional Network based Model for Social Recommendation.
CoRR, 2018

MsCGAN: Multi-scale Conditional Generative Adversarial Networks for Person Image Generation.
CoRR, 2018

Explainable Social Contextual Image Recommendation with Hierarchical Attention.
CoRR, 2018

Matrix Completion with Nonuniform Sampling: Theories and Methods.
CoRR, 2018

Fast and Robust Subspace Clustering Using Random Projections.
CoRR, 2018

How Does Bug-Handling Effort Differ Among Different Programming Languages?
CoRR, 2018

Attentive Recurrent Social Recommendation.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Video Captioning Based on the Spatial-Temporal Saliency Tracing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Video-Based Person Re-identification with Adaptive Multi-part Features Learning.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Semantic Image Inpainting with Progressive Generative Networks.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Connectionist Temporal Fusion for Sign Language Translation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

USAR: An Interactive User-specific Aesthetic Ranking Framework for Images.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Scalable Active Learning by Approximated Error Reduction.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Robust Adaptive Label Propagation by Double Matrix Decomposition.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Robust Adaptive Low-Rank and Sparse Embedding for Feature Representation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Robust Discriminative Projective Dictionary Pair Learning by Adaptive Representations.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Robust Projective Low-Rank and Sparse Representation by Robust Dictionary Learning.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Multi-Cue Correlation Filters for Robust Visual Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Single Low-Light Image Enhancement by Fusing Multiple Sources.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Hierarchical LSTM for Sign Language Translation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Saliency Detection on Light Field: A Multi-Cue Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Enhancing Person Re-identification in a Self-Trained Subspace.
ACM Trans. Multim. Comput. Commun. Appl., 2017

VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks.
IEEE Trans. Multim., 2017

Image Location Inference by Multisaliency Enhancement.
IEEE Trans. Multim., 2017

Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval.
IEEE Trans. Multim., 2017

Modeling the Evolution of Users' Preferences and Social Links in Social Networking Services.
IEEE Trans. Knowl. Data Eng., 2017

Learning on Big Graph: Label Inference and Regularization with Anchor Hierarchy.
IEEE Trans. Knowl. Data Eng., 2017

Learning Bregman Distance Functions for Structural Learning to Rank.
IEEE Trans. Knowl. Data Eng., 2017

User Vitality Ranking and Prediction in Social Networking Services: A Dynamic Network Perspective.
IEEE Trans. Knowl. Data Eng., 2017

Learning User Attributes via Mobile Social Multimedia Analytics.
ACM Trans. Intell. Syst. Technol., 2017

Visual Classification of Furniture Styles.
ACM Trans. Intell. Syst. Technol., 2017

Image Re-Ranking Based on Topic Diversity.
IEEE Trans. Image Process., 2017

Facial Age Estimation With Age Difference.
IEEE Trans. Image Process., 2017

Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud.
IEEE Trans. Image Process., 2017

Unsupervised t-Distributed Video Hashing and Its Deep Hashing Extension.
IEEE Trans. Image Process., 2017

Constrained Low-Rank Learning Using Least Squares-Based Regularization.
IEEE Trans. Cybern., 2017

Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Scene Parsing by Weakly Supervised Learning with Image Descriptions.
CoRR, 2017

BIMTag: Concept-based automatic semantic annotation of online BIM product resources.
Adv. Eng. Informatics, 2017

Investigating Examination Behavior of Image Search Users.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Deep Graph Laplacian Hashing for Image Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

3DensiNet: A Robust Neural Network Architecture towards 3D Volumetric Object Prediction from 2D Image.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Towards Micro-video Understanding by Joint Sequential-Sparse Modeling.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Decoding dynamic auditory attention during naturalistic experience.
Proceedings of the 14th IEEE International Symposium on Biomedical Imaging, 2017

Fog computing based content-aware taxonomy for caching optimization in information-centric networks.
Proceedings of the 2017 IEEE Conference on Computer Communications Workshops, 2017

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

CC-fog: Toward content-centric fog networks for E-health.
Proceedings of the 19th IEEE International Conference on e-Health Networking, 2017

Security Analysis of Simple Network Management Protocol Based IEEE P21451 Internet of Things.
Proceedings of the 15th IEEE Intl Conf on Dependable, 2017

2016
Learning from Collective Intelligence: Feature Learning Using Social Images and Tags.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Spatiochromatic Context Modeling for Color Saliency Analysis.
IEEE Trans. Neural Networks Learn. Syst., 2016

Image Classification by Selective Regularized Subspace Learning.
IEEE Trans. Multim., 2016

Deep Aging Face Verification With Large Gaps.
IEEE Trans. Multim., 2016

Spin Contour.
IEEE Trans. Multim., 2016

Scalable Semi-Supervised Learning by Efficient Anchor Graph Regularization.
IEEE Trans. Knowl. Data Eng., 2016

EIC Editorial.
IEEE Trans. Knowl. Data Eng., 2016

Deep Fusion of Multiple Semantic Cues for Complex Event Recognition.
IEEE Trans. Image Process., 2016

Detecting Densely Distributed Graph Patterns for Fine-Grained Image Categorization.
IEEE Trans. Image Process., 2016

Beyond Object Proposals: Random Crop Pooling for Multi-Label Image Recognition.
IEEE Trans. Image Process., 2016

Enhancing Sketch-Based Image Retrieval by Re-Ranking and Relevance Feedback.
IEEE Trans. Image Process., 2016

Multi-View Object Retrieval via Multi-Scale Topic Models.
IEEE Trans. Image Process., 2016

An Efficient Tracking System by Orthogonalized Templates.
IEEE Trans. Ind. Electron., 2016

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.
IEEE Trans. Cybern., 2016

Block Principal Component Analysis With Nongreedy ℓ1-Norm Maximization.
IEEE Trans. Cybern., 2016

Introduction of New Associate Editors.
IEEE Trans. Circuits Syst. Video Technol., 2016

Image detail enhancement with spatially guided filters.
Signal Process., 2016

Linear discrimination dictionary learning for shape descriptors.
Pattern Recognit. Lett., 2016

Spatially guided local Laplacian filter for nature image detail enhancement.
Multim. Tools Appl., 2016

Accurate online video tagging via probabilistic hybrid modeling.
Multim. Syst., 2016

Large-scale supervised similarity learning in networks.
Knowl. Inf. Syst., 2016

Robust geometric ℓp-norm feature pooling for image classification and action recognition.
Image Vis. Comput., 2016

Behavior analysis in social networks: Challenges, technologies, and trends.
Neurocomputing, 2016

Local voting based multi-view embedding.
Neurocomputing, 2016

Active learning on anchorgraph with an improved transductive experimental design.
Neurocomputing, 2016

Learning content-social influential features for influence analysis.
Int. J. Multim. Inf. Retr., 2016

A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition.
Int. J. Comput. Vis., 2016

Scale-Aware Spatially Guided Mapping.
IEEE Multim., 2016

Event analysis in social multimedia: a survey.
Frontiers Comput. Sci., 2016

Trust Agent-Based Behavior Induction in Social Networks.
IEEE Intell. Syst., 2016

Visual Processing by a Unified Schatten-p Norm and ℓq Norm Regularized Principal Component Pursuit.
CoRR, 2016

Regularized Taylor Echo State Networks for Predictive Control of Partially Observed Systems.
IEEE Access, 2016

Selective Level Set Segmentation Using Fuzzy Region Competition.
IEEE Access, 2016

Predicting Search User Examination with Visual Saliency.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Learning A Task-Specific Deep Architecture For Clustering.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Global Consistent Shape Correspondence for Efficient and Effective Active Shape Models.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Local Diffusion Map Signature for Symmetry-aware Non-rigid Shape Correspondence.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Empirical Risk Minimization for Metric Learning Using Privileged Information.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Relaxed Ranking-Based Factor Model for Recommender System from Implicit Feedback.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Sign language recognition based on adaptive HMMs with data augmentation.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

DisturbLabel: Regularizing CNN on the Loss Layer.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learned Binary Spectral Shape Descriptor for 3D Shape Correspondence.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Retargeting Semantically-Rich Photos.
IEEE Trans. Multim., 2015

Understanding Blooming Human Groups in Social Networks.
IEEE Trans. Multim., 2015

Online Feature Selection with Group Structure Analysis.
IEEE Trans. Knowl. Data Eng., 2015

Visual Classification by ℓ1-Hypergraph Modeling.
IEEE Trans. Knowl. Data Eng., 2015

Disease Inference from Health-Related Questions via Sparse Deep Learning.
IEEE Trans. Knowl. Data Eng., 2015

Robust Multiview Feature Learning for RGB-D Image Understanding.
ACM Trans. Intell. Syst. Technol., 2015

Neighborhood Discriminant Hashing for Large-Scale Image Retrieval.
IEEE Trans. Image Process., 2015

Multimodal Deep Autoencoder for Human Pose Recovery.
IEEE Trans. Image Process., 2015

Image Annotation by Latent Community Detection and Multikernel Learning.
IEEE Trans. Image Process., 2015

An Attribute-Assisted Reranking Model for Web Image Search.
IEEE Trans. Image Process., 2015

An Automatic Three-Dimensional Scene Reconstruction System Using Crowdsourced Geo-Tagged Videos.
IEEE Trans. Ind. Electron., 2015

Image-Based Three-Dimensional Human Pose Recovery by Multiview Locality-Sensitive Sparse Retrieval.
IEEE Trans. Ind. Electron., 2015

Learning to Rank Using User Clicks and Visual Features for Image Retrieval.
IEEE Trans. Cybern., 2015

Facilitating Image Search With a Scalable and Compact Semantic Mapping.
IEEE Trans. Cybern., 2015

Event-Based Media Enrichment Using an Adaptive Probabilistic Hypergraph Model.
IEEE Trans. Cybern., 2015

Data-Driven Affective Filtering for Images and Videos.
IEEE Trans. Cybern., 2015

Crowded Scene Analysis: A Survey.
IEEE Trans. Circuits Syst. Video Technol., 2015

Learning Visual Semantic Relationships for Efficient Visual Retrieval.
IEEE Trans. Big Data, 2015

Semantic embedding for indoor scene recognition by weighted hypergraph learning.
Signal Process., 2015

Visual data denoising with a unified Schatten-p norm and ℓq norm regularized principal component pursuit.
Pattern Recognit., 2015

Visual word expansion and BSIFT verification for large-scale image search.
Multim. Syst., 2015

Guest editorial: selected papers from ICIMCS 2013.
Multim. Syst., 2015

Towards efficient support relation extraction from RGBD images.
Inf. Sci., 2015

Indoor scene understanding via monocular RGB-D images.
Inf. Sci., 2015

Light field saliency vs. 2D saliency: A comparative study.
Neurocomputing, 2015

Image classification based on low-rank matrix recovery and Naive Bayes collaborative representation.
Neurocomputing, 2015

Robust visual tracking via multi-graph ranking.
Neurocomputing, 2015

Improved seam carving combining with 3D saliency for image retargeting.
Neurocomputing, 2015

Incorporating Non-sequential Behavior into Click Models.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Gaze Shifting Kernel: Engineering Perceptually-Aware Features for Scene Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Biologically Inspired Media Quality Modeling.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Saliency Detection with a Deeper Investigation of Light Field.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Exploring feature space with semantic attributes.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Interaction part mining: A mid-level approach for fine-grained action recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

3D deep shape descriptor.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Does Vertical Bring more Satisfaction?: Predicting Search Satisfaction in a Heterogeneous Environment.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Memory recall based video search: Finding videos you have seen before based on your memory.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Touch Saliency: Characteristics and Prediction.
IEEE Trans. Multim., 2014

PicWords: Render a Picture by Packing Keywords.
IEEE Trans. Multim., 2014

Product Aspect Ranking and Its Applications.
IEEE Trans. Knowl. Data Eng., 2014

Spatial Pooling of Heterogeneous Features for Image Classification.
IEEE Trans. Image Process., 2014

3-D Object Retrieval With Hausdorff Distance Learning.
IEEE Trans. Ind. Electron., 2014

Robust Face Recognition via Adaptive Sparse Representation.
IEEE Trans. Cybern., 2014

Image Annotation by Multiple-Instance Learning With Discriminative Feature Mapping and Selection.
IEEE Trans. Cybern., 2014

Audio Matters in Visual Attention.
IEEE Trans. Circuits Syst. Video Technol., 2014

Image clustering based on sparse patch alignment framework.
Pattern Recognit., 2014

Multimedia modeling.
Inf. Sci., 2014

Image quality assessment based on matching pursuit.
Inf. Sci., 2014

Semi-supervised multi-graph hashing for scalable similarity search.
Comput. Vis. Image Underst., 2014

Image Denoising with a Unified Schattern-p Norm and ℓq Norm Regularization.
CoRR, 2014

Multifold Concept Relationships Metrics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Searching for Recent Celebrity Images in Microblog Platform.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Evaluation on the Impact of Image Quality on Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

Factorized Similarity Learning in Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

2013
Towards optimizing human labeling for interactive image tagging.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Enhancing news organization for convenient retrieval and browsing.
ACM Trans. Multim. Comput. Commun. Appl., 2013

When Amazon Meets Google: Product Visualization by Exploring Multiple Web Sources.
ACM Trans. Internet Techn., 2013

Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information.
IEEE Trans. Multim., 2013

Learning to Photograph: A Compositional Perspective.
IEEE Trans. Multim., 2013

Spectral Hashing With Semantically Consistent Graph for Image Indexing.
IEEE Trans. Multim., 2013

VideoPuzzle: Descriptive One-Shot Video Composition.
IEEE Trans. Multim., 2013

View-Based Discriminative Probabilistic Modeling for 3D Object Retrieval and Recognition.
IEEE Trans. Image Process., 2013

High-Order Local Spatial Context Modeling by Spatialized Random Forest.
IEEE Trans. Image Process., 2013

Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search.
IEEE Trans. Image Process., 2013

Detecting Group Activities With Multi-Camera Context.
IEEE Trans. Circuits Syst. Video Technol., 2013

Indexing of large-scale multimedia signals.
Signal Process., 2013

Video recommendation over multiple information sources.
Multim. Syst., 2013

Automatic cartoon matching in computer-assisted animation production.
Neurocomputing, 2013

Multimedia recommendation: technology and techniques.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

eHeritage of shadow puppetry: creation and manipulation.
Proceedings of the ACM Multimedia Conference, 2013

Online Group Feature Selection.
Proceedings of the IJCAI 2013, 2013

2012
In-video product annotation with web information mining.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Towards the taxonomy-oriented categorization of yellow pages queries.
ACM Trans. Internet Techn., 2012

Oracle in Image Search: A Content-Based Approach to Performance Prediction.
ACM Trans. Inf. Syst., 2012

Interactive Video Indexing With Statistical Active Learning.
IEEE Trans. Multim., 2012

Movie2Comics: Towards a Lively Video Content Presentation.
IEEE Trans. Multim., 2012

Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification.
IEEE Trans. Multim., 2012

Parallel Lasso for Large-Scale Video Concept Detection.
IEEE Trans. Multim., 2012

Semisupervised Multiview Distance Metric Learning for Cartoon Synthesis.
IEEE Trans. Image Process., 2012

Adaptive Hypergraph Learning and its Application in Image Classification.
IEEE Trans. Image Process., 2012

Multimodal Graph-Based Reranking for Web Image Search.
IEEE Trans. Image Process., 2012

3-D Object Retrieval and Recognition With Hypergraph Analysis.
IEEE Trans. Image Process., 2012

Intelligent photo clustering with user interaction and distance metric learning.
Pattern Recognit. Lett., 2012

Social media mining and search.
Multim. Tools Appl., 2012

Social image annotation via cross-domain subspace learning.
Multim. Tools Appl., 2012

k-Partite graph reinforcement and its application in multimedia information retrieval.
Inf. Sci., 2012

Semi-supervised distance metric learning based on local linear regression for data clustering.
Neurocomputing, 2012

Constructing visual tag dictionary by mining community-contributed media corpus.
Neurocomputing, 2012

Collaborative visual modeling for automatic image annotation via sparse model coding.
Neurocomputing, 2012

Optimizing social image search with multiple criteria: Relevance, diversity, and typicality.
Neurocomputing, 2012

Multimedia Question Answering.
IEEE Multim., 2012

Assistive tagging: A survey of multimedia tagging with human-computer joint exploration.
ACM Comput. Surv., 2012

3DMolNavi: A web-based retrieval and navigation tool for flexible molecular shape comparison.
BMC Bioinform., 2012

Modeling concept dynamics for large scale music search.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

On Video Recommendation over Social Network.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Touch saliency.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Image tag re-ranking by coupled probability transition.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Multimedia recommendation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Harvesting visual concepts for image search with complex queries.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hi, magic closet, tell me what to wear!
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Binary SIFT: towards efficient feature matching verification for image search.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Robust Non-negative Graph Embedding: Towards noisy data, unreliable graphs, and noisy labels.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Text Mining in Multimedia.
Proceedings of the Mining Text Data, 2012

2011
Video accessibility enhancement for hearing-impaired users.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Toward the Optimization of Normalized Graph Laplacian.
IEEE Trans. Neural Networks, 2011

Utilizing Related Samples to Enhance Interactive Concept-Based Video Search.
IEEE Trans. Multim., 2011

Tag Tagging: Towards More Descriptive Keywords of Image Content.
IEEE Trans. Multim., 2011

Semi-Automatic Tagging of Photo Albums via Exemplar Selection and Tag Inference.
IEEE Trans. Multim., 2011

Less is More: Efficient 3-D Object Retrieval With Query View Selection.
IEEE Trans. Multim., 2011

Active learning in multimedia annotation and retrieval: A survey.
ACM Trans. Intell. Syst. Technol., 2011

Assemble New Object Detector With Few Examples.
IEEE Trans. Image Process., 2011

3D model retrieval using weighted bipartite graph matching.
Signal Process. Image Commun., 2011

Semi-automatic cartoon generation by motion planning.
Multim. Syst., 2011

Interactive multimedia computing.
Multim. Syst., 2011

VisionGo: Towards video retrieval with joint exploration of human and computer.
Inf. Sci., 2011

Real-Time Video Copy-Location Detection in Large-Scale Repositories.
IEEE Multim., 2011

Hierarchical organization of unstructured consumer reviews.
Proceedings of the 20th International Conference on World Wide Web, 2011

Multimedia answering: enriching text QA with media information.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Product comparison using comparative relations.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Optimizing multimodal reranking for web image search.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Integrating rich information for video recommendation with multi-task rank aggregation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning concept bundles for video search with complex queries.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Semantic point detector.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Multimedia tagging: past, present and future.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

News contextualization with geographic and visual information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Tag-based social image search with visual-text joint hypergraph learning.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Locally regressive G-optimal design for image retrieval.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

ShotTagger: tag location for internet videos.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

An online video recommendation framework using rich information.
Proceedings of the ICIMCS 2011, 2011

Probabilistic indexing of media sequences.
Proceedings of the ICIMCS 2011, 2011

Predicting occupation via human clothing and contexts.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Tag-Based Social Image Search: Toward Relevant and Diverse Results.
Proceedings of the Social Media Modeling and Computing, 2011

2010
Joint Learning of Labels and Distance Metric.
IEEE Trans. Syst. Man Cybern. Part B, 2010

Visual query suggestion: Towards capturing user intent in internet image search.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Towards a Relevant and Diverse Search of Social Images.
IEEE Trans. Multim., 2010

In-Image Accessibility Indication.
IEEE Trans. Multim., 2010

Accessible image search for colorblindness.
ACM Trans. Intell. Syst. Technol., 2010

Typicality-Based Visual Search Reranking.
IEEE Trans. Circuits Syst. Video Technol., 2010

Large-scale image and video search: Challenges, technologies, and trends.
J. Vis. Commun. Image Represent., 2010

Metric learning with feature decomposition for image categorization.
Neurocomputing, 2010

Retagging social images based on visual and semantic consistency.
Proceedings of the 19th International Conference on World Wide Web, 2010

Semi-automatic photo clustering with distance metric learning.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Social Image Search with Diverse Relevance Ranking.
Proceedings of the Advances in Multimedia Modeling, 2010

Tagging tags.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image retagging.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Movie2Comics: a feast of multimedia artwork.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Dynamic captioning: video accessibility enhancement for hearing impairment.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

iComics: automatic conversion of movie into comics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Intelligent query: open another door to 3d object retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Efficient video duplicate detection via compact curve matching.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Audio-visual speaker identification with multi-view distance metric learning.
Proceedings of the International Conference on Image Processing, 2010

In-sequence video duplicate detection with fast point-to-line matching.
Proceedings of the International Conference on Image Processing, 2010

2009
Video Content Structure.
Proceedings of the Encyclopedia of Database Systems, 2009

Correlative Linear Neighborhood Propagation for Video Annotation.
IEEE Trans. Syst. Man Cybern. Part B, 2009

Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation.
IEEE Trans. Multim., 2009

Unified Video Annotation via Multigraph Learning.
IEEE Trans. Circuits Syst. Video Technol., 2009

Video semantic analysis based on structure-sensitive anisotropic manifold ranking.
Signal Process., 2009

Semi-supervised kernel density estimation for video annotation.
Comput. Vis. Image Underst., 2009

Tag ranking.
Proceedings of the 18th International Conference on World Wide Web, 2009

Concept-Dependent Image Annotation via Existence-Based Multiple-Instance Learning.
Proceedings of the IEEE International Conference on Systems, 2009

Concept representation based video indexing.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Accommodating colorblind users in image search.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Visual query suggestion.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Accessible image search.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Smart batch tagging of photo albums.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Robust Distance Metric Learning with Auxiliary Knowledge.
Proceedings of the IJCAI 2009, 2009

Active tagging for image indexing.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Efficient image and video re-coloring for colorblindness.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Tag quality improvement for social images.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boost search relevance for tag-based social image retrieval.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset.
Proceedings of the ICDM Workshops 2009, 2009

Active Video Annotation.
Proceedings of the Semantic Mining Technologies for Multimedia Databases, 2009

Image/Video Semantic Analysis by Semi-Supervised Learning.
Proceedings of the Semantic Mining Technologies for Multimedia Databases, 2009

Adaptive Animation of Human Motion for E-Learning Applications.
Proceedings of the Methods and Applications for Advancing Distance Education Technologies, 2009

2008
Correlative multilabel video annotation with temporal kernels.
ACM Trans. Multim. Comput. Commun. Appl., 2008

MSRA at TRECVID 2008: High-Level Feature Extraction and Automatic Search.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Study on the combination of video concept detectors.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

2007
Interactive Video Annotation by Multi-Concept Multi-Modality Active Learning.
Int. J. Semantic Comput., 2007

Adaptive Animation of Human Motion for E-Learning Applications.
Int. J. Distance Educ. Technol., 2007

MSRA-USTC-SJTU at TRECVID 2007: High-Level Feature Extraction and Search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Multi-Concept Multi-Modality Active Learning for Interactive Video Annotation.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

An Efficient Automatic Video Shot Size Annotation Scheme.
Proceedings of the Advances in Multimedia Modeling, 2007

Video annotation by graph-based learning with neighborhood similarity.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Optimizing multi-graph learning: towards a unified video annotation scheme.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Structure-sensitive manifold ranking for video concept detection.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

A Novel Multiple Instance Learning Approach for Image Retrieval Based on Adaboost Feature Selection.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Multi-Graph Semi-Supervised Learning for Video Semantic Feature Extraction.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Lazy Learning Based Efficient Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Salience Preserving Multi-Focus Image Fusion.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

An Interactive Video Annotation Framework with Multiple Modalities.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Microsoft Research Asia TRECVID 2006 High-Level Feature Extraction and Rushes Exploitation.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Geometry shadow maps.
Proceedings of the 22nd Spring Conference on Computer Graphics, 2006

Manifold-ranking based video concept detection on large database and feature pool.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Automatic video annotation by semi-supervised learning with kernel density estimation.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Efficient semantic annotation method for indexing large personal video database.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

Automatic video annotation based on co-adaptation and label correction.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Enhanced Semi-Supervised Learning for Automatic Video Annotation.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Semi-Supervised Kernel Regression.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

An Automatic Video Semantic Annotation Scheme Based on Combination of Complementary Predictors.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Semi-automatic video annotation based on active learning with multiple complementary predictors.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

