Richang Hong

ORCID: 0000-0001-5461-3986

According to our database, Richang Hong authored at least 378 papers between 2007 and 2025.

Bibliography

2025
Optimizing low-rank adaptation with decomposed matrices and adaptive rank allocation.
Frontiers Comput. Sci., May, 2025

2024
Hyperbolic Graph Learning for Social Recommendation.
IEEE Trans. Knowl. Data Eng., December, 2024

Multimodal Graph Causal Embedding for Multimedia-Based Recommendation.
IEEE Trans. Knowl. Data Eng., December, 2024

Exploring and exploiting model uncertainty for robust visual question answering.
Multim. Syst., December, 2024

Special Issue on Conversational Information Seeking.
ACM Trans. Web, November, 2024

Learning Hierarchical Visual Transformation for Domain Generalizable Visual Matching and Recognition.
Int. J. Comput. Vis., November, 2024

Asymmetric Deformable Spatio-temporal Framework for Infrared Object Tracking.
ACM Trans. Multim. Comput. Commun. Appl., October, 2024

Dual Graph Neural Networks for Dynamic Users' Behavior Prediction on Social Networking Services.
IEEE Trans. Comput. Soc. Syst., October, 2024

Average User-Side Counterfactual Fairness for Collaborative Filtering.
ACM Trans. Inf. Syst., September, 2024

Math Word Problem Generation via Disentangled Memory Retrieval.
ACM Trans. Knowl. Discov. Data, June, 2024

One-Bit Supervision for Image Classification: Problem, Solution, and Beyond.
ACM Trans. Multim. Comput. Commun. Appl., April, 2024

Stage-Wise Magnitude-Based Pruning for Recurrent Neural Networks.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

FTCM: Frequency-Temporal Collaborative Module for Efficient 3D Human Pose Estimation in Video.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

Multimodal Hierarchical Graph Collaborative Filtering for Multimedia-Based Recommendation.
IEEE Trans. Comput. Soc. Syst., February, 2024

A Prompt-Based Topic-Modeling Method for Depression Detection on Low-Resource Data.
IEEE Trans. Comput. Soc. Syst., February, 2024

FRC-Net: A Simple Yet Effective Architecture for Low-Light Image Enhancement.
IEEE Trans. Consumer Electron., February, 2024

Group Multi-View Transformer for 3D Shape Analysis With Spatial Encoding.
IEEE Trans. Multim., 2024

DSIS-DPR: Structured Instance Segmentation and Diffusion Prior Refinement for Dental Anatomy Learning.
IEEE Trans. Multim., 2024

Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection.
IEEE Trans. Multim., 2024

Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network.
IEEE Trans. Multim., 2024

Iterative Adversarial Attack on Image-Guided Story Ending Generation.
IEEE Trans. Multim., 2024

Two-Step Discrete Hashing for Cross-Modal Retrieval.
IEEE Trans. Multim., 2024

Partial-Tuning Based Mixed-Modal Prototypes for Few-Shot Classification.
IEEE Trans. Multim., 2024

Embedded Heterogeneous Attention Transformer for Cross-Lingual Image Captioning.
IEEE Trans. Multim., 2024

SYRER: Synergistic Relational Reasoning for RGB-D Cross-Modal Re-Identification.
IEEE Trans. Multim., 2024

Semi-Supervised Domain Adaptation for Major Depressive Disorder Detection.
IEEE Trans. Multim., 2024

Active Factor Graph Network for Group Activity Recognition.
IEEE Trans. Image Process., 2024

Decomposing Relationship from 1-to-N into N 1-to-1 for Text-Video Retrieval.
CoRR, 2024

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration.
CoRR, 2024

InterMind: A Doctor-Patient-Family Interactive Depression Assessment System Empowered by Large Language Models.
CoRR, 2024

UniLearn: Enhancing Dynamic Facial Expression Recognition through Unified Pre-Training and Fine-Tuning on Images and Videos.
CoRR, 2024

Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations.
CoRR, 2024

SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement.
CoRR, 2024

Towards Multimodal Emotional Support Conversation Systems.
CoRR, 2024

Controllable Relation Disentanglement for Few-Shot Class-Incremental Learning.
CoRR, 2024

Audio-Infused Automatic Image Colorization by Exploiting Audio Scene Semantics.
CoRR, 2024

Label-aware debiased causal reasoning for Natural Language Inference.
AI Open, 2024

Multimodality Invariant Learning for Multimedia-Based New Item Recommendation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Cascade Large Language Model via In-Context Learning for Depression Detection on Chinese Social Media.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

AtomTool: Empowering Large Language Models with Tool Utilization Skills.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Fine-grained Feature Assisted Cross-modal Image-text Retrieval.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Exploring Robust Face-Voice Matching in Multilingual Environments.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Pseudo Content Hallucination for Unpaired Image Captioning.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Path-Specific Causal Reasoning for Fairness-aware Cognitive Diagnosis.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Graph Bottlenecked Social Recommendation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Double Correction Framework for Denoising Recommendation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Gradient-Aware Logit Adjustment Loss for Long-Tailed Classifier.
Proceedings of the IEEE International Conference on Acoustics, 2024

Few-Shot Learner Parameterization by Diffusion Time-Steps.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Doubly Abductive Counterfactual Inference for Text-Based Image Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MS²-GNN: Exploring GNN-Based Multimodal Fusion Network for Depression Detection.
IEEE Trans. Cybern., December, 2023

Toward Optimal Real-Time Volumetric Video Streaming: A Rolling Optimization and Deep Reinforcement Learning Based Approach.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

TIRA: Truth Inference via Reliability Aggregation on Object-Source Graph.
IEEE Trans. Knowl. Data Eng., November, 2023

Contrastive Video Question Answering via Video Graph Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Hierarchical Multifeature Fusion via Audio-Response-Level Modeling for Depression Detection.
IEEE Trans. Comput. Soc. Syst., October, 2023

Automatic Depression Detection via Learning and Fusing Features From Visual Cues.
IEEE Trans. Comput. Soc. Syst., October, 2023

Few-Shot Partial Multi-View Learning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Efficient and self-adaptive rationale knowledge base for visual commonsense reasoning.
Multim. Syst., October, 2023

LipFormer: Learning to Lipread Unseen Speakers Based on Visual-Landmark Transformers.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Crowdsourcing Truth Inference via Reliability-Driven Multi-View Graph Embedding.
ACM Trans. Knowl. Discov. Data, June, 2023

MEGCF: Multimodal Entity Graph Collaborative Filtering for Personalized Recommendation.
ACM Trans. Inf. Syst., April, 2023

Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering.
IEEE Trans. Neural Networks Learn. Syst., March, 2023

LCSNet: End-to-end Lipreading with Channel-aware Feature Selection.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

Joint Multi-Grained Popularity-Aware Graph Convolution Collaborative Filtering for Recommendation.
IEEE Trans. Comput. Soc. Syst., February, 2023

Generative Metric Learning for Adversarially Robust Open-world Person Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Global Temporal Difference Network for Action Recognition.
IEEE Trans. Multim., 2023

A Text-Guided Generation and Refinement Model for Image Captioning.
IEEE Trans. Multim., 2023

MLP-JCG: Multi-Layer Perceptron With Joint-Coordinate Gating for Efficient 3D Human Pose Estimation.
IEEE Trans. Multim., 2023

Causal Interventional Training for Image Recognition.
IEEE Trans. Multim., 2023

Multimodal Graph Contrastive Learning for Multimedia-Based Recommendation.
IEEE Trans. Multim., 2023

Optimal Volumetric Video Streaming With Hybrid Saliency Based Tiling.
IEEE Trans. Multim., 2023

From Static to Dynamic: Adapting Landmark-Aware Image Models for Facial Expression Recognition in Videos.
CoRR, 2023

Deep Ensembles Meets Quantile Regression: Uncertainty-aware Imputation for Time Series.
CoRR, 2023

Viewport Prediction for Volumetric Video Streaming by Exploring Video Saliency and Trajectory Information.
CoRR, 2023

Clarity ChatGPT: An Interactive and Adaptive Processing System for Image Restoration and Enhancement.
CoRR, 2023

Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering.
CoRR, 2023

Exploring Transferability of Multimodal Adversarial Samples for Vision-Language Pre-training Models with Contrastive Learning.
CoRR, 2023

Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning.
CoRR, 2023

Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation.
CoRR, 2023

Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media.
CoRR, 2023

Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers.
CoRR, 2023

Improving Recommendation Fairness via Data Augmentation.
Proceedings of the ACM Web Conference 2023, 2023

Generative-Contrastive Graph Learning for Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Topic-enhanced Graph Neural Networks for Extraction-based Explainable Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Multimodal Counterfactual Learning Network for Multimedia-based Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Low-Light Image Enhancement Based on Mutual Guidance Between Enhancing Strength and Image Appearance.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Disentangling Cognitive Diagnosis with Limited Exercise Labels.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in the Dark.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unlearnable Examples Give a False Sense of Security: Piercing through Unexploitable Data with Learnable Examples.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

GoRec: A Generative Cold-start Recommendation Framework.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Grid Feature Jigsaw for Self-supervised Image Clustering.
Proceedings of the International Joint Conference on Neural Networks, 2023

Dual Video Summarization: From Frames to Captions.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CITE: Compact Interactive TransformEr for Multilingual Image Captioning.
Proceedings of the 6th International Conference on Image and Graphics Processing, 2023

How to Use Language Expert to Assist Inference for Visual Commonsense Reasoning.
Proceedings of the IEEE International Conference on Data Mining, 2023

Adaptive Student Inference Network for Efficient Single Image Super-Resolution.
Proceedings of the IEEE International Conference on Data Mining, 2023

3D Human Pose Estimation with Spatio-Temporal Criss-Cross Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Data-Free Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fair Representation Learning for Recommendation: A Mutual Information Perspective.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Rethinking Data-Free Quantization as a Zero-Sum Game.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Inner Knowledge-based Img2Doc Scheme for Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Revisiting Local Descriptor for Improved Few-Shot Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Graph-Based Multimodal Sequential Embedding for Sign Language Translation.
IEEE Trans. Multim., 2022

Deep Shape-Aware Person Re-Identification for Overcoming Moderate Clothing Changes.
IEEE Trans. Multim., 2022

Unpaired Image Captioning With Semantic-Constrained Self-Learning.
IEEE Trans. Multim., 2022

DiffNet++: A Neural Influence and Interest Diffusion Network for Social Recommendation.
IEEE Trans. Knowl. Data Eng., 2022

Hierarchical Prototype Refinement With Progressive Inter-Categorical Discrimination Maximization for Few-Shot Learning.
IEEE Trans. Image Process., 2022

Exploring Self-Attention Graph Pooling With EEG-Based Topological Structure and Soft Label for Depression Detection.
IEEE Trans. Affect. Comput., 2022

Learning to Compose and Reason with Language Tree Structures for Visual Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Special issue on cross-modal retrieval and analysis.
Int. J. Multim. Inf. Retr., 2022

RGCF: Refined graph convolution collaborative filtering with concise and expressive embedding.
Intell. Data Anal., 2022

Stereo Image Rain Removal via Dual-View Mutual Attention.
CoRR, 2022

Seeing Through The Noisy Dark: Toward Real-world Low-Light Image Enhancement and Denoising.
CoRR, 2022

A Topic-Attentive Transformer-based Model For Multimodal Depression Detection.
CoRR, 2022

ClusterQ: Semantic Feature Distribution Alignment for Data-Free Quantization.
CoRR, 2022

A Review-aware Graph Contrastive Learning Framework for Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Math Word Problem Generation with Memory Retrieval.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

One-Stage Image Inpainting with Hybrid Attention.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

CRNet: Unsupervised Color Retention Network for Blind Motion Deblurring.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SGINet: Toward Sufficient Interaction Between Single Image Deraining and Semantic Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Robust Attention Deraining Network for Synchronous Rain Streaks and Raindrops Removal.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Differentiable Cross-modal Hashing via Multimodal Transformers.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Robust Low-Rank Convolution Network for Image Denoising.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

OCR-oriented Master Object for Text Image Captioning.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Multi-scale Spatial Representation Learning via Recursive Hermite Polynomial Networks.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization.
Proceedings of the IEEE International Conference on Data Mining, 2022

Self-Supervised Cross Domain Social Recommendation.
Proceedings of the ICCAI '22: 8th International Conference on Computing and Artificial Intelligence, Tianjin, China, March 18, 2022

Switchable Online Knowledge Distillation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Vibration-Based Uncertainty Estimation for Learning from Limited Supervision.
Proceedings of the Computer Vision - ECCV 2022, 2022

Deep Color Consistent Network for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Collaborative Neural Social Recommendation.
IEEE Trans. Syst. Man Cybern. Syst., 2021

Deep Attributed Network Embedding by Preserving Structure and Attribute Information.
IEEE Trans. Syst. Man Cybern. Syst., 2021

RMoR-Aion: Robust Multioutput Regression by Simultaneously Alleviating Input and Output Noises.
IEEE Trans. Neural Networks Learn. Syst., 2021

Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks.
IEEE Trans. Multim., 2021

Visual Question Answering With Dense Inter- and Intra-Modality Interactions.
IEEE Trans. Multim., 2021

DCR: A Unified Framework for Holistic/Partial Person ReID.
IEEE Trans. Multim., 2021

Exploiting Subspace Relation in Semantic Labels for Cross-Modal Hashing.
IEEE Trans. Knowl. Data Eng., 2021

Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition.
IEEE Trans. Image Process., 2021

Graph-Based Multi-Interaction Network for Video Question Answering.
IEEE Trans. Image Process., 2021

Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning.
IEEE Trans. Image Process., 2021

A lightweight multi-scale aggregated model for detecting aerial images captured by UAVs.
J. Vis. Commun. Image Represent., 2021

Random walk based distributed representation learning and prediction on Social Networking Services.
Inf. Sci., 2021

Multimodal Semantics-Based Supervised Latent Dirichlet Allocation for Event Classification.
IEEE Multim., 2021

Few-shot Learning with Global Relatedness Decoupled-Distillation.
CoRR, 2021

MCGNet: Partial Multi-view Few-shot Learning via Meta-alignment and Context Gated-aggregation.
CoRR, 2021

Revisiting Deep Local Descriptor for Improved Few-Shot Classification.
CoRR, 2021

Learning Fair Representations for Bipartite Graph based Recommendation.
CoRR, 2021

Deep Adversarial Inconsistent Cognitive Sampling for Multi-view Progressive Subspace Clustering.
CoRR, 2021

Learning Fair Representations for Recommendation: A Graph-based Perspective.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Enhanced Graph Learning for Collaborative Filtering via Mutual Information Maximization.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Privileged Graph Distillation for Cold Start Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Set2setRank: Collaborative Set to Set Ranking for Implicit Feedback based Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

NASTER: Non-local Attentional Scene Text Recognizer.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

2020
Video Retrieval with Similarity-Preserving Deep Temporal Hashing.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Fast Matrix Factorization With Nonuniform Weights on Missing Data.
IEEE Trans. Neural Networks Learn. Syst., 2020

Knowledge-Based Topic Model for Multi-Modal Social Event Analysis.
IEEE Trans. Multim., 2020

A Hierarchical Attention Model for Social Contextual Image Recommendation.
IEEE Trans. Knowl. Data Eng., 2020

Cross-Domain Sentiment Encoding through Stochastic Word Embedding.
IEEE Trans. Knowl. Data Eng., 2020

Deep Neighborhood Component Analysis for Visual Similarity Modeling.
ACM Trans. Intell. Syst. Technol., 2020

A Joint Neural Model for User Behavior Prediction on Social Networking Platforms.
ACM Trans. Intell. Syst. Technol., 2020

Group-Group Loss-Based Global-Regional Feature Learning for Vehicle Re-Identification.
IEEE Trans. Image Process., 2020

Joint Subspace Recovery and Enhanced Locality Driven Robust Flexible Discriminative Dictionary Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Cross-Entropy Adversarial View Adaptation for Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Movie Question Answering via Textual Memory and Plot Graph.
IEEE Trans. Circuits Syst. Video Technol., 2020

Gated CNN: Integrating multi-scale feature layers for object detection.
Pattern Recognit., 2020

Advance on large scale near-duplicate video retrieval.
Frontiers Comput. Sci., 2020

Bi-direction Context Propagation Network for Real-time Semantic Segmentation.
CoRR, 2020

The Balanced Loss Curriculum Learning.
IEEE Access, 2020

Dual Learning for Explainable Recommendation: Towards Unifying User Preference Prediction and Review Generation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Joint Item Recommendation and Attribute Inference: An Adaptive Graph Convolutional Network Approach.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Learning to Transfer Graph Embeddings for Inductive Graph based Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

One-bit Supervision for Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Joint Sketch-Attribute Learning for Fine-Grained Face Synthesis.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

WFN-PSC: weighted-fusion network with poly-scale convolution for image dehazing.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Dual Context-Aware Refinement Network for Person Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Memory-Augmented Relation Network for Few-Shot Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Text-Guided Graph Structure for Image Captioning.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Real-World Person Re-Identification via Degradation Invariance Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Creating Something From Nothing: Unsupervised Knowledge Distillation for Cross-Modal Hashing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Revisiting Graph Based Collaborative Filtering: A Linear Residual Graph Convolutional Network Approach.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Correction to: On fusing the latent deep CNN feature for image classification.
World Wide Web, 2019

On fusing the latent deep CNN feature for image classification.
World Wide Web, 2019

Eigenvector-Based Distance Metric Learning for Image Classification and Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Cross-Modality Feature Learning via Convolutional Autoencoder.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Introduction to the Special Issue on the Cross-Media Analysis for Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Table of Contents: Online Supplement Volume 15, Number 2s.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Deep Item-based Collaborative Filtering for Top-N Recommendation.
ACM Trans. Inf. Syst., 2019

Deep Representation Learning With Part Loss for Person Re-Identification.
IEEE Trans. Image Process., 2019

Collective Reconstructive Embeddings for Cross-Modal Hashing.
IEEE Trans. Image Process., 2019

Anomaly-Tolerant Network Traffic Estimation via Noise-Immune Temporal Matrix Completion Model.
IEEE J. Sel. Areas Commun., 2019

Diversifying Inference Path Selection: Moving-Mobile-Network for Landmark Recognition.
CoRR, 2019

A Neural Influence Diffusion Model for Social Recommendation.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Efficient Graph Based Multi-view Learning.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Robust Subspace Discovery by Block-diagonal Adaptive Locality-constrained Representation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Single-shot Semantic Image Inpainting with Densely Connected Generative Networks.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multimodal Dialog System: Generating Responses via Adaptive Decoders.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Erasing-based Attention Learning for Visual Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Spatial Pyramid Features Collaborative Reconstruction for Partial Person ReID.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Personalized Multimedia Item and Key Frame Recommendation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Densely Connected Attention Flow for Visual Question Answering.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

A Coarse-to-Fine Multi-stream Hybrid Deraining Network for Single Image Deraining.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Adaptive Transfer Network for Cross-Domain Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Product Adoption Rate Prediction in a Competitive Market.
IEEE Trans. Knowl. Data Eng., 2018

Efficient Correlation Tracking via Center-Biased Spatial Regularization.
IEEE Trans. Image Process., 2018

Sequential Video VLAD: Training the Aggregation Locally and Temporally.
IEEE Trans. Image Process., 2018

Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder.
IEEE Trans. Image Process., 2018

Pooling the Convolutional Layers in Deep ConvNets for Video Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2018

Guest Editorial: Query understanding.
Neurocomputing, 2018

pDisVPL: Probabilistic Discriminative Visual Part Learning for Image Classification.
IEEE Multim., 2018

Fast Matrix Factorization with Non-Uniform Weights on Missing Data.
CoRR, 2018

SocialGCN: An Efficient Graph Convolutional Network based Model for Social Recommendation.
CoRR, 2018

Explainable Social Contextual Image Recommendation with Hierarchical Attention.
CoRR, 2018

IGCV2: Interleaved Structured Sparse Convolutional Neural Networks.
CoRR, 2018

Attentive Group Recommendation.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Enhancing Low-Light Images with JPEG Artifact Based on Image Decomposition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Knowledge-aware Multimodal Fashion Chatbot.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Knowledge-aware Multimodal Dialogue Systems.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Quality Matters: Assessing cQA Pair Quality via Transductive Multi-View Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Online Filter Weakening and Pruning for Efficient Convnets.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Online Filter Clustering and Pruning for Efficient Convnets.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Facial Expression Recognition with Data Augmentation and Compact Feature Learning.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Interleaved Structured Sparse Convolutional Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Cue Correlation Filters for Robust Visual Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Enhanced Text-Guided Attention Model for Image Captioning.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Single Low-Light Image Enhancement by Fusing Multiple Sources.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Movie Question Answering: Remembering the Textual Cues for Layered Visual Contents.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Enhancing Person Re-identification in a Self-Trained Subspace.
ACM Trans. Multim. Comput. Commun. Appl., 2017

Inverse Sparse Group Lasso Model for Robust Object Tracking.
IEEE Trans. Multim., 2017

VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks.
IEEE Trans. Multim., 2017

Image Location Inference by Multisaliency Enhancement.
IEEE Trans. Multim., 2017

Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval.
IEEE Trans. Multim., 2017

Modeling the Evolution of Users' Preferences and Social Links in Social Networking Services.
IEEE Trans. Knowl. Data Eng., 2017

User Vitality Ranking and Prediction in Social Networking Services: A Dynamic Network Perspective.
IEEE Trans. Knowl. Data Eng., 2017

Learning User Attributes via Mobile Social Multimedia Analytics.
ACM Trans. Intell. Syst. Technol., 2017

Visual Classification of Furniture Styles.
ACM Trans. Intell. Syst. Technol., 2017

Augmented Collaborative Filtering for Sparseness Reduction in Personalized POI Recommendation.
ACM Trans. Intell. Syst. Technol., 2017

Fast and Orthogonal Locality Preserving Projections for Dimensionality Reduction.
IEEE Trans. Image Process., 2017

Facial Age Estimation With Age Difference.
IEEE Trans. Image Process., 2017

Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud.
IEEE Trans. Image Process., 2017

Unsupervised t-Distributed Video Hashing and Its Deep Hashing Extension.
IEEE Trans. Image Process., 2017

Perceptually Guided Photo Retargeting.
IEEE Trans. Cybern., 2017

Dual graph-regularized Constrained Nonnegative Matrix Factorization for Image Clustering.
KSII Trans. Internet Inf. Syst., 2017

Editorial: Good practices in multimedia modeling.
Neurocomputing, 2017

Computational Social Indicators: A Case Study of Chinese University Ranking.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Generating Chinese Poems from Images Based on Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Deep Graph Laplacian Hashing for Image Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Enhancing Micro-video Understanding by Harnessing External Sounds.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Image Categorization by Learning a Propagated Graphlet Path.
IEEE Trans. Neural Networks Learn. Syst., 2016

Flickr Circles: Aesthetic Tendency Discovery by Multi-View Regularized Topic Modeling.
IEEE Trans. Multim., 2016

Detecting Densely Distributed Graph Patterns for Fine-Grained Image Categorization.
IEEE Trans. Image Process., 2016

Beyond Object Proposals: Random Crop Pooling for Multi-Label Image Recognition.
IEEE Trans. Image Process., 2016

Enhancing Sketch-Based Image Retrieval by Re-Ranking and Relevance Feedback.
IEEE Trans. Image Process., 2016

Unified Photo Enhancement by Discovering Aesthetic Communities From Flickr.
IEEE Trans. Image Process., 2016

Multi-View Object Retrieval via Multi-Scale Topic Models.
IEEE Trans. Image Process., 2016

An Efficient Tracking System by Orthogonalized Templates.
IEEE Trans. Ind. Electron., 2016

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.
IEEE Trans. Cybern., 2016

Weakly Supervised Multilabel Clustering and its Applications in Computer Vision.
IEEE Trans. Cybern., 2016

Perceptual Attributes Optimization for Multivideo Summarization.
IEEE Trans. Cybern., 2016

A Biologically Inspired Automatic System for Media Quality Assessment.
IEEE Trans. Autom. Sci. Eng., 2016

Image detail enhancement with spatially guided filters.
Signal Process., 2016

Spatially guided local Laplacian filter for nature image detail enhancement.
Multim. Tools Appl., 2016

Image Classification Algorithm Based on Low Rank and Sparse Decomposition and Collaborative Representation.
计算机科学 (Computer Science), 2016

Multicolumn Bidirectional Long Short-Term Memory for Mobile Devices-Based Human Activity Recognition.
IEEE Internet Things J., 2016

Visual summarization of image collections by fast RANSAC.
Neurocomputing, 2016

Recent developments on deep big vision.
Neurocomputing, 2016

Dimensionality reduction on Anchorgraph with an efficient Locality Preserving Projection.
Neurocomputing, 2016

Learning content-social influential features for influence analysis.
Int. J. Multim. Inf. Retr., 2016

Collaborative Sparse Coding for Multiview Action Recognition.
IEEE Multim., 2016

Scale-Aware Spatially Guided Mapping.
IEEE Multim., 2016

A Spatial-Temporal Probabilistic Matrix Factorization Model for Point-of-Interest Recommendation.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Retrieving Images by Multiple Samples via Fusing Deep Features.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Mental Visual Indexing: Towards Fast Video Browsing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

An Intention-Aware Interactive System for Mobile Video Browsing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

A Relaxed Ranking-Based Factor Model for Recommender System from Implicit Feedback.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Image Classification via fusing the latent deep CNN feature.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

Cascaded Interactional Targeting Network for Egocentric Video Analysis.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Probabilistic Skimlets Fusion for Summarizing Multiple Consumer Landmark Videos.
IEEE Trans. Multim., 2015

Uniting Keypoints: Local Visual Information Fusion for Large-Scale Image Search.
IEEE Trans. Multim., 2015

Understanding Blooming Human Groups in Social Networks.
IEEE Trans. Multim., 2015

Visual Understanding with RGB-D Sensors: An Introduction to the Special Issue.
ACM Trans. Intell. Syst. Technol., 2015

BSIFT: Toward Data-Independent Codebook for Large Scale Image Search.
IEEE Trans. Image Process., 2015

Full-Space Local Topology Extraction for Cross-Modal Retrieval.
IEEE Trans. Image Process., 2015

Compact and Discriminative Descriptor Inference Using Multi-Cues.
IEEE Trans. Image Process., 2015

Image Annotation by Latent Community Detection and Multikernel Learning.
IEEE Trans. Image Process., 2015

An Automatic Three-Dimensional Scene Reconstruction System Using Crowdsourced Geo-Tagged Videos.
IEEE Trans. Ind. Electron., 2015

Crowded Scene Analysis: A Survey.
IEEE Trans. Circuits Syst. Video Technol., 2015

Learning Visual Semantic Relationships for Efficient Visual Retrieval.
IEEE Trans. Big Data, 2015

Camouflage texture evaluation using a saliency map.
Multim. Syst., 2015

Towards efficient support relation extraction from RGBD images.
Inf. Sci., 2015

Pooling the Convolutional Layers in Deep ConvNets for Action Recognition.
CoRR, 2015

Gaze Shifting Kernel: Engineering Perceptually-Aware Features for Scene Categorization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Biologically Inspired Media Quality Modeling.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Cross-Domain Collaborative Learning in Social Multimedia.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Image Classification and Retrieval are ONE.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Regularizing Flat Latent Variables with Hierarchical Structures.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Exploring feature space with semantic attributes.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Lace fabric image retrieval based on multi-scale and rotation invariant LBP.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Generative Models for Mining Latent Aspects and Their Ratings from Short Reviews.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Point-of-Interest Recommender Systems: A Separate-Space Perspective.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Interaction part mining: A mid-level approach for fine-grained action recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
PicWords: Render a Picture by Packing Keywords.
IEEE Trans. Multim., 2014

Spectral-Spatial Constraint Hyperspectral Image Classification.
IEEE Trans. Geosci. Remote. Sens., 2014

Image Annotation by Multiple-Instance Learning With Discriminative Feature Mapping and Selection.
IEEE Trans. Cybern., 2014

Image clustering based on sparse patch alignment framework.
Pattern Recognit., 2014

Special issue on contextual vision computing.
Mach. Vis. Appl., 2014

Image quality assessment based on matching pursuit.
Inf. Sci., 2014

Directional projection based image fusion quality metric.
Inf. Sci., 2014

On improving behavior subtraction.
Proceedings of the 2014 IEEE International Conference on Systems, Man, and Cybernetics, 2014

Multifold Concept Relationships Metrics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Searching for Recent Celebrity Images in Microblog Platform.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Evaluation on the Impact of Image Quality on Image Retrieval.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

2013
General Subspace Learning With Corrupted Training Data Via Graph Embedding.
IEEE Trans. Image Process., 2013

Accurate Estimation of Human Body Orientation From RGB-D Sensors.
IEEE Trans. Cybern., 2013

Marginalized multi-layer multi-instance kernel for video concept detection.
Signal Process., 2013

Multimedia encyclopedia construction by mining web knowledge.
Signal Process., 2013

Video recommendation over multiple information sources.
Multim. Syst., 2013

Texture-adaptive hole-filling algorithm in raster-order for three-dimensional video applications.
Neurocomputing, 2013

Advertising object in web videos.
Neurocomputing, 2013

eHeritage of shadow puppetry: creation and manipulation.
Proceedings of the ACM Multimedia Conference, 2013

Image matching by fast random sample consensus.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Hierarchical Part Matching for Fine-Grained Visual Categorization.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
In-video product annotation with web information mining.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Interactive Video Indexing With Statistical Active Learning.
IEEE Trans. Multim., 2012

Movie2Comics: Towards a Lively Video Content Presentation.
IEEE Trans. Multim., 2012

Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification.
IEEE Trans. Multim., 2012

Camera Constraint-Free View-Based 3-D Object Retrieval.
IEEE Trans. Image Process., 2012

Learning from social media network.
Neurocomputing, 2012

Multimedia Question Answering.
IEEE Multim., 2012

On Video Recommendation over Social Network.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

View-based 3D object retrieval by bipartite graph matching.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Harvesting visual concepts for image search with complex queries.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
Video accessibility enhancement for hearing-impaired users.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Beyond search: Event-driven summarization for web videos.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images.
ACM Trans. Intell. Syst. Technol., 2011

Multiple feature hashing for real-time large scale near-duplicate video retrieval.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Videoader: a video advertising system based on intelligent analysis of visual content.
Proceedings of the ICIMCS 2011, 2011

2010
Joint Learning of Labels and Distance Metric.
IEEE Trans. Syst. Man Cybern. Part B, 2010

Multimedia Question Answering.
Scholarpedia, 2010

Question Answering over Community-Contributed Web Videos.
IEEE Multim., 2010

Estimating Poses of World's Photos with Geographic Metadata.
Proceedings of the Advances in Multimedia Modeling, 2010

Learning Cooking Techniques from YouTube.
Proceedings of the Advances in Multimedia Modeling, 2010

Mediapedia: Mining Web Knowledge to Construct Multimedia Encyclopedia.
Proceedings of the Advances in Multimedia Modeling, 2010

Video Reference: A Video Question Answering Engine.
Proceedings of the Advances in Multimedia Modeling, 2010

Movie2Comics: a feast of multimedia artwork.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Dynamic captioning: video accessibility enhancement for hearing impairment.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

iComics: automatic conversion of movie into comics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

W2Go: a travel guidance system by automatic landmark ranking.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Exploring large scale data for multimedia QA: an initial study.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009
Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation.
IEEE Trans. Multim., 2009

Unified Video Annotation via Multigraph Learning.
IEEE Trans. Circuits Syst. Video Technol., 2009

Semi-supervised kernel density estimation for video annotation.
Comput. Vis. Image Underst., 2009

Image Fusion Quality Metrics by Directional Projection.
Proceedings of the IEEE International Conference on Systems, 2009

Inferring semantic concepts from community-contributed images and noisy tags.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Scalable detection of partial near-duplicate videos by visual-temporal consistency.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

ViewFocus: explore places of interests on Google maps using photos with view direction filtering.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event driven summarization for web videos.
Proceedings of the first SIGMM workshop on Social media, 2009

From text question-answering to multimedia QA on web-scale media resources.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

NUS-WIDE: a real-world web image database from National University of Singapore.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008
A Quality Metric Based on Color Similarity for Image Fusion.
Int. J. Inf. Acquis., 2008

A Projection-Based Metric for the Quantitative Evaluation of Pixel-Level Image Fusion.
Proceedings of the Fourth International Conference on Natural Computation, 2008

2007
Lazy Learning Based Efficient Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Salience Preserving Multi-Focus Image Fusion.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Probability Model for Image Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

