Jinhui Tang

Pattern Recognit. Lett., 2024

EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond.

[BibT_eX]

[DOI]

CoRR, 2024

FTMoMamba: Motion Generation with Frequency and Text State Space Models.

[BibT_eX]

[DOI]

CoRR, 2024

FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data.

[BibT_eX]

[DOI]

CoRR, 2024

The Solution for Temporal Action Localisation Task of Perception Test Challenge 2024.

[BibT_eX]

[DOI]

CoRR, 2024

Prototypical Prompting for Text-to-image Person Re-identification.

[BibT_eX]

[DOI]

CoRR, 2024

Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization.

[BibT_eX]

[DOI]

CoRR, 2024

A Recover-then-Discriminate Framework for Robust Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey.

[BibT_eX]

[DOI]

Ioannis Katsavounidis

CoRR, 2024

Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation.

[BibT_eX]

[DOI]

CoRR, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.

[BibT_eX]

[DOI]

CoRR, 2024

Collaborative Feedback Discriminative Propagation for Video Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2024

Context-Semantic Quality Awareness Network for Fine-Grained Visual Categorization.

[BibT_eX]

[DOI]

CoRR, 2024

STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

Prototypical Prompting for Text-to-image Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Dual-view Pyramid Network for Video Frame Interpolation.

[BibT_eX]

[DOI]

Ming Yang

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

TrGa: Reconsidering the Application of Graph Neural Networks in Two-View Correspondence Pruning.

[BibT_eX]

[DOI]

Luanyuan Dai

Xiaoyu Du

Florin-Alexandru Vasluianu

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

ColorMNet: A Memory-Based Deep Spatial-Temporal Feature Propagation Network for Video Colorization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

NTIRE 2024 Challenge on Blind Enhancement of Compressed Image: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Unleashing Network Potentials for Semantic Scene Completion.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTIRE 2024 Image Shadow Removal Challenge Report.

[BibT_eX]

[DOI]

Santosh Kumar Vipparthi

Ahmad 'Athif Mohd Faudzi

Santosh Kumar Vipparthi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report.

[BibT_eX]

[DOI]

Ahmad Mahmoudi-Aznaveh

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey.

[BibT_eX]

[DOI]

Marcos V. Conde

Zhijun Lei

Wen Li

Ioannis Katsavounidis

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PTSR: Prefix-Target Graph-based Sequential Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

MGNet: Learning Correspondences via Multiple Graphs.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

SI-Net: spatial interaction network for deepfake detection.

[BibT_eX]

[DOI]

Multim. Syst., October, 2023

Vision Transformer With Hybrid Shifted Windows for Gastrointestinal Endoscopy Image Classification.

[BibT_eX]

[DOI]

Wei Wang

Xin Yang

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Boosting Few-Shot Fine-Grained Recognition With Background Suppression and Foreground Alignment.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., August, 2023

Progressive Instance-Aware Feature Learning for Compositional Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-Local Spatial-Temporal Similarity.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Multi-Granularity Anchor-Contrastive Representation Learning for Semi-Supervised Skeleton-Based Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals.

[BibT_eX]

[DOI]

Lu Jin

IEEE Trans. Neural Networks Learn. Syst., April, 2023

Single Image Deraining Using Residual Channel Attention Networks.

[BibT_eX]

[DOI]

Di Wang

J. Comput. Sci. Technol., April, 2023

Augmented FCN: rethinking context modeling for semantic segmentation.

[BibT_eX]

[DOI]

Dong Zhang

Liyan Zhang

Sci. China Inf. Sci., April, 2023

CLIP-Driven Fine-Grained Text-Image Person Re-Identification.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Centralized Feature Pyramid for Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

Multi-Granularity Denoising and Bidirectional Alignment for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Tao Chen

Yazhou Yao

IEEE Trans. Image Process., 2023

ISTVT: Interpretable Spatial-Temporal Video Transformer for Deepfake Detection.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2023

Learning Transferable Discriminative Knowledge From Attribute-Aligned Hyperspectral Images.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

W-HMR: Human Mesh Recovery in World Space with Weak-supervised Camera Calibration and Orientation Correction.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Unified Deep Image Deraining: A Survey and A New Benchmark.

[BibT_eX]

[DOI]

CoRR, 2023

M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset.

[BibT_eX]

[DOI]

CoRR, 2023

Triplet Contrastive Learning for Unsupervised Vehicle Re-identification.

[BibT_eX]

[DOI]

CoRR, 2023

DLGSANet: Lightweight Dynamic Local and Global Self-Attention Networks for Image Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Foreground/Background-Masked Interaction Learning for Spatio-temporal Action Detection.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Spatially-Adaptive Feature Modulation for Efficient Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DLGSANet: Lightweight Dynamic Local and Global Self-Attention Network for Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-scale Residual Low-Pass Filter Network for Image Deblurring.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SVMV: Spatiotemporal Variance-Supervised Motion Volume for Video Frame Interpolation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Semantic Scene Completion with Cleaner Self.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NTIRE 2023 Challenge on Stereo Image Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Fredrik K. Gustafsson

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Deep Discriminative Spatial and Temporal Network for Efficient Video Deblurring.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NTIRE 2023 Challenge on Image Denoising: Methods and Results.

[BibT_eX]

[DOI]

Javier Vazquez-Corral

Konstantinos G. Derpanis

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and Report.

[BibT_eX]

[DOI]

Bahri Batuhan Bilecen

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Video-Text Pre-training with Learned Regions for Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Causal Inference with Knowledge Distilling and Curriculum Learning for Unbiased VQA.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2022

Position-Aware Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2022

RGB-D DSO: Direct Sparse Odometry With RGB-D Cameras for Indoor Scenes.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Sub-Region Localized Hashing for Fine-Grained Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Learning Discriminative Cross-Modality Features for RGB-D Saliency Detection.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Bi-Directional Pseudo-Three-Dimensional Network for Video Frame Interpolation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Self-Guided Image Dehazing Using Progressive Feature Fusion.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

LiSiam: Localization Invariance Siamese Network for Deepfake Detection.

[BibT_eX]

[DOI]

Jian Wang

Yunlian Sun

IEEE Trans. Inf. Forensics Secur., 2022

Learning attention-guided pyramidal features for few-shot fine-grained recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Fine-Grained Image Analysis With Deep Learning: A Survey.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Coherence Constrained Graph LSTM for Group Activity Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

CTNet: Context-Based Tandem Network for Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Spatially Variant Linear Representation Models for Joint Filtering.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

iFlowGAN: An Invertible Flow-Based Generative Adversarial Network for Unsupervised Image-to-Image Translation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

A selection function for pitched instrument source separation.

[BibT_eX]

[DOI]

Yukai Gong

Multim. Syst., 2022

High Dimensional Convolution Acceleration via Tensor Decomposition.

[BibT_eX]

[DOI]

J. Circuits Syst. Comput., 2022

Convolutional-capsule network for gastrointestinal endoscopy image classification.

[BibT_eX]

[DOI]

Int. J. Intell. Syst., 2022

Dual Convolutional Neural Networks for Low-Level Vision.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2022

SLLEN: Semantic-aware Low-light Image Enhancement Network.

[BibT_eX]

[DOI]

CoRR, 2022

Deep Learning for Medical Image Segmentation: Tricks, Challenges and Future Directions.

[BibT_eX]

[DOI]

CoRR, 2022

Contextual and selective attention networks for image captioning.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2022

ShuffleMixer: An Efficient ConvNet for Image Super-Resolution.

[BibT_eX]

[DOI]

Long Sun

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Graph Reasoning Transformer for Image Parsing.

[BibT_eX]

[DOI]

Dong Zhang

Kwang-Ting Cheng

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Look Less Think More: Rethinking Compositional Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Heterogeneous Learning for Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Invariant Representation Learning for Multimedia Recommendation.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Challenge on Learning the Super-Resolution Space.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

NTIRE 2022 Burst Super-Resolution Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021

Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2021

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Physics-Based Generative Adversarial Models for Image Restoration and Beyond.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Bi-branch network for dynamic scene deblurring.

[BibT_eX]

[DOI]

Zhong-Hui Duan

Comput. Vis. Image Underst., 2021

Video-Text Pre-training with Learned Regions.

[BibT_eX]

[DOI]

CoRR, 2021

CTNet: Context-based Tandem Network for Semantic Segmentation.

[BibT_eX]

[DOI]

Yanpeng Sun

CoRR, 2021

Semi-supervised local feature selection for data classification.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2021

Reproducibility Companion Paper: Visual Relation of Interest Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning a Tree-Structured Channel-Wise Refinement Network for Efficient Image Deraining.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Self-Regulation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Blind Video Super-resolution.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning a Cascaded Non-Local Residual Network for Super-Resolving Blurry Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

Fast Matrix Factorization With Nonuniform Weights on Missing Data.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2020

Joint Label Prediction Based Semi-Supervised Adaptive Concept Factorization for Robust Data Representation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2020

Adversarial Training Towards Robust Multimedia Recommender System.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2020

Task-Oriented Network for Image Dehazing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

Facial Age and Expression Synthesis Using Ordinal Ranking Adversarial Networks.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2020

Facial Age Synthesis With Label Distribution-Guided Generative Adversarial Network.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2020

Recursive Discriminative Subspace Learning With $\ell_{1}$ -Norm Distance Constraint.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2020

Speed Up Bilateral Filtering via Sparse Approximation on a Learned Cosine Dictionary.

[BibT_eX]

[DOI]

Liang Tang

IEEE Trans. Circuits Syst. Video Technol., 2020

Deep supervised feature selection for social relationship recognition.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2020

Deep multi-person kinship matching and recognition for family photos.

[BibT_eX]

[DOI]

Pattern Recognit., 2020

Discriminative supplementary representation learning for novel-category classification.

[BibT_eX]

[DOI]

Qiuli Liu

Neurocomputing, 2020

Weakly-supervised Semantic Guided Hashing for Social Image Retrieval.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

Interactive Fusion of Multi-level Features for Compositional Activity Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Deep Blind Video Super-resolution.

[BibT_eX]

[DOI]

CoRR, 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Distilling knowledge in causal inference for unbiased visual question answering.

[BibT_eX]

[DOI]

Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Reproducibility Companion Paper: Instance of Interest Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Visual Relation of Interest Detection.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multimodal Attention with Image Text Spatial Relationship for OCR-Based Image Captioning.

[BibT_eX]

[DOI]

Jing Wang

Jiebo Luo

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Weakly-Supervised Image Hashing through Masked Visual-Semantic Graph-based Reasoning.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

How to Learn Item Representation for Cold-Start Multimedia Recommendation?

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Feature Pyramid Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Social Adaptive Module for Weakly-Supervised Group Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior.

[BibT_eX]

[DOI]

Haoran Bai

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Formation Model Guided Deep Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Integrating Dense LiDAR-Camera Road Detection Maps by a Multi-Modal CRF Model.

[BibT_eX]

[DOI]

IEEE Trans. Veh. Technol., 2019

Source Resolvability of Spatial-Smoothing-Based Subspace Methods: A Hadamard Product Perspective.

[BibT_eX]

[DOI]

Zai Yang

Petre Stoica

IEEE Trans. Signal Process., 2019

Show, Reward, and Tell: Adversarial Visual Story Generation.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2019

Modeling Embedding Dimension Correlations via Convolutional Neural Collaborative Filtering.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., 2019

Weighted Mixed-Norm Regularized Regression for Robust Face Identification.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Deep Semantic-Preserving Ordinal Hashing for Cross-Modal Similarity Search.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

On the Sample Complexity of Multichannel Frequency Estimation via Convex Optimization.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Theory, 2019

Deep Ordinal Hashing With Spatial Attention.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Interpreting and Extending the Guided Filter via Cyclic Coordinate Descent.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2019

Social Anchor-Unit Graph Regularized Tensor Completion for Large-Scale Image Retagging.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Richer Convolutional Features for Edge Detection.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Deep Collaborative Embedding for Social Image Understanding.

[BibT_eX]

[DOI]

Tao Mei

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Image annotation refinement via 2P-KNN based group sparse reconstruction.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2019

Multimedia retrieval by deep hashing with multilevel similarity learning.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2019

Deep Semantic Multimodal Hashing Network for Scalable Multimedia Retrieval.

[BibT_eX]

[DOI]

CoRR, 2019

Cauchy Matrix Factorization for Tag-Based Social Image Retrieval.

[BibT_eX]

[DOI]

IEEE Access, 2019

Temporal Action Localization Based on Temporal Evolution Model and Multiple Instance Learning.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Selective Attention Network for Image Dehazing and Deraining.

[BibT_eX]

[DOI]

Xiao Liang

Runde Li

Venkateswararao Cherukuri

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Visual-Inertial State Estimation with Pre-integration Correction for Robust Mobile Augmented Reality.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Instance of Interest Detection.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Crowd Counting via Multi-layer Regression.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Hierarchical Visual Relationship Detection.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Aligning Linguistic Words and Visual Semantic Units for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Progressive Image Enhancement under Aesthetic Guidance.

[BibT_eX]

[DOI]

Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Road Detection through CRF based LiDAR-Camera Fusion.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Robotics and Automation, 2019

Few-Shot Image Recognition With Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Adaptive Context Network for Scene Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spatially Variant Linear Representation Models for Joint Filtering.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

NTIRE 2019 Image Dehazing Challenge Report.

[BibT_eX]

[DOI]

Pablo Navarrete Michelini

Harshjeet Singh Aulakh

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018

Discriminative Deep Quantization Hashing for Face Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2018

Robust Structured Nonnegative Matrix Factorization for Image Representation.

[BibT_eX]

[DOI]

Xiaofei He

IEEE Trans. Neural Networks Learn. Syst., 2018

Modeling Multimodal Clues in a Hybrid Deep Learning Framework for Video Classification.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Semantic Neighbor Graph Hashing for Multimodal Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2018

Weakly Supervised Multimodal Hashing for Scalable Social Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Image Classification With Tailored Fine-Grained Dictionaries.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2018

Supervised deep hashing for scalable face image retrieval.

[BibT_eX]

[DOI]

Xiang Zhu

Pattern Recognit., 2018

Personalized Age Progression with Bi-Level Aging Dictionary Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Person re-identification with activity prediction based on hierarchical spatial-temporal model.

[BibT_eX]

[DOI]

Neurocomputing, 2018

Visual understanding by mining social media: recent advances and challenges.

[BibT_eX]

[DOI]

Xueming Wang

Frontiers Comput. Sci., 2018

Fast Matrix Factorization with Non-Uniform Weights on Missing Data.

[BibT_eX]

[DOI]

CoRR, 2018

Latent Dirichlet Truth Discovery: Separating Trustworthy and Untrustworthy Components in Data Sources.

[BibT_eX]

[DOI]

IEEE Access, 2018

Matrix Entropy Driven Maximum Margin Feature Learning.

[BibT_eX]

[DOI]

Dong Zhang

Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Designing by Training: Acceleration Neural Network for Fast High-Dimensional Convolution.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Effective Action Detection Using Temporal Context and Posterior Probability of Length.

[BibT_eX]

[DOI]

Xinran Liu

Yan Song

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Cascaded Feature Augmentation with Diffusion for Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Outer Product-based Neural Collaborative Filtering.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Learning Dual Convolutional Neural Networks for Low-Level Vision.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Single Image Dehazing via Conditional Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

DotaNet: Two-Stream Match-Recurrent Neural Networks for Predicting Social Game Result.

[BibT_eX]

[DOI]

Zhen Qi

Xiangbo Shu

Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Attributes Consistent Faces Generation Under Arbitrary Poses.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2018, 2018

Show, Reward and Tell: Automatic Generation of Narrative Paragraph From Photo Stream by Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Multi-Grained Random Fields for Mitosis Identification in Time-Lapse Phase Contrast Microscopy Image Sequences.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, 2017

LEGO-MM: LEarning Structured Model by Probabilistic loGic Ontology Tree for MultiMedia.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Weakly Supervised Deep Matrix Factorization for Social Image Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

Semi-Supervised Image-to-Video Adaptation for Video Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2017

Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Human Parsing with Contextualized Convolutional Neural Network.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Captioning Videos Using Large-Scale Image Corpus.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2017

Multimedia news QA: Extraction and visualization integration with multiple-source information.

[BibT_eX]

[DOI]

Xueming Wang

Image Vis. Comput., 2017

Computational face reader based on facial attribute estimation.

[BibT_eX]

[DOI]

Neurocomputing, 2017

Learning discriminative supplementary features to attributes for novel-category classification.

[BibT_eX]

[DOI]

Qiuli Liu

Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017

Recovering Overlapping Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation.

[BibT_eX]

[DOI]

Yukai Gong

Xiangbo Shu

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Mini Neural Networks for Effective and Efficient Mobile Album Organization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Wheel: Accelerating CNNs with Distributed GPUs via Hybrid Parallelism and Alternate Strategy.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Multimedia Conference, 2017

Discriminative Deep Hashing for Scalable Face Image Retrieval.

[BibT_eX]

[DOI]

Jie Lin

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Multi-part boosting LSTMS for skeleton based human activity analysis.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Inception Single Shot MultiBox Detector for object detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Sketch-Based Image Retrieval with Multiple Binary HoG Descriptor.

[BibT_eX]

[DOI]

Tianqi Wang

Liyan Zhang

Proceedings of the Internet Multimedia Computing and Service, 2017

Bidirectional Multimodal Recurrent Neural Networks with Refined Visual Features for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Internet Multimedia Computing and Service, 2017

CAD: Scale Invariant Framework for Real-Time Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Concurrence-Aware Long Short-Term Sub-Memories for Person-Person Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Hardware-Efficient Guided Image Filtering for Multi-label Problem.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Video Summaries Using Multiple Features and Image Quality.

[BibT_eX]

[DOI]

Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Weakly-Supervised Deep Nonnegative Low-Rank Model for Social Image Tag Refinement and Assignment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Generalized Deep Transfer Networks for Knowledge Propagation in Heterogeneous Domains.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2016

Multimedia News Summarization in Search.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2016

Beyond Object Proposals: Random Crop Pooling for Multi-Label Image Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Nonconvex Nonsmooth Low Rank Minimization via Iteratively Reweighted Nuclear Norm.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Deep Learning Driven Visual Path Prediction From a Single Image.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2016

Patch-Set-Based Representation for Alignment-Free Image Set Classification.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Kinship-Guided Age Progression.

[BibT_eX]

[DOI]

Pattern Recognit., 2016

A Deterministic Analysis for LRR.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Overlapping community detection based on node location analysis.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2016

Local structure based multi-phase collaborative representation for face recognition with single sample per person.

[BibT_eX]

[DOI]

Inf. Sci., 2016

Vision-based two-step brake detection method for vehicle collision avoidance.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Age progression: Current technologies and applications.

[BibT_eX]

[DOI]

Neurocomputing, 2016

Linear Time Illumination Invariant Stereo Matching.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Joint depth map interpolation and segmentation with planar surface model.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH ASIA 2016, Macao, December 5-8, 2016 - Technical Briefs, 2016

Computational Face Reader.

[BibT_eX]

[DOI]

Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Object-aware Deep Network for Commodity Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

A multi-phase sparse probability framework via entropy minimization for single sample face recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Supervised Quantization for Similarity Search.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2015

Real-Time System for Driver Fatigue Detection by RGB-D Camera.

[BibT_eX]

[DOI]

Liyan Zhang

Fan Liu

ACM Trans. Intell. Syst. Technol., 2015

Robust Multiview Feature Learning for RGB-D Image Understanding.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2015

Local Structure-Based Sparse Representation for Face Recognition.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2015

Constructing a Nonnegative Low-Rank and Sparse Graph With Data-Adaptive Features.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Double Nuclear Norm-Based Matrix Decomposition for Occluded Image Recovery and Background Modeling.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Neighborhood Discriminant Hashing for Large-Scale Image Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Unsupervised Feature Selection via Nonnegative Spectral Analysis and Redundancy Control.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

Describing Trajectory of Surface Patch for Human Action Recognition on RGB and Depth Videos.

[BibT_eX]

[DOI]

Yan Song

Shi Liu

IEEE Signal Process. Lett., 2015

Online Topic-Aware Influence Maximization.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2015

Efficient and Robust Specular Highlight Removal.

[BibT_eX]

[DOI]

Qingxiong Yang

Narendra Ahuja

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Robust Structured Subspace Learning for Data Representation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2015

Tag ranking based on salient region graph propagation.

[BibT_eX]

[DOI]

Multim. Syst., 2015

NIF-based seam carving for image resizing.

[BibT_eX]

[DOI]

Multim. Syst., 2015

Saliency-based content-aware lifestyle image mosaics.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2015

Local Similarity based Discriminant Analysis for Face Recognition.

[BibT_eX]

[DOI]

KSII Trans. Internet Inf. Syst., 2015

Different Users, Different Opinions: Predicting Search Satisfaction with Mouse Movement Information.

[BibT_eX]

[DOI]

Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Deep kinship verification.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Deep Matrix Factorization for social image tag refinement and assignment.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

What Shall I Look Like after N Years?

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Partially Common-Semantic Pursuit for RGB-D Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Semantic-aware Hashing for Social Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

A novel video dehazing method based on temporal visual coherence.

[BibT_eX]

[DOI]

Xinguang Xiang

Yang Cheng

Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Personalized Age Progression with Aging Dictionary.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Human Parsing with Contextualized Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Sparse composite quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Activity Prediction Based on Spatiotemporal Model in a Multiple Cameras Network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

2014

Modified Principal Component Analysis: An Integration of Multiple Similarity Subspace Models.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2014

Personalized Geo-Specific Tag Recommendation for Photos on Social Websites.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2014

Product Aspect Ranking and Its Applications.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2014

A Unified Geolocation Framework for Web Videos.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2014

Scalable Similarity Search With Topology Preserving Hashing.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Multi-Label Image Categorization With Sparse Factor Representation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Learning Discriminative Dictionary for Group Sparse Representation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Learning-Based Bipartite Graph Matching for View-Based 3D Model Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Joint Video Frame Set Division and Low-Rank Decomposition for Background Subtraction.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2014

Body Surface Context: A New Robust Feature for Action Recognition From Depth Videos.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2014

Typicality ranking: beyond accuracy for video semantic annotation.

[BibT_eX]

[DOI]

Xian-Sheng Hua

Multim. Tools Appl., 2014

Social media mining and knowledge discovery.

[BibT_eX]

[DOI]

Multim. Syst., 2014

Projective Matrix Factorization with unified embedding for social image tagging.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2014

Constructing a Non-Negative Low Rank and Sparse Graph with Data-Adaptive Features.

[BibT_eX]

[DOI]

CoRR, 2014

What Can We Learn about Motion Videos from Still Images?

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Augmented Image Retrieval using Multi-order Object Layout with Attributes.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Cuteness Recognition and Localization in the Photos of Animals.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Local structure based sparse representation for face recognition with single sample per person.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Generalized Nonconvex Nonsmooth Low-Rank Minimization.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Term Selection and Result Reranking for Question Retrieval by Exploiting Hierarchical Classification.

[BibT_eX]

[DOI]

Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013

Towards optimizing human labeling for interactive image tagging.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Near-lossless semantic video summarization and its applications to video analysis.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2013

Cross-Space Affinity Learning with Its Application to Movie Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2013

Sparse Tensor Discriminant Analysis.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2013

Accurate Estimation of Human Body Orientation From RGB-D Sensors.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2013

Improving Bottom-up Saliency Detection by Looking into Neighbors.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2013

Designing Template-Free Predictor for Targeting Protein-Ligand Binding Sites with Classifier Ensemble and Spatial Clustering.

[BibT_eX]

[DOI]

IEEE ACM Trans. Comput. Biol. Bioinform., 2013

Label-specific training set construction from web resource for image annotation.

[BibT_eX]

[DOI]

Signal Process., 2013

Combining global and local matching of multiple features for precise item image retrieval.

[BibT_eX]

[DOI]

Multim. Syst., 2013

Sparse representations for image and video analysis.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2013

Intelligent processing techniques for semantic-based image and video retrieval.

[BibT_eX]

[DOI]

Neurocomputing, 2013

WLBP: Weber local binary pattern for local image description.

[BibT_eX]

[DOI]

Fan Liu

Zhenmin Tang

Neurocomputing, 2013

Hybrid image summarization by hypergraph partition.

[BibT_eX]

[DOI]

Minxian Li

Chunxia Zhao

Neurocomputing, 2013

An Eye States Detection Method by Using WLBP.

[BibT_eX]

[DOI]

Fan Liu

Zhenmin Tang

Proceedings of the 2013 IEEE Seventh International Conference on Semantic Computing, 2013

Real-Time Head Pose Estimation by RGB-D Camera.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Cross Concept Local Fisher Discriminant Analysis for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Saliency-Based Content-Aware Image Mosaics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Topology preserving hashing for similarity search.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Strong geometrical consistency in large scale partial-duplicate image search.

[BibT_eX]

[DOI]

Junqiang Wang

Yu-Gang Jiang

Proceedings of the ACM Multimedia Conference, 2013

Multimedia LEGO: Learning Structured Model by Probabilistic Logic Ontology Tree.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Correntropy Induced L2 Graph for Robust Subspace Clustering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

Binary Code Ranking with Weighted Hamming Distance.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Weakly-Supervised Dual Clustering for Image Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Community question topic categorization via hierarchical kernelized classification.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012

Label-to-region with continuity-biased bi-layer sparsity priors.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2012

Learning Semantics From Multimedia Web Resources: An Introduction to the Special Issue.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2012

Introduction to the Special Section on Distance Metric Learning in Intelligent Systems.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2012

Semantic-Gap-Oriented Active Learning for Multilabel Image Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

Camera Constraint-Free View-Based 3-D Object Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2012

Social media mining and search.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2012

Active Learning on Sparse Graph for Image Annotation.

[BibT_eX]

[DOI]

Minxian Li

Chunxia Zhao

KSII Trans. Internet Inf. Syst., 2012

Looking into the world on Google Maps with view direction estimated photos.

[BibT_eX]

[DOI]

Neurocomputing, 2012

The Method for Constructing Block Sparse Measurement Matrix Based on Orthogonal Vectors.

[BibT_eX]

[DOI]

Ruizhen Zhao

Zhou Qin

Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Abnormal behavior recognition system for ATM monitoring by RGB-D camera.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Query expansion enhancement by fast binary matching.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Low rank metric learning for social image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Beyond local image features: Scene calssification using supervised semantic representation.

[BibT_eX]

[DOI]

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Tag ranking by propagating relevance over tag and image graphs.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Omni-range spatial contexts for visual classification.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011

Beyond search: Event-driven summarization for web videos.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2011

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images.

[BibT_eX]

[DOI]

ACM Trans. Intell. Syst. Technol., 2011

Interactive multimedia computing.

[BibT_eX]

[DOI]

Multim. Syst., 2011

Label-Specific Training Set Construction from Web Resource for Image Annotation

[BibT_eX]

[DOI]

CoRR, 2011

Capturing a great photo via learning from community-contributed photo collections.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards multi-semantic image annotation with graph regularized exclusive group lasso.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

iSearch: towards precise retrieval of item image.

[BibT_eX]

[DOI]

Proceedings of the ICIMCS 2011, 2011

Effective image representation based on bi-layer visual codebook.

[BibT_eX]

[DOI]

Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010

Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2010

Image Classification With Kernelized Spatial-Context.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2010

Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2010

Multimedia Question Answering.

[BibT_eX]

[DOI]

Tat-Seng Chua

Richang Hong

Scholarpedia, 2010

Metric learning with feature decomposition for image categorization.

[BibT_eX]

[DOI]

Neurocomputing, 2010

View-based 3D model retrieval with probabilistic graph model.

[BibT_eX]

[DOI]

Neurocomputing, 2010

Estimating Poses of World's Photos with Geographic Metadata.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

Mediapedia: Mining Web Knowledge to Construct Multimedia Encyclopedia.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2010

One person labels one million images.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

W2Go: a travel guidance system by automatic landmark ranking.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Multimedia 2010, 2010

Nonparametric Label-to-Region by search.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Exploring large scale data for multimedia QA: an initial study.

[BibT_eX]

[DOI]

Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009

Correlative Linear Neighborhood Propagation for Video Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Syst. Man Cybern. Part B, 2009

Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2009

Unified Video Annotation via Multigraph Learning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2009

Video semantic analysis based on structure-sensitive anisotropic manifold ranking.

[BibT_eX]

[DOI]

Signal Process., 2009

Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2009

Image Fusion Quality Metrics by Directional Projection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2009

Graph-Based Pairwise Learning to Rank for Video Search.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Modeling, 2009

Inferring semantic concepts from community-contributed images and noisy tags.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Pornprobe: an LDA-SVM based pornography detection system.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

ViewFocus: explore places of interests on Google maps using photos with view direction filtering.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Label to region by bi-layer sparsity priors.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

ACM SIGMM the first workshop on web-scale multimedia corpus (WSMC09).

[BibT_eX]

[DOI]

Benoit Huet

Alexander G. Hauptmann

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event driven summarization for web videos.

[BibT_eX]

[DOI]

Proceedings of the first SIGMM workshop on Social media, 2009

From text question-answering to multimedia QA on web-scale media resources.

[BibT_eX]

[DOI]

Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

NUS-WIDE: a real-world web image database from National University of Singapore.

[BibT_eX]

[DOI]

Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Active Video Annotation.

[BibT_eX]

[DOI]

Proceedings of the Semantic Mining Technologies for Multimedia Databases., 2009

Image/Video Semantic Analysis by Semi-Supervised Learning.

[BibT_eX]

[DOI]