2025
MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension.
IEEE Trans. Circuits Syst. Video Technol., January, 2025
Cross-Modal Cognitive Consensus Guided Audio-Visual Segmentation.
IEEE Trans. Multim., 2025
High efficiency deep image compression via channel-wise scale adaptive latent representation learning.
Signal Process. Image Commun., 2025
Broad feature extraction and multi-directional imbalanced weighted broad learning system for the unsupervised stereo matching method.
Expert Syst. Appl., 2025
Scoring structure regularized gradient boosting network for blind image quality assessment.
Displays, 2025
2024
Continual Cross-Domain Image Compression via Entropy Prior Guided Knowledge Distillation and Scalable Decoding.
IEEE Trans. Circuits Syst. Video Technol., September, 2024
Robust Unpaired Image Dehazing via Adversarial Deformation Constraint.
IEEE Trans. Circuits Syst. Video Technol., September, 2024
Learning Offset Probability Distribution for Accurate Object Detection.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024
TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning.
IEEE Trans. Circuits Syst. Video Technol., May, 2024
Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning.
IEEE Trans. Multim., 2024
CrowdCaption++: Collective-Guided Crowd Scenes Captioning.
IEEE Trans. Multim., 2024
Logit Variated Product Quantization Based on Parts Interaction and Metric Learning With Knowledge Distillation for Fine-Grained Image Retrieval.
IEEE Trans. Multim., 2024
Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval.
IEEE Trans. Multim., 2024
Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond.
IEEE Trans. Multim., 2024
CSLNSpeech: Solving the extended speech separation problem with the help of Chinese sign language.
Speech Commun., 2024
Blessing few-shot segmentation via semi-supervised learning with noisy support images.
Pattern Recognit., 2024
Few-shot class incremental learning via prompt transfer and knowledge distillation.
Image Vis. Comput., 2024
Advancing zero-shot semantic segmentation through attribute correlations.
Neurocomputing, 2024
Class similarity weighted knowledge distillation for few shot incremental learning.
Neurocomputing, 2024
VLM-guided Explicit-Implicit Complementary novel class semantic learning for few-shot object detection.
Expert Syst. Appl., 2024
ARIC: An Activity Recognition Dataset in Classroom Surveillance Images.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation.
CoRR, 2024
Few-Shot Continual Learning for Activity Recognition in Classroom Surveillance Images.
CoRR, 2024
Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion.
CoRR, 2024
No Re-Train, More Gain: Upgrading Backbones with Diffusion Model for Few-Shot Segmentation.
CoRR, 2024
IoU-CLIP: IoU-Aware Language-Image Model Tuning for Open Vocabulary Object Detection.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024
Robust Real-World Image Dehazing via Knowledge Guided Conditional Diffusion Model Finetuning.
Proceedings of the 26th IEEE International Workshop on Multimedia Signal Processing, 2024
DP-RSCAP: Dual Prompt-Based Scene and Entity Network for Remote Sensing Image Captioning.
Proceedings of the IGARSS 2024, 2024
Vision-Sensor Attention Based Continual Multimodal Egocentric Activity Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
GM-DETR: Generalized Muiltispectral DEtection TRansformer with Efficient Fusion Encoder for Visible-Infrared Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Prompt-Driven Referring Image Segmentation with Instance Contrasting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Dual-Consistency Model Inversion for Non-Exemplar Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
HumanFormer: Human-centric Prompting Multi-modal Perception Transformer for Referring Crowd Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Inertial Strengthened CLIP model for Zero-shot Multimodal Egocentric Activity Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024
2023
Dual-graph hierarchical interaction network for referring image segmentation.
Displays, December, 2023
Disturbed Augmentation Invariance for Unsupervised Visual Representation Learning.
IEEE Trans. Circuits Syst. Video Technol., November, 2023
Misaligned RGB-Infrared Object Detection via Adaptive Dual-Discrepancy Calibration.
Remote. Sens., October, 2023
Multi-directional broad learning system for the unsupervised stereo matching method.
Pattern Recognit., October, 2023
Cross-Modal Recurrent Semantic Comprehension for Referring Image Segmentation.
IEEE Trans. Circuits Syst. Video Technol., July, 2023
Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation.
IEEE Trans. Multim., 2023
What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning.
IEEE Trans. Multim., 2023
Forgetting to Remember: A Scalable Incremental Learning Framework for Cross-Task Blind Image Quality Assessment.
IEEE Trans. Multim., 2023
Unsupervised Visual Representation Learning via Multi-Dimensional Relationship Alignment.
IEEE Trans. Image Process., 2023
Reading Various Types of Pointer Meters Under Extreme Motion Blur.
IEEE Trans. Instrum. Meas., 2023
DRDet: Dual-Angle Rotated Line Representation for Oriented Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2023
Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels.
IEEE Trans. Circuits Syst. Video Technol., 2023
GAB-Net: A Robust Detector for Remote Sensing Object Detection Under Dramatic Scale Variation and Complex Backgrounds.
IEEE Geosci. Remote. Sens. Lett., 2023
GFR: Generic feature representations for class incremental learning.
Neurocomputing, 2023
ISM-Net: Mining incremental semantics for class incremental learning.
Neurocomputing, 2023
GRSDet: Learning to Generate Local Reverse Samples for Few-shot Object Detection.
CoRR, 2023
Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration.
CoRR, 2023
Class-Prompting Transformer for Incremental Semantic Segmentation.
IEEE Access, 2023
Where to Forget: A New Attention Stability Metric for Continual Learning Evaluation.
Proceedings of the Digital Multimedia Communications, 2023
Semi-Supervised Few-Shot Segmentation with Noisy Support Images.
Proceedings of the IEEE International Conference on Image Processing, 2023
The Elliptic Energy Loss for Rotated Object Detection in Aerial Images.
Proceedings of the IEEE International Conference on Image Processing, 2023
Confusion Mixup Regularized Multimodal Fusion Network for Continual Egocentric Activity Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
MFAT: A Multi-Level Feature Aggregated Transformer for Person Re-Identification.
Proceedings of the IEEE International Conference on Acoustics, 2023
Instance-Wise Adaptive Tuning and Caching for Vision-Language Models.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023
Incrementer: Transformer for Class-Incremental Semantic Segmentation with Knowledge Distillation Focusing on Old Class.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
CafeBoost: Causal Feature Boost to Eliminate Task-Induced Bias for Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Bal-R$^2$CNN: High Quality Recurrent Object Detection With Balance Optimization.
IEEE Trans. Multim., 2022
Segmenting Beyond the Bounding Box for Instance Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2022
POS-Trends Dynamic-Aware Model for Video Caption.
IEEE Trans. Circuits Syst. Video Technol., 2022
ASFlow: Unsupervised Optical Flow Learning With Adaptive Pyramid Sampling.
IEEE Trans. Circuits Syst. Video Technol., 2022
Category boundary re-decision by component labels to improve generation of class activation map.
Neurocomputing, 2022
Real-time panoptic segmentation with relationship between adjacent pixels and boundary prediction.
Neurocomputing, 2022
Instance-level Context Attention Network for instance segmentation.
Neurocomputing, 2022
Dynamic Perceptual Quality Ranking based Autofocus Method for Projector.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
RefCrowd: Grounding the Target in Crowd with Referring Expressions.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
2021
Non-Homogeneous Haze Removal via Artificial Scene Prior and Bidimensional Graph Reasoning.
IEEE Trans. Image Process., 2021
Hierarchical class grouping with orthogonal constraint for class activation map generation.
Neural Comput. Appl., 2021
Behaviour detection in crowded classroom scenes via enhancing features robust to scale and perspective variations.
IET Image Process., 2021
Single Image Dehazing Via Region Adaptive Two-Shot Network.
IEEE Multim., 2021
Few-Shot Segmentation via Complementary Prototype Learning and Cascaded Refinement.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021
Remember and Reuse: Cross-Task Blind Image Quality Assessment via Relevance-aware Incremental Learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
2020
Hierarchical Context Features Embedding for Object Detection.
IEEE Trans. Multim., 2020
Subjective and Objective De-Raining Quality Assessment Towards Authentic Rain Image.
IEEE Trans. Circuits Syst. Video Technol., 2020
Weakly Supervised Semantic Segmentation by a Class-Level Multiple Group Cosegmentation and Foreground Fusion Strategy.
IEEE Trans. Circuits Syst. Video Technol., 2020
HeadNet: An End-to-End Adaptive Relational Network for Head Detection.
IEEE Trans. Circuits Syst. Video Technol., 2020
Multi-Scale Shape Adaptive Network for Raindrop Detection and Removal from a Single Image.
Sensors, 2020
An efficient and compact 3D local descriptor based on the weighted height image.
Inf. Sci., 2020
Discriminative deep metric learning for asymmetric discrete hashing.
Neurocomputing, 2020
A Symmetric Fully Convolutional Residual Network With DCRF for Accurate Tooth Segmentation.
IEEE Access, 2020
Mining Larger Class Activation Map with Common Attribute Labels.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020
Mono is Enough: Instance Segmentation from Single Annotated Sample.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020
A New Bounding Box based Pseudo Annotation Generation Method for Semantic Segmentation.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020
A Unified Single Image De-raining Model via Region Adaptive Coupled Network.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020
Haze-robust image understanding via context-aware deep feature refinement.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
A New Local Transformation Module for Few-Shot Segmentation.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Learn to Pay Attention Via Switchable Attention for Image Recognition.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020
Region Adaptive Two-Shot Network For Single Image Dehazing.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020
Single Image Dehazing Via Artificial Multiple Shots And Multidimensional Context.
Proceedings of the IEEE International Conference on Image Processing, 2020
Learning with Noisy Class Labels for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020
VisDrone-DET2020: The Vision Meets Drone Object Detection in Image Challenge Results.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020
Blind Tone-mapped Image Quality Assessment and Enhancement via Disentangled Representation Learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
A<sup>2</sup>RMNet: Adaptively Aspect Ratio Multi-Scale Network for Object Detection in Remote Sensing Images.
Remote. Sens., 2019
基于镜头分割与空域注意力模型的视频广告分类方法 (Video Advertisement Classification Method Based on Shot Segmentation and Spatial Attention Model).
计算机科学, 2019
Multi Information Fusion Network for Saliency Quality Assessment.
IEICE Trans. Inf. Syst., 2019
Subjective and Objective De-raining Quality Assessment Towards Authentic Rain Image.
CoRR, 2019
Class Activation Map Generation by Representative Class Selection and Multi-Layer Feature Fusion.
CoRR, 2019
Hierarchy Neighborhood Discriminative Hashing for An Unified View of Single-Label and Multi-Label Image retrieval.
CoRR, 2019
A New Deep Segmentation Quality Assessment Network for Refining Bounding Box Based Segmentation.
IEEE Access, 2019
A New Few-shot Segmentation Network Based on Class Representation.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019
Incorporating Non-local and Task-specific Features for Instance Segmentation.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019
Bounding Box based Annotation Generation for Semantic Segmentation by Boundary Detection.
Proceedings of the 2019 International Symposium on Intelligent Signal Processing and Communication Systems, 2019
Blind Image Sharpness Assessment And Enhancement via Deep Auxiliary Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Beyond Synthetic Data: A Blind Deraining Quality Assessment Metric Towards Authentic Rain Image.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Instance Segmentation by Learning Deep Feature in Embedding Space.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Class Activation Map Generation by Multiple Level Class Grouping and Orthogonal Constraint.
Proceedings of the 2019 Digital Image Computing: Techniques and Applications, 2019
2018
Hierarchical Parsing Net: Semantic Scene Parsing From Global Scene to Objects.
IEEE Trans. Multim., 2018
Seeds-Based Part Segmentation by Seeds Propagation and Region Convexity Decomposition.
IEEE Trans. Multim., 2018
Generic Proposal Evaluator: A Lazy Learning Strategy Toward Blind Proposal Quality Assessment.
IEEE Trans. Intell. Transp. Syst., 2018
A Perceptually Weighted Rank Correlation Indicator for Objective Image Quality Assessment.
IEEE Trans. Image Process., 2018
LETRIST: Locally Encoded Transform Feature Histogram for Rotation-Invariant Texture Classification.
IEEE Trans. Circuits Syst. Video Technol., 2018
Globally Measuring the Similarity of Superpixels by Binary Edge Maps for Superpixel Clustering.
IEEE Trans. Circuits Syst. Video Technol., 2018
An Unsupervised Method to Extract Video Object via Complexity Awareness and Object Local Parts.
IEEE Trans. Circuits Syst. Video Technol., 2018
Toward a Blind Quality Metric for Temporally Distorted Streaming Video.
IEEE Trans. Broadcast., 2018
Global and local semantics-preserving based deep hashing for cross-modal retrieval.
Neurocomputing, 2018
A Propagation Method for Multi Object Tracklet Repair.
IEICE Trans. Inf. Syst., 2018
Salient Object Detection and Segmentation via Ultra-Contrast.
IEEE Access, 2018
Weakly Supervised Semantic Segmentation by Multiple Group Cosegmentation.
Proceedings of the IEEE Visual Communications and Image Processing, 2018
Boosting Scene Parsing Performance via Reliable Scale Prediction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Key-Word-Aware Network for Referring Expression Image Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018
A classification and clustering method for tracking multiple objects.
Proceedings of the IEEE 8th Annual Computing and Communication Workshop and Conference, 2018
Adaptive Multi-Scale Information Flow for Object Detection.
Proceedings of the British Machine Vision Conference 2018, 2018
2017
Blind Image Quality Assessment Based on Rank-Order Regularized Regression.
IEEE Trans. Multim., 2017
Learning Efficient Binary Codes From High-Level Feature Representations for Multilabel Image Retrieval.
IEEE Trans. Multim., 2017
Video Object Segmentation via Global Consistency Aware Query Strategy.
IEEE Trans. Multim., 2017
Weakly Supervised Part Proposal Segmentation From Multiple Images.
IEEE Trans. Image Process., 2017
Semi-supervised manifold-embedded hashing with joint feature representation and classifier learning.
Pattern Recognit., 2017
Manifold-ranking embedded order preserving hashing for image semantic retrieval.
J. Vis. Commun. Image Represent., 2017
L2SSP: Robust keypoint description using local second-order statistics with soft-pooling.
Neurocomputing, 2017
A New Multiple Group Cosegmentation Model by Proposal Selection Strategy.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
Segmentation quality evaluation based on multi-scale convolutional neural networks.
Proceedings of the 2017 IEEE Visual Communications and Image Processing, 2017
A CNN-based segmentation model for segmenting foreground by a probability map.
Proceedings of the 2017 International Symposium on Intelligent Signal Processing and Communication Systems, 2017
Blind proposal quality assessment via deep objectness representation and local linear regression.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
2016
Blind Image Quality Assessment Based on Multichannel Feature Fusion and Label Transfer.
IEEE Trans. Circuits Syst. Video Technol., 2016
Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation.
J. Vis. Commun. Image Represent., 2016
Person re-identification based on multi-region-set ensembles.
J. Vis. Commun. Image Represent., 2016
Cosegmentation of multiple image groups.
Comput. Vis. Image Underst., 2016
Q-DNN: A quality-aware deep neural network for blind assessment of enhanced images.
Proceedings of the 2016 Visual Communications and Image Processing, 2016
Part propagation for local part segmentation.
Proceedings of the 2016 Visual Communications and Image Processing, 2016
QualityNet: Segmentation quality evaluation with deep convolutional networks.
Proceedings of the 2016 Visual Communications and Image Processing, 2016
2015
Fast HEVC Inter CU Decision Based on Latent SAD Estimation.
IEEE Trans. Multim., 2015
Constrained Directed Graph Clustering and Segmentation Propagation for Multiple Foregrounds Cosegmentation.
IEEE Trans. Circuits Syst. Video Technol., 2015
Exploring space-frequency co-occurrences via local quantized patterns for texture representation.
Pattern Recognit., 2015
No reference image quality assessment metric via multi-domain structural information and piecewise regression.
J. Vis. Commun. Image Represent., 2015
2014
A Fast HEVC Inter CU Selection Method Based on Pyramid Motion Divergence.
IEEE Trans. Multim., 2014
MRF-Based Fast HEVC Inter CU Decision With the Variance of Absolute Differences.
IEEE Trans. Multim., 2014
Repairing Bad Co-Segmentation Using Its Quality Evaluation and Segment Propagation.
IEEE Trans. Image Process., 2014
Unsupervised Multiclass Region Cosegmentation via Ensemble Clustering and Energy Minimization.
IEEE Trans. Circuits Syst. Video Technol., 2014
Noise-Robust Texture Description Using Local Contrast Patterns via Global Measures.
IEEE Signal Process. Lett., 2014
Bird breed classification and annotation using saliency based graphical model.
J. Vis. Commun. Image Represent., 2014
Cosegmentation from similar backgrounds.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014
Favorite object extraction using web images.
Proceedings of the IEEE International Symposium on Circuits and Systemss, 2014
Using mid-high level cues to detect salient object.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Fast and efficient inter CU decision for high efficiency video coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Automatic image co-segmentation using geometric mean saliency.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
On Multiple Image Group Cosegmentation.
Proceedings of the Computer Vision - ACCV 2014, 2014
2013
From Logo to Object Segmentation.
IEEE Trans. Multim., 2013
Co-Salient Object Detection From Multiple Images.
IEEE Trans. Multim., 2013
Feature Adaptive Co-Segmentation by Complexity Awareness.
IEEE Trans. Image Process., 2013
Image Cosegmentation by Incorporating Color Reward Strategy and Active Contour Model.
IEEE Trans. Cybern., 2013
Robust texture representation by using binary code ensemble.
Proceedings of the 2013 Visual Communications and Image Processing, 2013
Object co-segmentation based on directed graph clustering.
Proceedings of the 2013 Visual Communications and Image Processing, 2013
Segmenting specific object based on logo detection.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013
Complexity awareness based feature adaptive co-segmentation.
Proceedings of the IEEE International Conference on Image Processing, 2013
2012
Object Co-Segmentation Based on Shortest Path Algorithm and Saliency Model.
IEEE Trans. Multim., 2012
A new co-saliency model via pairwise constraint graph matching.
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2012
Image co-segmentation via active contours.
Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012
2011
Change detection in unregistered optical satellite images using combinatorial clustering method.
Proceedings of the International Symposium on Intelligent Signal Processing and Communications Systems, 2011
2008
A Novel Blind Image Watermarking Scheme Based on Support Vector Machine in DCT Domain.
Proceedings of the 2008 International Conference on Computational Intelligence and Security, 2008