2025

Attention-disentangled Uniform Orthogonal Feature Space Optimization for Few-shot Object Detection.

[DOI]

Taijin Zhao

Heqian Qiu

CoRR, June, 2025

Cognition Transferring and Decoupling for Text-Supervised Egocentric Semantic Segmentation.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2025

Unsupervised Ego- and Exo-centric Dense Procedural Activity Captioning via Gaze Consensus Adaptation.

[DOI]

CoRR, April, 2025

Challenges and Trends in Egocentric Vision: A Survey.

[DOI]

CoRR, March, 2025

Class Incremental Learning With Less Forgetting Direction and Equilibrium Point.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., February, 2025

MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., January, 2025

EgoMe: Follow Me via Egocentric View in Real World.

[DOI]

CoRR, January, 2025

Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion.

[DOI]

IEEE Trans. Multim., 2025

Geodesic-Aligned Gradient Projection for Continual Task Learning.

[DOI]

IEEE Trans. Image Process., 2025

Adaptively forget with crossmodal and textual distillation for class-incremental video captioning.

[DOI]

Neurocomputing, 2025

2024

Continual Cross-Domain Image Compression via Entropy Prior Guided Knowledge Distillation and Scalable Decoding.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2024

Robust Unpaired Image Dehazing via Adversarial Deformation Constraint.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., September, 2024

Learning Offset Probability Distribution for Accurate Object Detection.

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., May, 2024

TridentCap: Image-Fact-Style Trident Semantic Framework for Stylized Image Captioning.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2024

CrowdCaption++: Collective-Guided Crowd Scenes Captioning.

[DOI]

IEEE Trans. Multim., 2024

Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond.

[DOI]

IEEE Trans. Multim., 2024

Oriented-DINO: Angle Decoupling Prediction and Consistency Optimizing for Oriented Detection Transformer.

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

VLM-guided Explicit-Implicit Complementary novel class semantic learning for few-shot object detection.

[DOI]

Expert Syst. Appl., 2024

ARIC: An Activity Recognition Dataset in Classroom Surveillance Images.

[DOI]

CoRR, 2024

Region Prompt Tuning: Fine-grained Scene Text Detection Utilizing Region Text Prompt.

[DOI]

CoRR, 2024

Slightly Shift New Classes to Remember Old Classes for Video Class-Incremental Learning.

[DOI]

CoRR, 2024

MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning.

[DOI]

CoRR, 2024

Proposal-level Correction Guided by CLIP for Few-shot Object Detection.

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

IoU-CLIP: IoU-Aware Language-Image Model Tuning for Open Vocabulary Object Detection.

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

DP-RSCAP: Dual Prompt-Based Scene and Entity Network for Remote Sensing Image Captioning.

[DOI]

Proceedings of the IGARSS 2024, 2024

Attribute-Prompting Multi-Modal Object Reasoning Transformer for Remote Sensing Visual Grounding.

[DOI]

Proceedings of the IGARSS 2024, 2024

Video Class-Incremental Learning With Clip Based Transformer.

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2024

A Text Detector Based on the Specific Text Prompt.

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2024

Class Incremental Learning with Multi-Teacher Distillation.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Prompt-Driven Referring Image Segmentation with Instance Contrasting.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HumanFormer: Human-centric Prompting Multi-modal Perception Transformer for Referring Crowd Detection.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Must Unsupervised Continual Learning Relies on Previous Information?

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Disturbed Augmentation Invariance for Unsupervised Visual Representation Learning.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., November, 2023

Cross-Modal Recurrent Semantic Comprehension for Referring Image Segmentation.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., July, 2023

CrossDet++: Growing Crossline Representation for Object Detection.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., March, 2023

Bias-Correction Feature Learner for Semi-Supervised Instance Segmentation.

[DOI]

IEEE Trans. Multim., 2023

What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning.

[DOI]

IEEE Trans. Multim., 2023

Unsupervised Visual Representation Learning via Multi-Dimensional Relationship Alignment.

[DOI]

IEEE Trans. Image Process., 2023

DRDet: Dual-Angle Rotated Line Representation for Oriented Object Detection.

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2023

GRSDet: Learning to Generate Local Reverse Samples for Few-shot Object Detection.

[DOI]

CoRR, 2023

CFS: Character Feature Summarization Model for Real-time End-to-end Text Spotting.

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

Novel-Registrable Weights and Region-Level Contrastive Learning for Incremental Few-shot Object Detection.

[DOI]

Proceedings of the Neural Information Processing - 30th International Conference, 2023

PTCP: Alleviate Layer Collapse in Pruning at Initialization via Parameter Threshold Compensation and Preservation.

[DOI]

Proceedings of the Neural Information Processing - 30th International Conference, 2023

Optimizing Mode Connectivity for Class Incremental Learning.

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Confusion Mixup Regularized Multimodal Fusion Network for Continual Egocentric Activity Recognition.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Contrastive Continuity on Augmentation Stability Rehearsal for Continual Self-Supervised Learning.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Incrementer: Transformer for Class-Incremental Semantic Segmentation with Knowledge Distillation Focusing on Old Class.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CafeBoost: Causal Feature Boost to Eliminate Task-Induced Bias for Class Incremental Learning.

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Bal-R$^2$CNN: High Quality Recurrent Object Detection With Balance Optimization.

[DOI]

IEEE Trans. Multim., 2022

POS-Trends Dynamic-Aware Model for Video Caption.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Real-time panoptic segmentation with relationship between adjacent pixels and boundary prediction.

[DOI]

Neurocomputing, 2022

Instance-level Context Attention Network for instance segmentation.

[DOI]

Neurocomputing, 2022

Mining Regional Relation from Pixel-wise Annotation for Scene Parsing.

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

DE-CrossDet: Divisible and Extensible Crossline Representation for Object Detection.

[DOI]

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Cross-Domain Object Detection with Missing Classes in Target Domain.

[DOI]

Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

RefCrowd: Grounding the Target in Crowd with Referring Expressions.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Pedestrian Attribute Recognition Based on Association Rules.

[DOI]

Diwei Xie

Heqian Qiu

Linfeng Xu

Proceedings of the 8th IEEE International Conference on Cloud Computing and Intelligent Systems, 2022

2021

CrossDet: Crossline Representation for Object Detection.

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Hierarchical Context Features Embedding for Object Detection.

[DOI]

IEEE Trans. Multim., 2020

A multi-scale language embedding network for proposal-free referring expression comprehension.

[DOI]

Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Multi-stage Tag Guidance Network in Video Caption.

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension.

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VisDrone-DET2020: The Vision Meets Drone Object Detection in Image Challenge Results.

[DOI]

Apostolos Axenopoulos

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Offset Bin Classification Network for Accurate Object Detection.

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

A<sup>2</sup>RMNet: Adaptively Aspect Ratio Multi-Scale Network for Object Detection in Remote Sensing Images.

[DOI]

Remote. Sens., 2019

VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results.

[DOI]

Sai Saketh Chennamsetty

Shuhao Chen

Shuo Wei

Srinivas S. S. Kruthiventi

Varghese Alex Kollerathu

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018

VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results.

[DOI]

Konstantinos Avgerinakis

Naveen Kumar Vedurupaka

Nehal Mamgain

Nitin Bansal

Oliver Acatay

Panagiotis Giannakeris

Vineeth N. Balasubramanian

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018