2025

ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation.

[DOI]

Sun'ao Liu

Hongtao Xie

Jiannan Ge

Yongdong Zhang

IEEE Trans. Circuits Syst. Video Technol., May, 2025

SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability.

[DOI]

CoRR, March, 2025

Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning.

[DOI]

IEEE Trans. Image Process., 2025

2024

Balanced Classification: A Unified Framework for Long-Tailed Object Detection.

[DOI]

IEEE Trans. Multim., 2024

Towards Discriminative Feature Generation for Generalized Zero-Shot Learning.

[DOI]

IEEE Trans. Multim., 2024

AlignZeg: Mitigating Objective Misalignment for Zero-Shot Semantic Segmentation.

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval.

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Neighborhood-Adaptive Multi-Cluster Ranking for Deep Metric Learning.

[DOI]

IEEE Trans. Circuits Syst. Video Technol., April, 2023

Frequency-based Zero-Shot Learning with Phase Augmentation.

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval.

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval.

[DOI]

IEEE Trans. Image Process., 2022

Dual Part Discovery Network for Zero-Shot Learning.

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021