ReferSAM: Unleashing Segment Anything Model for Referring Image Segmentation.
IEEE Trans. Circuits Syst. Video Technol., May, 2025
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability.
CoRR, March, 2025
Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning.
IEEE Trans. Image Process., 2025
Balanced Classification: A Unified Framework for Long-Tailed Object Detection.
IEEE Trans. Multim., 2024
Towards Discriminative Feature Generation for Generalized Zero-Shot Learning.
IEEE Trans. Multim., 2024
AlignZeg: Mitigating Objective Misalignment for Zero-Shot Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Neighborhood-Adaptive Multi-Cluster Ranking for Deep Metric Learning.
IEEE Trans. Circuits Syst. Video Technol., April, 2023
Frequency-based Zero-Shot Learning with Phase Augmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Deep Fourier Ranking Quantization for Semi-Supervised Image Retrieval.
IEEE Trans. Image Process., 2022
Dual Part Discovery Network for Zero-Shot Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022
Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021