PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation.
CoRR, April, 2025
CM-YOLO: Context Modulated Representation Learning for Ship Detection.
IEEE Trans. Geosci. Remote. Sens., 2025
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization.
Proceedings of AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting.
CoRR, 2024
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024
P2P: Transforming from Point Supervision to Explicit Visual Prompt for Object Detection and Segmentation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Just a Hint: Point-Supervised Camouflaged Object Detection.
Proceedings of Computer Vision - ECCV 2024, 2024
Tracking without Label: Unsupervised Multiple Object Tracking via Contrastive Similarity Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Revisiting Skeleton-based Action Recognition.
CoRR, 2021
DecAug: Augmenting HOI Detection via Decomposition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
FineGym: A Hierarchical Video Dataset for Fine-Grained Action Understanding.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Intra- and Inter-Action Understanding via Temporal Action Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Find and Focus: Retrieve and Localize Video Events with Natural Language Queries.
Proceedings of Computer Vision - ECCV 2018, 2018
Digital Longmen Project: A Free Walking VR System with Image-Based Restoration.
Proceedings of Computer Vision - ACCV 2016 Workshops, 2016