2025
PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation.
CoRR, April, 2025

CM-YOLO: Context Modulated Representation Learning for Ship Detection.
IEEE Trans. Geosci. Remote. Sens., 2025

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization.
Proceedings of AAAI-25, sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting.
CoRR, 2024

FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024

P2P: Transforming from Point Supervision to Explicit Visual Prompt for Object Detection and Segmentation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Just a Hint: Point-Supervised Camouflaged Object Detection.
Proceedings of Computer Vision - ECCV 2024, 2024

2023
Tracking without Label: Unsupervised Multiple Object Tracking via Contrastive Similarity Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021
Revisiting Skeleton-based Action Recognition.
CoRR, 2021

DecAug: Augmenting HOI Detection via Decomposition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
FineGym: A Hierarchical Video Dataset for Fine-Grained Action Understanding.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Intra- and Inter-Action Understanding via Temporal Action Parsing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2018
Find and Focus: Retrieve and Localize Video Events with Natural Language Queries.
Proceedings of Computer Vision - ECCV 2018, 2018

2016
Digital Longmen Project: A Free Walking VR System with Image-Based Restoration.
Proceedings of Computer Vision - ACCV 2016 Workshops, 2016