Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning.
CoRR, January, 2025
Identity-Preserving Video Dubbing Using Motion Warping.
CoRR, January, 2025
GPAvatar: Generalizable and Precise Head Avatar from Image(s).
CoRR, 2024
GPAvatar: Generalizable and Precise Head Avatar from Image(s).
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Accurate 3D Face Reconstruction with Facial Component Tokens.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
MODA: Mapping-Once Audio-driven Portrait Animation with Dual Attentions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Accelerating the Training of Video Super-resolution Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Dual Semantic Fusion Network for Video Object Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Robust Visual Tracking via Statistical Positive Sample Generation and Gradient Aware Learning.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019