2025

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions.

Zhenzhi Wang

Jiaqi Yang

CoRR, June, 2025

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models.

CoRR, February, 2025

CyberHost: A One-stage Diffusion Framework for Audio-driven Talking Body Generation.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

FADA: Fast Diffusion Avatar Synthesis with Mixed-Supervised Multi-CFG Distillation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention.

CoRR, 2024

Superior and Pragmatic Talking Face Generation with Teacher-Student Framework.

CoRR, 2024

2023

HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation.

CoRR, 2023

2022

Person Re-Identification by Context-Aware Part Attention and Multi-Head Collaborative Learning.

IEEE Trans. Inf. Forensics Secur., 2022

Deep Learning for Person Re-Identification: A Survey and Outlook.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

2021

Video person re-identification with global statistic pooling and self-attention distillation.

Gaojie Lin

Sanyuan Zhao

Jianbing Shen

Neurocomputing, 2021