2025
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework.
CoRR, April, 2025

FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model.
CoRR, March, 2025

2024
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling.
CoRR, 2024

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Plan, Posture and Go: Towards Open-Vocabulary Text-to-Motion Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Plan, Posture and Go: Towards Open-World Text-to-Motion Generation.
CoRR, 2023

Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution.
CoRR, 2023

Efficient Meshy Neural Fields for Animatable Human Avatars.
CoRR, 2023