2025
Training-Free Efficient Video Generation via Dynamic Token Carving.
CoRR, May, 2025

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models.
CoRR, March, 2025

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.
IEEE Trans. Vis. Comput. Graph., February, 2025

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation.
CoRR, February, 2025

MagicStick: Controllable Video Editing via Control Handle Transformations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024
ToonCrafter: Generative Cartoon Interpolation.
ACM Trans. Graph., December, 2024

StyleCrafter: Taming Artistic Video Diffusion with Reference-Augmented Adapter Learning.
ACM Trans. Graph., December, 2024

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis.
CoRR, 2024

DynamiCrafter: Animating Open-Domain Images with Video Diffusion Priors.
Proceedings of the Computer Vision - ECCV 2024, 2024

Storytelling Video Generation with Retrieval Augmentation and Character Consistency.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

2023
Scale-Arbitrary Invertible Image Downscaling.
IEEE Trans. Image Process., 2023

StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter.
CoRR, 2023

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation.
CoRR, 2023

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors.
CoRR, 2023

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation.
CoRR, 2023

Video Colorization with Pre-trained Text-to-Image Diffusion Models.
CoRR, 2023

Improved Diffusion-based Image Colorization via Piggybacked Models.
CoRR, 2023

Real-World Image Variation by Aligning Diffusion Inversion Chain.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields.
CoRR, 2022

Scale-arbitrary Invertible Image Downscaling.
CoRR, 2022

2021
Flow-aware synthesis: A generic motion model for video frame interpolation.
Comput. Vis. Media, 2021

Unstructured Knowledge Access in Task-oriented Dialog Modeling using Language Inference, Knowledge Retrieval and Knowledge-Integrative Response Generation.
CoRR, 2021