Training-Free Efficient Video Generation via Dynamic Token Carving.
CoRR, May, 2025
TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models.
CoRR, March, 2025
Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.
,
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Vis. Comput. Graph., February, 2025
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation.
CoRR, February, 2025
MagicStick: Controllable Video Editing via Control Handle Transformations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
ToonCrafter: Generative Cartoon Interpolation.
ACM Trans. Graph., December, 2024
StyleCrafter: Taming Artistic Video Diffusion with Reference-Augmented Adapter Learning.
ACM Trans. Graph., December, 2024
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis.
CoRR, 2024
DynamiCrafter: Animating Open-Domain Images with Video Diffusion Priors.
Proceedings of the Computer Vision - ECCV 2024, 2024
Storytelling Video Generation with Retrieval Augmentation and Character Consistency.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024
Scale-Arbitrary Invertible Image Downscaling.
IEEE Trans. Image Process., 2023
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter.
CoRR, 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors.
CoRR, 2023
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Video Colorization with Pre-trained Text-to-Image Diffusion Models.
CoRR, 2023
Improved Diffusion-based Image Colorization via Piggybacked Models.
CoRR, 2023
Real-World Image Variation by Aligning Diffusion Inversion Chain.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields.
CoRR, 2022
Scale-arbitrary Invertible Image Downscaling.
CoRR, 2022
Flow-aware synthesis: A generic motion model for video frame interpolation.
Comput. Vis. Media, 2021
Unstructured Knowledge Access in Task-oriented Dialog Modeling using Language Inference, Knowledge Retrieval and Knowledge-Integrative Response Generation.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2021