2025

Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models.

[DOI]

Xuran Ma

Yexin Liu

CoRR, April, 2025

Temporal Regularization Makes Your Video Generator Stronger.

[DOI]

CoRR, March, 2025

Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View.

[DOI]

CoRR, March, 2025

VideoMerge: Towards Training-free Long Video Generation.

[DOI]

Siyang Zhang

Harry Yang

Ser-Nam Lim

CoRR, March, 2025

LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization.

[DOI]

CoRR, March, 2025

VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer.

[DOI]

CoRR, February, 2025

Encrypted Large Model Inference: The Equivariant Encryption Paradigm.

[DOI]

CoRR, February, 2025

2024

Next Patch Prediction for Autoregressive Visual Generation.

[DOI]

CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.

[DOI]

CoRR, 2024

OmniCreator: Self-Supervised Unified Generation with Universal Editing.

[DOI]

CoRR, 2024

DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses.

[DOI]

CoRR, 2024

Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments.

[DOI]

CoRR, 2024

Model Agnostic Hybrid Sharding For Heterogeneous Distributed Inference.

[DOI]

CoRR, 2024

Complete Security and Privacy for AI Inference in Decentralized Systems.

[DOI]

CoRR, 2024

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks.

[DOI]

CoRR, 2024

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation.

[DOI]

CoRR, 2024

2023

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation.

[DOI]

CoRR, 2023

Make-A-Video: Text-to-Video Generation without Text-Video Data.

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness.

[DOI]

CoRR, 2022

Using Mixup as a Regularizer Can Surprisingly Improve Accuracy & Out-of-Distribution Robustness.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer.

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Robustness and Generalization via Generative Adversarial Training.

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2019

Fine-grained Synthesis of Unrestricted Adversarial Examples.

[DOI]

CoRR, 2019

2014

Low-rank SIFT: An affine invariant feature for place recognition.

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014