David Junhao Zhang

According to our database¹, David Junhao Zhang authored at least 20 papers between 2021 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning.

CoRR, 2024

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.

CoRR, 2024

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation.

CoRR, 2024

Towards A Better Metric for Text-to-Video Generation.

CoRR, 2024

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions.

CoRR, 2024

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images.

Proceedings of the Computer Vision - ECCV 2024, 2024

DragAnything: Motion Control for Anything Using Entity Representation.

Proceedings of the Computer Vision - ECCV 2024, 2024

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

CoRR, 2023

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation.

CoRR, 2023

Dataset Condensation via Generative Model.

CoRR, 2023

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks.

CoRR, 2023

Label-Efficient Online Continual Object Detection in Streaming Video.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Too Large; Data Reduction for Vision-Language Pre-Training.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Making Vision Transformers Efficient from A Token Sparsification View.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

MorphMLP: A Self-Attention Free, MLP-Like Backbone for Image and Video.

CoRR, 2021
