Daquan Zhou

Orcid: 0000-0002-4771-1796

According to our database, Daquan Zhou authored at least 47 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions.

CoRR, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models.

CoRR, 2024

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation.

CoRR, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning.

CoRR, 2024

Chain of Thought Explanation for Dialogue State Tracking.

CoRR, 2024

Sora Generates Videos with Stunning Geometrical Consistency.

CoRR, 2024

Magic-Me: Identity-Specific Video Customized Diffusion.

CoRR, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation.

CoRR, 2024

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome.

Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Token Selection is a Simple Booster for Vision Transformers.

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost.

CoRR, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas.

CoRR, 2023

Low-Resolution Self-Attention for Semantic Segmentation.

CoRR, 2023

MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask.

CoRR, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs.

CoRR, 2023

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.

CoRR, 2023

DiM: Distilling Dataset into Generative Model.

CoRR, 2023

Expanding Small-Scale Datasets with Guided Imagination.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient.

Proceedings of the International Conference on Machine Learning, 2023

Dataset Quantization.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Probabilistic Model Made Slim.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Velocity-to-velocity human motion forecasting.

Pattern Recognit., 2022

Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

MagicVideo: Efficient Video Generation With Latent Diffusion Models.

CoRR, 2022

MagicMix: Semantic Mixing with Diffusion Models.

CoRR, 2022

M²BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation.

CoRR, 2022

Deep Model Reassembly.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sharpness-Aware Training for Free.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding The Robustness in Vision Transformers.

Animashree Anandkumar

Jiashi Feng

José M. Álvarez

Proceedings of the International Conference on Machine Learning, 2022

Shunted Self-Attention via Multi-Scale Token Aggregation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Refiner: Refining Self-attention for Vision Transformers.

CoRR, 2021

Token Labeling: Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet.

CoRR, 2021

DeepViT: Towards Deeper Vision Transformer.

CoRR, 2021

All Tokens Matter: Token Labeling for Training Better Vision Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AutoSpace: Neural Architecture Search with Less Human Interference.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Coordinate Attention for Efficient Mobile Network Design.

Qibin Hou

Daquan Zhou

Jiashi Feng

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks.

CoRR, 2020

ConvBERT: Improving BERT with Span-based Dynamic Convolution.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Epitome Search for Architecture-Agnostic Network Compression.

Proceedings of the 8th International Conference on Learning Representations, 2020

Rethinking Bottleneck Structure for Efficient Mobile Network Design.

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Deep Model Compression via Filter Auto-sampling.

CoRR, 2019

PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
