Daquan Zhou

Orcid: 0000-0002-4771-1796

According to our database1, Daquan Zhou authored at least 47 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions.
CoRR, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models.
CoRR, 2024

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation.
CoRR, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning.
CoRR, 2024

Chain of Thought Explanation for Dialogue State Tracking.
CoRR, 2024

Sora Generates Videos with Stunning Geometrical Consistency.
CoRR, 2024

Magic-Me: Identity-Specific Video Customized Diffusion.
CoRR, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation.
CoRR, 2024

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EPIM: Efficient Processing-In-Memory Accelerators based on Epitome.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Token Selection is a Simple Booster for Vision Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost.
CoRR, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas.
CoRR, 2023

Low-Resolution Self-Attention for Semantic Segmentation.
CoRR, 2023

MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask.
CoRR, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs.
CoRR, 2023

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.
CoRR, 2023

DiM: Distilling Dataset into Generative Model.
CoRR, 2023

Expanding Small-Scale Datasets with Guided Imagination.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient.
Proceedings of the International Conference on Machine Learning, 2023

Dataset Quantization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diffusion Probabilistic Model Made Slim.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Velocity-to-velocity human motion forecasting.
Pattern Recognit., 2022

Progressive Tandem Learning for Pattern Recognition With Deep Spiking Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

MagicVideo: Efficient Video Generation With Latent Diffusion Models.
CoRR, 2022

MagicMix: Semantic Mixing with Diffusion Models.
CoRR, 2022

M<sup>2</sup>BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation.
CoRR, 2022

Deep Model Reassembly.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sharpness-Aware Training for Free.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Understanding The Robustness in Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2022

Shunted Self-Attention via Multi-Scale Token Aggregation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Refiner: Refining Self-attention for Vision Transformers.
CoRR, 2021

Token Labeling: Training a 85.4% Top-1 Accuracy Vision Transformer with 56M Parameters on ImageNet.
CoRR, 2021

DeepViT: Towards Deeper Vision Transformer.
CoRR, 2021

All Tokens Matter: Token Labeling for Training Better Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AutoSpace: Neural Architecture Search with Less Human Interference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Coordinate Attention for Efficient Mobile Network Design.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks.
CoRR, 2020

ConvBERT: Improving BERT with Span-based Dynamic Convolution.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Neural Epitome Search for Architecture-Agnostic Network Compression.
Proceedings of the 8th International Conference on Learning Representations, 2020

Rethinking Bottleneck Structure for Efficient Mobile Network Design.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Deep Model Compression via Filter Auto-sampling.
CoRR, 2019

PANet: Few-Shot Image Semantic Segmentation With Prototype Alignment.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...