Zhuofan Zong

According to our database, Zhuofan Zong authored at least 17 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping.
CoRR, 2024

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM.
CoRR, 2024

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models.
CoRR, 2024

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models.
CoRR, 2024

MoVA: Adapting Mixture of Vision Experts to Multimodal Context.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths.
CoRR, 2023

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DETRs with Collaborative Hybrid Assignments Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Large-batch Optimization for Dense Visual Predictions.
CoRR, 2022

Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Self-slimmed Vision Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20-24, 2021

2020
Graph Attention Based Proposal 3D ConvNets for Action Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

