Yupan Huang

According to our database, Yupan Huang authored at least 14 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Kosmos-2.5: A Multimodal Literate Model.
CoRR, 2023

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models.
CoRR, 2023

TextDiffuser: Diffusion Models as Text Painters.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Reinforced Short-Length Hashing.
IEEE Trans. Circuits Syst. Video Technol., 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.
CoRR, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Be Specific, Be Clear: Bridging Machine and Human Captions by Scene-Guided Transformer.
Proceedings of the MMPT@ICMR2021: Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding, 2021

Seeing Out of the Box: End-to-End Pre-Training for Vision-Language Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Reinforcing Short-Length Hashing.
CoRR, 2020

2019
Decoupling Localization and Classification in Single Shot Temporal Action Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

