Yunlong Tang

ORCID: 0000-0003-2796-1787

Affiliations:
  • University of Rochester, NY, USA
  • Tencent (China), Shenzhen, China (former)
  • Southern University of Science and Technology, Department of Computer Science and Engineering, Shenzhen, China (former)


According to our database, Yunlong Tang authored at least 15 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach.
CoRR, 2024

VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CoRR, 2024

Scaling Concept With Text-Guided Diffusion Models.
CoRR, 2024

MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models.
CoRR, 2024

AIM 2024 Challenge on Video Saliency Prediction: Methods and Results.
CoRR, 2024

CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion.
CoRR, 2024

Do More Details Always Introduce More Hallucinations in LVLM-based Image Captioning?
CoRR, 2024

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning.
CoRR, 2024

AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue.
CoRR, 2024

EAGLE: Egocentric AGgregated Language-video Engine.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
Video Understanding with Large Language Models: A Survey.
CoRR, 2023

LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad.
CoRR, 2023

LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary Captioning.
CoRR, 2023

Caption Anything: Interactive Image Description with Diverse Multimodal Controls.
CoRR, 2023

2022
Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward.
Proceedings of the Computer Vision - ACCV 2022, 2022

