Sicong Leng

Orcid: 0000-0002-3084-5026

According to our database1, Sicong Leng authored at least 11 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays.
CoRR, 2024

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss.
CoRR, 2024

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio.
CoRR, 2024

AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention.
CoRR, 2024

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs.
CoRR, 2024

Constrained Layout Generation with Factor Graphs.
CoRR, 2024

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Constrained Layout Generation with Factor Graphs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Uncovering what, why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Tell2Design: A Dataset for Language-Guided Floor Plan Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2021
Interventional Video Grounding With Dual Contrastive Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021


  Loading...