Fuxiao Liu

Orcid: 0009-0009-4662-2810

According to our database¹, Fuxiao Liu authored at least 21 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

AIDE: Agentically Improve Visual Language Model with Domain Experts.

Ming-Chang Chiu, …

CoRR, February, 2025

2024

DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments.

…, Pedro Sandoval-Segura, Chengyuan Zhang, …

CoRR, 2024

DeepFM-Crispr: Prediction of CRISPR On-Target Effects via Deep Learning.

CoRR, 2024

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders.

…, Subhashree Radhakrishnan, …, Bryan Catanzaro, …

CoRR, 2024

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics.

CoRR, 2024

Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey.

…, Julian J. McAuley, …

CoRR, 2024

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities.

…, Souradip Chakraborty, …, Brian M. Sadler, …, Amrit Singh Bedi

CoRR, 2024

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning.

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond.

…, Zhuosheng Zhang, …

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.

…, Gedas Bertasius, …

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.

CoRR, 2023

Driving Policy Prediction based on Deep Learning Models.

CoRR, 2023

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps.

CoRR, 2023

Aligning Large Multi-Modal Model with Robust Instruction Tuning.

CoRR, 2023

DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents.

…, Chris Tensmeyer

CoRR, 2023

COVID-VTS: Fact Extraction and Verification on Short Video Platforms.

…, Abhinav Shrivastava

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2021

Visual News: Benchmark and Challenges in News Image Captioning.

…, Vicente Ordonez

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

VisualNews: Benchmark and Challenges in Entity-aware Image Captioning.

…, Vicente Ordonez

CoRR, 2020
