Fuxiao Liu

Orcid: 0009-0009-4662-2810

According to our database1, Fuxiao Liu authored at least 20 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments.
CoRR, 2024

DeepFM-Crispr: Prediction of CRISPR On-Target Effects via Deep Learning.
CoRR, 2024

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders.
CoRR, 2024

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics.
CoRR, 2024

Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey.
CoRR, 2024

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities.
CoRR, 2024

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.
CoRR, 2023

Driving Policy Prediction based on Deep Learning Models.
CoRR, 2023

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps.
CoRR, 2023

Aligning Large Multi-Modal Model with Robust Instruction Tuning.
CoRR, 2023

DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents.
CoRR, 2023

COVID-VTS: Fact Extraction and Verification on Short Video Platforms.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2021
Visual News: Benchmark and Challenges in News Image Captioning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
VisualNews : Benchmark and Challenges in Entity-aware Image Captioning.
CoRR, 2020


  Loading...