Bohan Zhai

According to our database¹, Bohan Zhai authored at least 14 papers between 2020 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Law of Vision Representation in MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

Multitask Vision-Language Prompt Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

COCO is "ALL" You Need for Visual Instruction Fine-tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Integer-Only Zero-Shot Quantization for Efficient Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets.

[BibT_eX]

[DOI]

CoRR, 2021

Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

You Only Group Once: Efficient Point-Cloud Processing with Token Representation and Relation Inference Module.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2020

SqueezeWave: Extremely Lightweight Vocoders for On-device Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Bohan Zhai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...