Bohan Zhai

According to our database1, Bohan Zhai authored at least 12 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.
CoRR, 2024

COCO is "ALL" You Need for Visual Instruction Fine-tuning.
CoRR, 2024

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning.
CoRR, 2024

Multitask Vision-Language Prompt Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models.
CoRR, 2023

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption.
CoRR, 2023

2022
Integer-Only Zero-Shot Quantization for Efficient Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets.
CoRR, 2021

Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition.
CoRR, 2021

You Only Group Once: Efficient Point-Cloud Processing with Token Representation and Relation Inference Module.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2020
SqueezeWave: Extremely Lightweight Vocoders for On-device Speech Synthesis.
CoRR, 2020


  Loading...