2024
Finite Element Analysis of Electrostatic Coupling in LISA Pathfinder Inertial Sensors.
Sensors, October, 2024

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
CoRR, 2024

Law of Vision Representation in MLLMs.
CoRR, 2024

Multitask Vision-Language Prompt Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption.
CoRR, 2023

Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets.
CoRR, 2021

2017
3.7 A 1920×1080 30fps 2.3TOPS/W stereo-depth processor for robust autonomous navigation.
Proceedings of the 2017 IEEE International Solid-State Circuits Conference, 2017