Finite Element Analysis of Electrostatic Coupling in LISA Pathfinder Inertial Sensors.
Sensors, October, 2024
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Law of Vision Representation in MLLMs.
CoRR, 2024
Multitask Vision-Language Prompt Tuning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption.
CoRR, 2023
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models.
Proceedings of the Computer Vision - ECCV 2022, 2022
Image2Point: 3D Point-Cloud Understanding with Pretrained 2D ConvNets.
CoRR, 2021
3.7 A 1920×1080 30fps 2.3TOPS/W stereo-depth processor for robust autonomous navigation.
Proceedings of the 2017 IEEE International Solid-State Circuits Conference, 2017