Gait Recognition in the Wild: A Large-Scale Benchmark and NAS-Based Baseline.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving.
CoRR, March, 2025
Bidirectional Prototype-Reward co-Evolution for Test-Time Adaptation of Vision-Language Models.
CoRR, March, 2025
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, March, 2025
WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation.
CoRR, March, 2025
Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data.
CoRR, 2024
DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving.
CoRR, 2024
AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion.
CoRR, 2024
LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation.
CoRR, 2024
Instruct Large Language Models to Drive like Humans.
CoRR, 2024
MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving.
CoRR, 2024
GenAD: Generative End-to-End Autonomous Driving.
CoRR, 2024
AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
GenAD: Generative End-to-End Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2024, 2024
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation.
Proceedings of the Computer Vision - ECCV 2024, 2024
OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline.
CoRR, 2023
Multi-Prompt with Depth Partitioned Cross-Modal Learning.
CoRR, 2023
A Simple Baseline for Supervised Surround-view Depth Estimation.
CoRR, 2023
DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation.
CoRR, 2023
DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
CompletionFormer: Depth Completion with Convolutions and Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Gait Recognition in the Wild: A Benchmark.
CoRR, 2022
GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework.
CoRR, 2022
GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework.
Proceedings of the Computer Vision - ACCV 2022, 2022
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer.
Proceedings of the International Conference on 3D Vision, 2022