2025

Gait Recognition in the Wild: A Large-Scale Benchmark and NAS-Based Baseline.

Xianda Guo

Zheng Zhu

IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving.

CoRR, March, 2025

Bidirectional Prototype-Reward co-Evolution for Test-Time Adaptation of Vision-Language Models.

CoRR, March, 2025

Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model.

CoRR, March, 2025

WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation.

CoRR, March, 2025

2024

Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data.

CoRR, 2024

DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving.

CoRR, 2024

AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion.

CoRR, 2024

LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation.

CoRR, 2024

Instruct Large Language Models to Drive like Humans.

CoRR, 2024

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving.

CoRR, 2024

GenAD: Generative End-to-End Autonomous Driving.

CoRR, 2024

AdvDiffuser: Generating Adversarial Safety-Critical Driving Scenarios via Guided Diffusion.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

GenAD: Generative End-to-End Autonomous Driving.

Proceedings of the Computer Vision - ECCV 2024, 2024

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation.

Yiqun Duan

Xianda Guo

Zheng Zhu

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline.

CoRR, 2023

Multi-Prompt with Depth Partitioned Cross-Modal Learning.

CoRR, 2023

A Simple Baseline for Supervised Surround-view Depth Estimation.

CoRR, 2023

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation.

Yiqun Duan

Zheng Zhu

Xianda Guo

CoRR, 2023

DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CompletionFormer: Depth Completion with Convolutions and Vision Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Gait Recognition in the Wild: A Benchmark.

CoRR, 2022

GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework.

CoRR, 2022

GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework.

Proceedings of the Computer Vision - ACCV 2022, 2022

MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer.

Proceedings of the International Conference on 3D Vision, 2022