2025
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving.
CoRR, March, 2025
2024
Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-Shot Metric Depth and Surface Normal Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings.
Int. J. Comput. Vis., September, 2024
Towards Domain-agnostic Depth Completion.
Mach. Intell. Res., August, 2024
SC-DepthV3: Robust Self-Supervised Monocular Depth Estimation for Dynamic Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT.
CoRR, 2024
RoMeO: Robust Metric Visual Odometry.
CoRR, 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration.
CoRR, 2024
Depth Any Video with Scalable Synthetic Data.
CoRR, 2024
DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model.
CoRR, 2024
HE-Drive: Human-Like End-to-End Driving with Vision Language Models.
CoRR, 2024
OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity.
CoRR, 2024
LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment.
CoRR, 2024
DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
GaussianPro: 3D Gaussian Splatting with Progressive Propagation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
GIM: Learning Generalizable Image Matcher From Internet Videos.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Robust Lightweight Depth Estimation Model via Data-Free Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2024
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image.
Proceedings of the Computer Vision - ECCV 2024, 2024
Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
HumanRecon: Neural Reconstruction of Dynamic Human Using Geometric Cues and Physical Priors.
CoRR, 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
CoRR, 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
The Second Monocular Depth Estimation Challenge.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Improving Monocular Visual Odometry Using Learned Depth.
IEEE Trans. Robotics, 2022
Pseudo-LiDAR-Based Road Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022
Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Hierarchical Normalization for Robust Monocular Depth Estimation.
CoRR, 2022
Towards Domain-agnostic Depth Completion.
CoRR, 2022
Exploiting Correspondences with All-pairs Correlations for Multi-view Depth Estimation.
CoRR, 2022
The devil is in the labels: Semantic segmentation from sentences.
CoRR, 2022
Hierarchical Normalization for Robust Monocular Depth Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Controllable Shadow Generation Using Pixel Height Maps.
Proceedings of the Computer Vision - ECCV 2022, 2022
PointInst3D: Segmenting 3D Instances by Points.
Proceedings of the Computer Vision - ECCV 2022, 2022
Retrieval Augmented Classification for Long-Tail Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Generic Perceptual Loss for Modeling Structured Output Dependencies.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Learning To Recover 3D Scene Shape From a Single Image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
DiverseDepth: Affine-invariant Depth Prediction Using Diverse Data.
CoRR, 2020
Task-Aware Monocular Depth Estimation for 3D Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Training Compact Neural Networks via Auxiliary Overparameterization.
CoRR, 2019
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019