2025
HOI-PAGE: Zero-Shot Human-Object Interaction Generation with Part Affordance Guidance.
CoRR, June, 2025
TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset.
CoRR, May, 2025
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization.
CoRR, May, 2025
Physically Consistent Humanoid Loco-Manipulation using Latent Diffusion Models.
CoRR, April, 2025
ScanEdit: Hierarchically-Guided Functional 3D Scan Editing.
CoRR, April, 2025
AI Agents in Engineering Design: A Multi-Agent Framework for Aesthetic and Aerodynamic Car Design.
CoRR, March, 2025
TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane Networks.
CoRR, March, 2025
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail.
CoRR, March, 2025
Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models.
CoRR, March, 2025
MeshPad: Interactive Sketch-Conditioned Artist-Designed Mesh Generation and Editing.
CoRR, March, 2025
Non-Gaited Legged Locomotion With Monte-Carlo Tree Search and Supervised Learning.
IEEE Robotics Autom. Lett., February, 2025
Use of Winsome Robots for Understanding Human Feedback (UWU).
CoRR, February, 2025
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
DNF: Unconditional 4D Generation with Dictionary-based Neural Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
LT3SD: Latent Trees for 3D Scene Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
PrEditor3D: Fast and Precise 3D Shape Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
DiffCAD: Weakly-Supervised Probabilistic CAD Model Retrieval and Alignment from an RGB Image.
ACM Trans. Graph., July, 2024
GaussianSpeech: Audio-Driven Gaussian Avatars.
CoRR, 2024
DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Graph-Based Drag Prediction.
CoRR, 2024
L3DG: Latent 3D Gaussian Diffusion.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
Coherent 3D Scene Diffusion From a Single RGB Image.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
End-to-End Piano Performance-MIDI to Score Conversion With Transformers.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
DPHMs: Diffusion Parametric Head Models for Depth-Based Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
GenZI: Zero-Shot 3D Human-Scene Interaction Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
CG-HOI: Contact-Guided 3D Human-Object Interaction Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images.
Proceedings of the 35th British Machine Vision Conference, 2024
2023
DiffuScene: Scene Graph Denoising Diffusion Probabilistic Model for Generative Indoor Scene Synthesis.
CoRR, 2023
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors.
CoRR, 2023
ClipFace: Text-guided Editing of Textured 3D Morphable Models.
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023
ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Mesh2Tex: Generating Mesh Textures from Image Queries.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Panoptic Lifting for 3D Scene Understanding with Neural Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Learning 3D Scene Priors with 2D Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Mask3D: Pretraining 2D Vision Transformers by Learning Masked 3D Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
ObjectMatch: Robust Registration using Canonical Object Correspondences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Forecasting Actions and Characteristic 3D Poses.
CoRR, 2022
Neural Poisson: Indicator Functions for Neural Fields.
CoRR, 2022
Weakly-Supervised End-to-End CAD Retrieval to Scan Objects.
CoRR, 2022
PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen Categories.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Texturify: Generating Textures on 3D Shape Surfaces.
Proceedings of the Computer Vision - ECCV 2022, 2022
Language-Grounded Indoor 3D Semantic Segmentation in the Wild.
Proceedings of the Computer Vision - ECCV 2022, 2022
Pose2Room: Understanding 3D Scenes from Human Activities.
Proceedings of the Computer Vision - ECCV 2022, 2022
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding.
Proceedings of the Computer Vision - ECCV 2022, 2022
SPAMs: Structured Implicit Parametric Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
ROCA: Robust CAD Model Retrieval and Alignment from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Forecasting Characteristic 3D Poses of Human Actions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Message from the Program Chairs: 3DV 2022.
Proceedings of the International Conference on 3D Vision, 2022
2021
Exploring Location-Based AR Narrative Design for Historic Site.
Presence Teleoperators Virtual Environ., December, 2021
Guest Editorial: Special Issue on Performance Evaluation in Computer Vision.
Int. J. Comput. Vis., 2021
Panoptic 3D Scene Reconstruction From a Single RGB Image.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
RetrievalFuse: Neural 3D Scene Reconstruction with a Database.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
NPMs: Neural Parametric Models for 3D Deformable Shapes.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Pri3D: Can 3D Priors Help 2D Representation Learning?
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Seeing Behind Objects for 3D Multi-Object Tracking in RGB-D Sequences.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
SPSG: Self-Supervised Photometric Scene Generation From RGB-D Scans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Neural Deformation Graphs for Globally-Consistent Non-Rigid Reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Towards Part-Based Understanding of RGB-D Scans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Neural Non-Rigid Tracking.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve.
Proceedings of the Computer Vision - ECCV 2020, 2020
SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans.
Proceedings of the Computer Vision - ECCV 2020, 2020
Adversarial Texture Optimization From RGB-D Scans.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
RevealNet: Seeing Behind Objects in RGB-D Scans.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
SG-NN: Sparse Generative Neural Networks for Self-Supervised Scene Completion of RGB-D Scans.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
3D-SIC: 3D Semantic Instance Completion for RGB-D Scans.
CoRR, 2019
Joint Embedding of 3D Scan and CAD Objects.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
End-to-End CAD Model Retrieval and 9DoF Alignment in 3D Scans.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Scan2Mesh: From Unstructured Range Scans to 3D Meshes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Scan2CAD: Learning CAD Model Alignment in RGB-D Scans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Using generative deep learning to create high-quality models from 3D scans.
PhD thesis, 2018
3DMV: Joint 3D-Multi-view Prediction for 3D Semantic Scene Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018
ScanComplete: Large-Scale Scene Completion and Semantic Segmentation for 3D Scans.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
3DLite: Towards Commodity 3D Scanning for Content Creation.
ACM Trans. Graph., 2017
BundleFusion: Real-Time Globally Consistent 3D Reconstruction Using On-the-Fly Surface Reintegration.
ACM Trans. Graph., 2017
LayerBuilder: Layer Decomposition for Interactive Image and Video Color Editing.
CoRR, 2017
Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Matterport3D: Learning from RGB-D Data in Indoor Environments.
Proceedings of the 2017 International Conference on 3D Vision, 2017
2016
BundleFusion: Real-time Globally Consistent 3D Reconstruction using On-the-fly Surface Re-integration.
CoRR, 2016
Volumetric and Multi-view CNNs for Object Classification on 3D Data.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Learning to Navigate the Energy Landscape.
Proceedings of the Fourth International Conference on 3D Vision, 2016
2015
Shading-Based Refinement on Volumetric Signed Distance Functions.
ACM Trans. Graph., 2015
Database-Assisted Object Retrieval for Real-Time 3D Reconstruction.
Comput. Graph. Forum, 2015
2014
Combining Inertial Navigation and ICP for Real-time 3D Surface Reconstruction.
Proceedings of the 35th Annual Conference of the European Association for Computer Graphics, 2014