Federico Tombari

Orcid: 0000-0001-5598-5212

Affiliations:
  • Google, Zurich, Switzerland
  • University of Bologna, Italy


According to our database1, Federico Tombari authored at least 324 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
I2DFormer+: Learning Image to Document Summary Attention for Zero-Shot Image Classification.
Int. J. Comput. Vis., September, 2024

Holistic OR domain modeling: a semantic scene graph approach.
Int. J. Comput. Assist. Radiol. Surg., May, 2024

Self-Supervised Latent Space Optimization With Nebula Variational Coding.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Occlusion-Aware Self-Supervised Monocular 6D Object Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

NEWTON: Neural View-Centric Mapping for On-the-Fly Large-Scale SLAM.
IEEE Robotics Autom. Lett., 2024

3D Adversarial Augmentations for Robust Out-of-Domain Predictions.
Int. J. Comput. Vis., 2024

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters.
CoRR, 2024

Neural Semantic Map-Learning for Autonomous Vehicles.
CoRR, 2024

Search3D: Hierarchical Open-Vocabulary 3D Segmentation.
CoRR, 2024

LiLoc: Lifelong Localization using Adaptive Submap Joining and Egocentric Factor Graph.
CoRR, 2024

Toward a Diffusion-Based Generalist for Dense Vision Tasks.
CoRR, 2024

Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models.
CoRR, 2024

Mixed Diffusion for 3D Indoor Scene Synthesis.
CoRR, 2024

Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians.
CoRR, 2024

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.
CoRR, 2024

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views.
CoRR, 2024

CLoRA: A Contrastive Approach to Compose Multiple LoRA Models.
CoRR, 2024

Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation.
CoRR, 2024

RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS.
CoRR, 2024

FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks.
CoRR, 2024

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding.
CoRR, 2024

InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes.
CoRR, 2024

Learning to Prompt with Text Only Supervision for Vision-Language Models.
CoRR, 2024

Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Extracting Training Data From Document-Based VQA Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Denoising Diffusion via Image-Based Rendering.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations.
Proceedings of the Computer Vision - ECCV 2024, 2024

EchoScene: Indoor Scene Generation via Information Echo Over Scene Graph Diffusion.
Proceedings of the Computer Vision - ECCV 2024, 2024

P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising.
Proceedings of the Computer Vision - ECCV 2024, 2024

SILC: Improving Vision Language Pretraining with Self-distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Self-supervised Shape Completion via Involution and Implicit Correspondences.
Proceedings of the Computer Vision - ECCV 2024, 2024

GeoGaussian: Geometry-Aware Gaussian Splatting for Scene Rendering.
Proceedings of the Computer Vision - ECCV 2024, 2024

Text-Conditioned Resampler For Long Form Video Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

BRAVE: Broadening the Visual Encoding of Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation Without Manual Labels.
Proceedings of the Computer Vision - ECCV 2024, 2024

SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance.
Proceedings of the Computer Vision - ECCV 2024, 2024

D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction.
Proceedings of the Computer Vision - ECCV 2024, 2024

KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MOHO: Learning Single-View Hand-Held Object Reconstruction with Multi-View Occlusion-Aware Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RaNeuS: Ray-adaptive Neural Surface Reconstruction.
Proceedings of the International Conference on 3D Vision, 2024

TextMesh: Generation of Realistic 3D Meshes From Text Prompts.
Proceedings of the International Conference on 3D Vision, 2024

NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes.
Proceedings of the International Conference on 3D Vision, 2024

2023
SupeRGB-D: Zero-Shot Instance Segmentation in Cluttered Indoor Environments.
IEEE Robotics Autom. Lett., June, 2023

Domain-Specific Priors and Meta Learning for Few-Shot First-Person Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Self-Supervised Category-Level 6D Object Pose Estimation With Optical Flow Consistency.
IEEE Robotics Autom. Lett., May, 2023

Unsupervised Template Warp Consistency for Implicit Surface Correspondences.
Comput. Graph. Forum, May, 2023

Towards Long-Term Retrieval-Based Visual Localization in Indoor Environments With Changes.
IEEE Robotics Autom. Lett., April, 2023

OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection.
IEEE Robotics Autom. Lett., March, 2023

Lidar Upsampling With Sliced Wasserstein Distance.
IEEE Robotics Autom. Lett., 2023

Batch normalization embeddings for deep domain generalization.
Pattern Recognit., 2023

Query-guided networks for few-shot fine-grained classification and person search.
Pattern Recognit., 2023

UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections.
CoRR, 2023

LIME: Localized Image Editing via Attention Regularization in Diffusion Models.
CoRR, 2023

Re-Nerfing: Enforcing Geometric Constraints on Neural Radiance Fields through Novel Views Synthesis.
CoRR, 2023

DNS SLAM: Dense Neural Semantic-Informed SLAM.
CoRR, 2023

HACD: Hand-Aware Conditional Diffusion for Monocular Hand-Held Object Reconstruction.
CoRR, 2023

3D Compression Using Neural Fields.
CoRR, 2023

MOHO: Learning Single-view Hand-held Object Reconstruction with Multi-view Occlusion-Aware Supervision.
CoRR, 2023

Open-Structure: a Structural Benchmark Dataset for SLAM Algorithms.
CoRR, 2023

CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction.
CoRR, 2023

View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection.
CoRR, 2023

DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OpenMask3D: Open-Vocabulary 3D Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MonoGraspNet: 6-DoF Grasping with a Single RGB Image.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

SST: Real-time End-to-end Monocular 3D Reconstruction via Sparse Spatial-Temporal Guidance.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

PointHGN: Point Heterogeneous Graph Neural Network for Point Cloud Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

LatentSwap3D: Semantic Edits on 3D Image GANs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Introducing Language Guidance in Prompt-based Continual Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Monocular Depth Estimation under Challenging Conditions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Segmenting Known Objects and Unseen Unknowns without Prior Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

IPCC-TP: Utilizing Incremental Pearson Correlation Coefficient for Joint Multi-Agent Trajectory Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Incremental 3D Semantic Scene Graph Prediction from RGB Sequences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SPARF: Neural Radiance Fields from Sparse and Noisy Poses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust and Efficient Edge-guided Pose Estimation with Resolution-conditioned NeRF.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Recurrent Models for Lane Change Prediction and Situation Assessment.
IEEE Trans. Intell. Transp. Syst., 2022

Object-Aware Monocular Depth Prediction With Instance Convolutions.
IEEE Robotics Autom. Lett., 2022

Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection.
IEEE Robotics Autom. Lett., 2022

CertainNet: Sampling-Free Uncertainty Estimation for Object Detection.
IEEE Robotics Autom. Lett., 2022

3DPointCaps++: Learning 3D Representations with Capsule Networks.
Int. J. Comput. Vis., 2022

SoftPool++: An Encoder-Decoder Network for Point Cloud Completion.
Int. J. Comput. Vis., 2022

Learning 3D Semantic Scene Graphs with Instance Embeddings.
Int. J. Comput. Vis., 2022

ParGAN: Learning Real Parametrizable Transformations.
CoRR, 2022

Holistic Segmentation.
CoRR, 2022

Incremental 3D Scene Completion for Safe and Efficient Exploration Mapping and Planning.
CoRR, 2022

RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation.
CoRR, 2022

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language.
CoRR, 2022

From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction.
CoRR, 2022

Transformers in Action: Weakly Supervised Action Segmentation.
CoRR, 2022

Neural Fields in Visual Computing and Beyond.
Comput. Graph. Forum, 2022

I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

4D-OR: Semantic Scene Graphs for OR Domain Modeling.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct Category-Level Object Pose Estimation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

CloudAttention: Efficient Multi-Scale Attention Scheme For 3D Point Cloud Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

On the Practicality of Deterministic Epistemic Uncertainty.
Proceedings of the International Conference on Machine Learning, 2022

SecNet: Semantic Eye Completion in Implicit Field.
Proceedings of The 1st Gaze Meets ML workshop, 2022

RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unsupervised Domain Adaptive Object Detection with Class Label Shift Weighted Local Features.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Implicit Neural Representations for Image Compression.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Compositional Zero-Shot Learning with DeCompositional Consensus.
Proceedings of the Computer Vision - ECCV 2022, 2022

E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs.
Proceedings of the Computer Vision - ECCV 2022, 2022

GOCA: Guided Online Cluster Assignment for Self-supervised Video Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Local Displacements for Point Cloud Completion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bending Graphs: Hierarchical Shape Matching using Gated Optimal Transport.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Opportunistic Interfaces for Augmented Reality: Transforming Everyday Objects into Tangible 6DoF Interfaces Using Ad hoc UI.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

DisPositioNet: Disentangled Pose and Identity in Semantic Image Manipulation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

ManiFlow: Implicitly Representing Manifolds with Normalizing Flows.
Proceedings of the International Conference on 3D Vision, 2022

2021
SRH-Net: Stacked Recurrent Hourglass Network for Stereo Matching.
IEEE Robotics Autom. Lett., October, 2021

Panoster: End-to-End Panoptic Segmentation of LiDAR Point Clouds.
IEEE Robotics Autom. Lett., 2021

3D-VField: Learning to Adversarially Deform Point Clouds for Robust 3D Object Detection.
CoRR, 2021

Semantic Dense Reconstruction with Consistent Scene Segments.
CoRR, 2021

Attention-based Adversarial Appearance Learning of Augmented Pedestrians.
CoRR, 2021

On the Practicality of Deterministic Epistemic Uncertainty.
CoRR, 2021

Multimodal Semantic Scene Graphs for Holistic Modeling of Surgical Procedures.
CoRR, 2021

LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction.
CoRR, 2021

Unsupervised Novel View Synthesis from a Single Image.
CoRR, 2021

DB-GAN: Boosting Object Recognition Under Strong Lighting Conditions.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Content Disentanglement for Semantically Consistent Synthetic-to-Real Domain Adaptation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Semantic Image Alignment for Vehicle Localization.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

3D Scene Understanding with Scene Graphs and Self-Supervision.
Proceedings of the International Conference on Image Processing and Vision Engineering, 2021

ManhattanSLAM: Robust Planar Tracking and Mapping Leveraging Mixture of Manhattan Frames.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

RGB-D SLAM with Structural Regularities.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Lightweight Semantic Mesh Mapping for Autonomous Vehicles.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Unconditional Scene Graph Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LOLA v1.1 - An Upgrade in Hardware and Software Design for Dynamic Multi-Contact Locomotion.
Proceedings of the 20th IEEE-RAS International Conference on Humanoid Robots, 2021

3D Indoor Scene Understanding with Scene Graphs and Self-supervision.
Proceedings of the 16th International Joint Conference on Computer Vision, 2021

SceneGraphFusion: Incremental 3D Scene Graph Prediction From RGB-D Sequences.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Graph Embeddings for Compositional Zero-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Variational Transformer Networks for Layout Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Go with the Flows: Mixtures of Normalizing Flows for Point Cloud Generation and Reconstruction.
Proceedings of the International Conference on 3D Vision, 2021

R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes.
Proceedings of the International Conference on 3D Vision, 2021

2020
Explicit Domain Adaptation With Loosely Coupled Samples.
IEEE Robotics Autom. Lett., 2020

Co-Planar Parametrization for Stereo-SLAM and Visual-Inertial Odometry.
IEEE Robotics Autom. Lett., 2020

Structure-SLAM: Low-Drift Monocular SLAM in Indoor Environments.
IEEE Robotics Autom. Lett., 2020

Ambiguity in Sequential Data: Predicting Uncertain Futures With Recurrent Models.
IEEE Robotics Autom. Lett., 2020

Guest Editors' Introduction to the Special Issue on RGB-D Vision: Methods and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Joint motion boundary detection and CNN-based feature visualization for video object segmentation.
Neural Comput. Appl., 2020

Joint detection and tracking in videos with identification features.
Image Vis. Comput., 2020

Quantifying Aleatoric and Epistemic Uncertainty Using Density Estimation in Latent Space.
CoRR, 2020

3DSNet: Unsupervised Shape-to-Shape 3D Style Transfer.
CoRR, 2020

KLIEP-based Density Ratio Estimation for Semantically Consistent Synthetic to Real Images Adaptation in Urban Traffic Scenes.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Binary DAD-Net: Binarized Driveable Area Detection Network for Autonomous Driving.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Restricting the Flow: Information Bottlenecks for Attribution.
Proceedings of the 8th International Conference on Learning Representations, 2020

Quaternion Equivariant Capsule Networks for 3D Point Clouds.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis.
Proceedings of the Computer Vision - ECCV 2020, 2020

SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Self6D: Self-supervised Monocular 6D Object Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Beyond Controlled Environments: 3D Camera Re-localization in Changing Indoor Scenes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning 3D Semantic Scene Graphs From 3D Indoor Reconstructions.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semantic Image Manipulation Using Scene Graphs.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SCFusion: Real-time Incremental Scene Reconstruction with Semantic Completion.
Proceedings of the 8th International Conference on 3D Vision, 2020

A Divide et Impera Approach for 3D Shape Reconstruction from Multiple Views.
Proceedings of the 8th International Conference on 3D Vision, 2020

Graphite: Graph-Induced Feature Extraction for Point Cloud Registration.
Proceedings of the 8th International Conference on 3D Vision, 2020

2019
Learning Descriptors With Cube Loss for View-Based 3-D Object Retrieval.
IEEE Trans. Multim., 2019

2D Image-To-3D Model: Knowledge-Based 3D Building Reconstruction (3DBR) Using Single Aerial Images and Convolutional Neural Networks (CNNs).
Remote. Sens., 2019

Variational Object-Aware 3-D Hand Pose From a Single RGB Image.
IEEE Robotics Autom. Lett., 2019

Peeking behind objects: Layered depth prediction from a single image.
Pattern Recognit. Lett., 2019

Unsupervised Monocular Depth Prediction for Indoor Continuous Video Streams.
CoRR, 2019

Grasp Type Estimation for Myoelectric Prostheses using Point Cloud Feature Learning.
CoRR, 2019

Domain-Specific Priors and Meta Learning for Low-shot First-Person Action Recognition.
CoRR, 2019

Sampling/Importance Resampling for Semantically Consistent Synthetic to Real Image Domain Adaptation in Urban Traffic Scenes.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Headlight Range Estimation for Autonomous Driving using Deep Neural Networks.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Crowd-sourced Semantic Edge Mapping for Autonomous Vehicles.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Attention-based Lane Change Prediction.
Proceedings of the International Conference on Robotics and Automation, 2019

ForkNet: Multi-Branch Volumetric Semantic Completion From a Single Depth Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

RIO: 3D Object Instance Re-Localization in Changing Indoor Environments.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sampling-Free Epistemic Uncertainty Estimation Using Approximated Variance Propagation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Object-Driven Multi-Layer Scene Decomposition From a Single Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

3D Point Capsule Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Query-Guided End-To-End Person Search.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Abstract: Leveraging Web Data for Skin Lesion Classification.
Proceedings of the Bildverarbeitung für die Medizin 2019 - Algorithmen - Systeme, 2019

2018
Real-Time Fully Incremental Scene Understanding on Mobile Platforms.
IEEE Robotics Autom. Lett., 2018

Tracking-by-Detection of 3D Human Shapes: From Surfaces to Volumes.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Co-segmentation via visualization.
J. Vis. Commun. Image Represent., 2018

Learning to Detect Good 3D Keypoints.
Int. J. Comput. Vis., 2018

Real-Time Accurate 3D Head Tracking and Pose Estimation with Consumer RGB-D Cameras.
Int. J. Comput. Vis., 2018

Learning without prejudice: Avoiding bias in webly-supervised action recognition.
Comput. Vis. Image Underst., 2018

A performance evaluation of point pair features.
Comput. Vis. Image Underst., 2018

Explaining the Ambiguity of Object Detection and 6D Pose from Visual Data.
CoRR, 2018

Deep Learned Full-3D Object Completion from Single View.
CoRR, 2018

Convolutional neural networks for real-time epileptic seizure detection.
Comput. methods Biomech. Biomed. Eng. Imaging Vis., 2018

Webly Supervised Learning for Skin Lesion Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Fast and Accurate Semantic Mapping through Geometric-based Incremental Segmentation.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Self-Supervised Learning of the Drivable Area for Autonomous Vehicles.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Semantic Monocular SLAM for Highly Dynamic Environments.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Situation Assessment for Planning Lane Changes: Combining Recurrent Models and Prediction.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Local Image Descriptors with Statistical Losses.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

Fully-Convolutional Point Networks for Large-Scale Point Clouds.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Model-Based 6D Pose Refinement in RGB.
Proceedings of the Computer Vision - ECCV 2018, 2018


A Summary of the 4th International Workshop on Recovering 6D Object Pose.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Human Motion Analysis with Deep Metric Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Guide Me: Interacting With Deep Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Dealing with Ambiguity in Robotic Grasping via Multiple Predictions.
Proceedings of the Computer Vision - ACCV 2018, 2018

Adversarial Semantic Scene Completion from a Single Depth Image.
Proceedings of the 2018 International Conference on 3D Vision, 2018

2017
Looking Beyond the Simple Scenarios: Combining Learners and Optimizers in 3D Temporal Tracking.
IEEE Trans. Vis. Comput. Graph., 2017

Large scale and long standing simultaneous reconstruction and segmentation.
Comput. Vis. Image Underst., 2017

6D Object Pose Estimation with Depth Images: A Seamless Approach for Robotic Interaction and Augmented Reality.
CoRR, 2017

Concurrent Segmentation and Localization for Tracking of Surgical Instruments.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2017, 2017

Mixed Reality Support for Orthopaedic Surgery.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2017

SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization.
Proceedings of the IEEE International Conference on Computer Vision, 2017

CNN-SLAM: Real-Time Dense Monocular SLAM with Learned Depth Prediction.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

One For All: Adaptive Learning-based Temporal Tracker for 3D Head Shape Models.
Proceedings of the British Machine Vision Conference 2017, 2017

Automatic Initialization and Failure Detection for Surgical Tool Tracking in Retinal Microsurgery.
Proceedings of the Bildverarbeitung für die Medizin 2017 - Algorithmen - Systeme, 2017

Abstract: Real-Time Online Adaption for Robust Instrument Tracking and Pose Estimation.
Proceedings of the Bildverarbeitung für die Medizin 2017 - Algorithmen - Systeme, 2017

2016
Semantic parametric body shape estimation from noisy depth sequences.
Robotics Auton. Syst., 2016

A Global Hypothesis Verification Framework for 3D Object Recognition in Clutter.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Real-time localization of articulated surgical instruments in retinal microsurgery.
Medical Image Anal., 2016

Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses.
CoRR, 2016

A Taxonomy and Library for Visualizing Learned Features in Convolutional Neural Networks.
CoRR, 2016

Real-Time Online Adaption for Robust Instrument Tracking and Pose Estimation.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Patient MoCap: Human Pose Estimation Under Blanket Occlusion for Hospital Monitoring Applications.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Frustration Free Pose Computation For Spatial AR Devices in Industrial Scenario.
Proceedings of the 2016 IEEE International Symposium on Mixed and Augmented Reality, 2016

Sensor substitution for video-based action recognition.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Incremental scene understanding on dense SLAM.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

When 2.5D is not enough: Simultaneous reconstruction, segmentation and recognition on dense SLAM.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

A Radial Search Method for Fast Nearest Neighbor Search on Range Images.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Deep Learning of Local RGB-D Patches for 3D Object Detection and 6D Pose Estimation.
Proceedings of the Computer Vision - ECCV 2016, 2016

RobotFusion: Grasping with a Robotic Manipulator via Multi-view Reconstruction.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

An Octree-Based Approach towards Efficient Variational Range Data Fusion.
Proceedings of the British Machine Vision Conference 2016, 2016

Retrieval of Human Subjects from Depth Sensor Data.
Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016

Deeper Depth Prediction with Fully Convolutional Residual Networks.
Proceedings of the Fourth International Conference on 3D Vision, 2016

2015
Registration with the Point Cloud Library: A Modular Framework for Aligning in 3-D.
IEEE Robotics Autom. Mag., 2015

Traffic sign detection via interest region extraction.
Pattern Recognit., 2015

The Maximal Self-dissimilarity Interest Point Detector.
IPSJ Trans. Comput. Vis. Appl., 2015

Hierarchical Multi-Organ Segmentation Without Registration in 3D Abdominal CT Images.
Proceedings of the Medical Computer Vision: Algorithms for Big Data, 2015

Surgical Tool Tracking and Pose Estimation in Retinal Microsurgery.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

Robust Segmentation of Various Anatomies in 3D Ultrasound Using Hough Forests and Learned Data Representations.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

A Step Closer To Reality: Closed Loop Dynamic Registration Correction in SAR.
Proceedings of the 2015 IEEE International Symposium on Mixed and Augmented Reality, 2015

Augmenting Mobile C-arm Fluoroscopes via Stereo-RGBD Sensors for Multimodal Visualization.
Proceedings of the 2015 IEEE International Symposium on Mixed and Augmented Reality, 2015

Real-time and scalable incremental segmentation on dense SLAM.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Bounded Non-Local Means for Fast and Effective Image Denoising.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

A Versatile Learning-Based 3D Temporal Tracker: Scalable, Robust, Online.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning a Descriptor-Specific 3D Keypoint Detector.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Universal Hough dictionaries for object tracking.
Proceedings of the British Machine Vision Conference 2015, 2015

Hashmod: A Hashing Method for Scalable 3D Object Detection.
Proceedings of the British Machine Vision Conference 2015, 2015

A Combined Generalized and Subject-Specific 3D Head Pose Estimation.
Proceedings of the 2015 International Conference on 3D Vision, 2015

Repeatable Local Coordinate Frames for 3D Human Motion Tracking: From Rigid to Non-rigid.
Proceedings of the 2015 International Conference on 3D Vision, 2015

2014
Three Dimensional Shape Descriptor.
Computer Vision, A Reference Guide, 2014

SHOT: Unique signatures of histograms for surface and texture description.
Comput. Vis. Image Underst., 2014

Automatic detection of pole-like structures in 3D urban environments.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Interest Points via Maximal Self-Dissimilarities.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Performance Evaluation of 3D Keypoint Detectors.
Int. J. Comput. Vis., 2013

Multimodal Video Analysis on Self-Powered Resource-Limited Wireless Smart Camera.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2013

A traffic sign detection pipeline based on interest region extraction.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Multimodal cue integration through Hypotheses Verification for RGB-D object recognition and 6DOF pose estimation.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

BOLD Features to Detect Texture-less Objects.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Feature-based automatic 3D registration for cultural heritage applications.
Proceedings of the 1st Digital Heritage International Congress, 2013

GPU-SHOT: Parallel Optimization for Real-Time 3D Local Description.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Tutorial: Point Cloud Library: Three-Dimensional Object Recognition and 6 DOF Pose Estimation.
IEEE Robotics Autom. Mag., 2012

Performance Evaluation of Full Search Equivalent Pattern Matching Algorithms.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Hough Voting for 3D Object Recognition under Occlusion and Clutter.
IPSJ Trans. Comput. Vis. Appl., 2012

Supervised learning of hidden and non-hidden 0-order affordances and detection in real scenes.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

A Global Hypotheses Verification Method for 3D Object Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

OUR-CVFH - Oriented, Unique and Repeatable Clustered Viewpoint Feature Histogram for Object Recognition and 6DOF Pose Estimation.
Proceedings of the Pattern Recognition, 2012

On the Affinity between 3D Detectors and Descriptors.
Proceedings of the 2012 Second International Conference on 3D Imaging, 2012

Toward Compressed 3D Descriptors.
Proceedings of the 2012 Second International Conference on 3D Imaging, 2012

2011
Adaptive Low Resolution Pruning for fast Full Search-equivalent pattern matching.
Pattern Recognit. Lett., 2011

Efficient template matching for multi-channel images.
Pattern Recognit. Lett., 2011

Improving Geometric Hashing by Means of Feature Descriptors.
Proceedings of the VISAPP 2011, 2011

Online learning for automatic segmentation of 3D data.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

A combined texture-shape descriptor for enhanced 3D feature matching.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Evaluation of stereo algorithms for 3D object recognition.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

3D Data Segmentation by Local Classification and Markov Random Fields.
Proceedings of the International Conference on 3D Imaging, 2011

2010
Unique shape context for 3d data description.
Proceedings of the ACM workshop on 3D object retrieval, 2010

Robust and efficient background subtraction by quadratic polynomial fitting.
Proceedings of the International Conference on Image Processing, 2010

A 3D reconstruction system based on improved spacetime stereo.
Proceedings of the 11th International Conference on Control, 2010

Stereo for robots: Quantitative evaluation of efficient and low-memory dense stereo algorithms.
Proceedings of the 11th International Conference on Control, 2010

Unique Signatures of Histograms for Local Surface Description.
Proceedings of the Computer Vision, 2010

Accurate and Efficient Background Subtraction by Monotonic Second-Degree Polynomial Fitting.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

On the Use of Implicit Shape Models for Recognition of Object Categories in 3D Data.
Proceedings of the Computer Vision - ACCV 2010, 2010

Second-Order Polynomial Models for Background Subtraction.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

2009
Full-Search-Equivalent Pattern Matching with Incremental Dissimilarity Approximations.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

A Practical Stereo System based on Regularization and Texture Projection.
Proceedings of the ICINCO 2009, 2009

Multimodal Abandoned/Removed Object Detection for Low Power Video Surveillance Systems.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

A Template Analysis Methodology to Improve the Efficiency of Fast Matching Algorithms.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2009

Enhanced Low-Resolution Pruning for Fast Full-Search Template Matching.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2009

2008
Fast Full-Search Equivalent Template Matching by Enhanced Bounded Correlation.
IEEE Trans. Image Process., 2008

Performance Evaluation of Robust Matching Measures.
Proceedings of the VISAPP 2008: Proceedings of the Third International Conference on Computer Vision Theory and Applications, Funchal, Madeira, Portugal, January 22-25, 2008, 2008

Near real-time stereo based on effective cost aggregation.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Markerless Augmented Reality Using Image Mosaics.
Proceedings of the Image and Signal Processing - 3rd International Conference, 2008

Reliable rejection of mismatching candidates for efficient ZNCC template matching.
Proceedings of the International Conference on Image Processing, 2008

Classification and evaluation of cost aggregation methods for stereo correspondence.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Multi-view Access Monitoring and Singularization in Interlocks.
Proceedings of the Fifth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2008

Graffiti Detection Using a Time-Of-Flight Camera.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2008

2007
Segmentation-Based Adaptive Support for Accurate Stereo Correspondence.
Proceedings of the Advances in Image and Video Technology, Second Pacific Rim Symposium, 2007

A robust measure for visual correspondence.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

Efficient and optimal block matching for motion estimation.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

Stereo Vision Enabling Precise Border Localization Within a Scanline Optimization Framework.
Proceedings of the Computer Vision, 2007

2006
Template Matching Based on the L_p Norm Using Sufficient Conditions with Incremental Approximations.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
ZNCC-based template matching using bounded partial correlation.
Pattern Recognit. Lett., 2005

Speeding-up NCC-Based Template Matching Using Parallel Multimedia Instructions.
Proceedings of the Seventh International Workshop on Computer Architectures for Machine Perception (CAMP 2005), 2005

2004
An Algorithm for Efficient and Exhaustive Template Matching.
Proceedings of the Image Analysis and Recognition: International Conference, 2004


  Loading...