Raquel Urtasun

Affiliations:
  • University of Toronto, Canada
  • Waabi
  • Swiss Federal Institute of Technology in Lausanne, Switzerland (former)


According to our database1, Raquel Urtasun authored at least 296 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SceneControl: Diffusion for Controllable Traffic Scene Generation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Drive via Asymmetric Self-Play.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniCal: Unified Neural Sensor Calibration.
Proceedings of the Computer Vision - ECCV 2024, 2024

G3R: Gradient Guided Generalizable Reconstruction.
Proceedings of the Computer Vision - ECCV 2024, 2024

DeTra: A Unified Model for Object Detection and Trajectory Forecasting.
Proceedings of the Computer Vision - ECCV 2024, 2024

UnO: Unsupervised Occupancy Fields for Perception and Forecasting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
LightSim: Neural Lighting Simulation for Urban Scenes.
CoRR, 2023

UltraLiDAR: Learning Compact Representations for LiDAR Completion and Generation.
CoRR, 2023

Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.
CoRR, 2023

Neural Lighting Simulation for Urban Scenes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Reconstructing Objects in-the-wild for Realistic Sensor Simulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Towards Zero Domain Gap: A Comprehensive Study of Realistic LiDAR Simulation for Autonomy Testing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Real-Time Neural Rasterization for Large Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Unsupervised Object Detection from LiDAR Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Compact Representations for LiDAR Completion and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MIXSIM: A Hierarchical Framework for Mixed Reality Traffic Simulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

UniSim: A Neural Closed-Loop Sensor Simulator.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Realistic Traffic Agents in Closed-loop.
Proceedings of the Conference on Robot Learning, 2023

LabelFormer: Object Trajectory Refinement for Offboard Perception from LiDAR Point Clouds.
Proceedings of the Conference on Robot Learning, 2023

Towards Scalable Coverage-Based Testing of Autonomous Vehicles.
Proceedings of the Conference on Robot Learning, 2023

Adv3D: Generating Safety-Critical 3D Objects through Closed-Loop Simulation.
Proceedings of the Conference on Robot Learning, 2023

4D-Former: Multimodal 4D Panoptic Segmentation.
Proceedings of the Conference on Robot Learning, 2023

2022
GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Rethinking Closed-Loop Training for Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2022, 2022

Virtual Correspondence: Humans as a Cue for Extreme-View Geometry.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CADSim: Robust and Scalable in-the-wild 3D Reconstruction for Controllable Sensor Simulation.
Proceedings of the Conference on Robot Learning, 2022

2021
NP-DRAW: A Non-Parametric Structured Latent Variable Modelfor Image Generation.
CoRR, 2021

Just Label What You Need: Fine-Grained Active Selection for Perception and Prediction through Partially Labeled Scenes.
CoRR, 2021

Non-parametric Memory for Spatio-Temporal Segmentation of Construction Zones for Self-Driving.
CoRR, 2021

Secrets of 3D Implicit Object Shape Reconstruction in the Wild.
CoRR, 2021

Network Automatic Pruning: Start NAP and Take a Nap.
CoRR, 2021

PLUME: Efficient 3D Object Detection from Stereo Images.
CoRR, 2021

Cost-Efficient Online Hyperparameter Optimization.
CoRR, 2021

Auto4D: Learning to Label 4D Objects from Sequential Point Clouds.
CoRR, 2021

VideoClick: Video Object Segmentation with a Single Click.
CoRR, 2021

GeoSim: Photorealistic Image Simulation with Geometry-Aware Composition.
CoRR, 2021

Safety-Oriented Pedestrian Motion and Scene Occupancy Forecasting.
CoRR, 2021

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

LaneRCNN: Distributed Representations for Graph-Centric Motion Forecasting.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

PLUMENet: Efficient 3D Object Detection from Stereo Images.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Diverse Complexity Measures for Dataset Curation in Self-Driving.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Safety-Oriented Pedestrian Occupancy Forecasting.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Asynchronous Multi-View SLAM.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Perceive, Attend, and Drive: Learning Spatial Attention for Safe Self-Driving.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Deep Structured Reactive Planning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Self-Supervised Representation Learning from Flow Equivariance.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adversarial Attacks On Multi-Agent Communication.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

S3: Neural Shape, Skeleton, and Skinning Fields for 3D Human Modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SceneGen: Learning To Generate Realistic Traffic Scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

TrafficSim: Learning To Simulate Realistic Multi-Agent Behaviors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Multi-Task Learning for Joint Localization, Perception, and Prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Permute, Quantize, and Fine-Tune: Efficient Compression of Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

MP3: A Unified Model To Map, Perceive, Predict and Plan.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploring Adversarial Robustness of Multi-sensor Perception Systems in Self Driving.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Just Label What You Need: Fine-Grained Active Selection for P&P through Partially Labeled Scenes.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
Recovering and Simulating Pedestrians in the Wild.
CoRR, 2020

StrObe: Streaming Object Detection from LiDAR Packets.
CoRR, 2020

LiRaNet: End-to-End Trajectory Prediction using Spatio-Temporal Radar Fusion.
CoRR, 2020

V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction.
CoRR, 2020

ShapeAdv: Generating Shape-Aware Adversarial 3D Point Clouds.
CoRR, 2020

Physically Realizable Adversarial Examples for LiDAR Object Detection.
CoRR, 2020

LoCo: Local Contrastive Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pit30M: A Benchmark for Global Localization in the Age of Self-Driving Cars.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

End-to-end Contextual Perception and Prediction with Interaction Transformer.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

The Importance of Prior Knowledge in Precise Multimodal Prediction.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

SpAGNN: Spatially-Aware Graph Neural Networks for Relational Behavior Forecasting from Sensor Data.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Multi-Agent Routing Value Iteration Network.
Proceedings of the 37th International Conference on Machine Learning, 2020

Hierarchical Verification for Adversarial Robustness.
Proceedings of the 37th International Conference on Machine Learning, 2020

DSDNet: Deep Structured Self-driving Network.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dense RepPoints: Representing Visual Objects with Dense Point Sets.
Proceedings of the Computer Vision - ECCV 2020, 2020

RadarNet: Exploiting Radar for Robust Perception of Dynamic Objects.
Proceedings of the Computer Vision - ECCV 2020, 2020

Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations.
Proceedings of the Computer Vision - ECCV 2020, 2020

Deep Feedback Inverse Problem Solver.
Proceedings of the Computer Vision - ECCV 2020, 2020

Conditional Entropy Coding for Efficient Video Compression.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Lane Graph Representations for Motion Forecasting.
Proceedings of the Computer Vision - ECCV 2020, 2020

LevelSet R-CNN: A Deep Variational Method for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weakly-Supervised 3D Shape Completion in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Implicit Latent Variable Model for Scene-Consistent Motion Forecasting.
Proceedings of the Computer Vision - ECCV 2020, 2020

Physically Realizable Adversarial Examples for LiDAR Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

LiDARsim: Realistic LiDAR Simulation by Leveraging the Real World.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PnPNet: End-to-End Perception and Prediction With Tracking in the Loop.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PolyTransform: Deep Polygon Transformer for Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Recovering and Simulating Pedestrians in the Wild.
Proceedings of the 4th Conference on Robot Learning, 2020

Learning to Communicate and Correct Pose Errors.
Proceedings of the 4th Conference on Robot Learning, 2020

LiRaNet: End-to-End Trajectory Prediction using Spatio-Temporal Radar Fusion.
Proceedings of the 4th Conference on Robot Learning, 2020

Universal Embeddings for Spatio-Temporal Tagging of Self-Driving Logs.
Proceedings of the 4th Conference on Robot Learning, 2020

StrObe: Streaming Object Detection from LiDAR Packets.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Spatially-Aware Graph Neural Networks for Relational Behavior Forecasting from Sensor Data.
CoRR, 2019

Learning to Remember from a Multi-Task Teacher.
CoRR, 2019

Efficient Graph Generation with Graph Recurrent Attention Networks.
CoRR, 2019

Deformable Filter Convolution for Point Cloud Reasoning.
CoRR, 2019

Efficient Graph Generation with Graph Recurrent Attention Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Jointly Learnable Behavior and Trajectory Planning for Self-Driving Vehicles.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Exploiting Sparse Semantic HD Maps for Self-Driving Vehicle Localization.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

DeepSignals: Predicting Intent of Drivers Through Visual Signals.
Proceedings of the International Conference on Robotics and Automation, 2019

Graph HyperNetworks for Neural Architecture Search.
Proceedings of the 7th International Conference on Learning Representations, 2019

LanczosNet: Multi-Scale Deep Graph Convolutional Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

Dimensionality Reduction for Representing the Knowledge of Probabilistic Models.
Proceedings of the 7th International Conference on Learning Representations, 2019

DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DSIC: Deep Stereo Image Compression.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DAGMapper: Learning to Map by Discovering Lane Topology.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Joint 2D-3D Representations for Depth Completion.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

End-To-End Interpretable Neural Motion Planner.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

UPSNet: A Unified Panoptic Segmentation Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Localize Through Compressed Binary Maps.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Deep Rigid Instance Scene Flow.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-Task Multi-Sensor Fusion for 3D Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Convolutional Recurrent Network for Road Boundary Extraction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DARNet: Deep Active Ray Network for Building Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Identifying Unknown Instances for Autonomous Driving.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Discrete Residual Flow for Probabilistic Pedestrian Behavior Prediction.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
Proteus: Exploiting precision variability in deep neural networks.
Parallel Comput., 2018

3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Neural Guided Constraint Logic Programming for Program Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Deep Multi-Sensor Lane Detection.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

End-to-end Learning of Multi-sensor 3D Tracking by Detection.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Learning to Reweight Examples for Robust Deep Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Reviving and Improving Recurrent Back-Propagation.
Proceedings of the 35th International Conference on Machine Learning, 2018

Leveraging Constraint Logic Programming for Neural Guided Program Synthesis.
Proceedings of the 6th International Conference on Learning Representations, 2018

Inference in probabilistic graphical models by Graph Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Graph Partition Neural Networks for Semi-Supervised Classification.
Proceedings of the 6th International Conference on Learning Representations, 2018

Single Image Intrinsic Decomposition Without a Single Intrinsic Image.
Proceedings of the Computer Vision - ECCV 2018, 2018

Deep Continuous Fusion for Multi-sensor 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

End-to-End Deep Structured Models for Drawing Crosswalks.
Proceedings of the Computer Vision - ECCV 2018, 2018

PIXOR: Real-Time 3D Object Detection From Point Clouds.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Parametric Continuous Convolutional Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SBNet: Sparse Blocks Network for Fast Inference.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Matching Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Deep Structured Active Contours End-to-End.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting With a Single Convolutional Net.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hierarchical Recurrent Attention Networks for Structured Online Maps.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

SurfConv: Bridging 3D and 2D Convolution for RGBD Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

HDNET: Exploiting HD Maps for 3D Object Detection.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

IntentNet: Learning to Predict Intention from Raw Sensor Data.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Learning to Localize Using a LiDAR Intensity Map.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Efficient Convolutions for Real-Time Semantic Segmentation of 3D Point Clouds.
Proceedings of the 2018 International Conference on 3D Vision, 2018

2017
Exploiting Deep Matching and SAR Data for the Geo-Localization Accuracy Improvement of Optical Satellite Images.
Remote. Sens., 2017

Few-Shot Learning Through an Information Retrieval Lens.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

The Reversible Residual Network: Backpropagation Without Storing Activations.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Find your way by observing the sun and other semantic cues.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Deep Spectral Clustering Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

Normalizing the Normalizers: Comparing and Extending Network Normalization Schemes.
Proceedings of the 5th International Conference on Learning Representations, 2017

Song From PI: A Musically Plausible Network for Pop Music Generation.
Proceedings of the 5th International Conference on Learning Representations, 2017

Joint Embeddings of Scene Graphs and Images.
Proceedings of the 5th International Conference on Learning Representations, 2017

Be Your Own Prada: Fashion Synthesis with Structural Coherence.
Proceedings of the IEEE International Conference on Computer Vision, 2017

TorontoCity: Seeing the World with a Million Eyes.
Proceedings of the IEEE International Conference on Computer Vision, 2017

3D Graph Neural Networks for RGBD Semantic Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

DeepRoadMapper: Extracting Road Topology from Aerial Images.
Proceedings of the IEEE International Conference on Computer Vision, 2017

SGN: Sequential Grouping Networks for Instance Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Situation Recognition with Graph Neural Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Towards Diverse and Natural Image Descriptions via a Conditional GAN.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Efficient Multiple Instance Metric Learning Using Weakly Supervised Data.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Sports Field Localization via Deep Structured Models.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Annotating Object Instances with a Polygon-RNN.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep Watershed Transform for Instance Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Human-Machine CRFs for Identifying Bottlenecks in Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Map-Based Probabilistic Visual Self-Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Blending Learning and Inference in Conditional Random Fields.
J. Mach. Learn. Res., 2016

Efficient Summarization with Read-Again and Copy Mechanism.
CoRR, 2016

Order-Embeddings of Images and Language.
Proceedings of the 4th International Conference on Learning Representations, 2016

Soccer Field Localization from a Single Image.
CoRR, 2016

Deep Semantic Matching for Optical Flow.
CoRR, 2016

Towards Generalizable Sentence Embeddings.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Proximal Deep Structured Models.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Understanding the Effective Receptive Field in Deep Convolutional Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning Deep Parsimonious Representations.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Training Deep Neural Networks via Direct Loss Minimization.
Proceedings of the 33nd International Conference on Machine Learning, 2016

HouseCraft: Building Houses from Rental Ads and Street Views.
Proceedings of the Computer Vision - ECCV 2016, 2016

Exploiting Semantic Information and Deep Matching for Optical Flow.
Proceedings of the Computer Vision - ECCV 2016, 2016

Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

MovieQA: Understanding Stories in Movies through Question-Answering.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Efficient Deep Learning for Stereo Matching.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Monocular 3D Object Detection for Autonomous Driving.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Towards Affordable Self-driving Cars.
Proceedings of the British Machine Vision Conference 2016, 2016

Sequential Inference for Deep Gaussian Process.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
Instance-Level Segmentation with Deep Densely Connected MRFs.
CoRR, 2015

Direct Loss Minimization for Training Deep Neural Nets.
CoRR, 2015

Fully Connected Deep Structured Networks.
CoRR, 2015

Generating Multi-Sentence Lingual Descriptions of Indoor Scenes.
CoRR, 2015

Reduced-Precision Strategies for Bounded Memory in Deep Neural Nets.
CoRR, 2015

Learning Deep Structured Models.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Estimating Drivable Collision-Free Space from Monocular Video.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Skip-Thought Vectors.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

3D Object Proposals for Accurate Object Class Detection.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Monocular Object Instance Segmentation and Depth Ordering with CNNs.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Lost Shopping! Monocular Localization in Large Indoor Spaces.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Enhancing Road Maps by Parsing Aerial Images Around the World.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

FollowMe: Efficient Online Min-Cost Flow Tracking with Bounded Memory and Computation.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

segDeepM: Exploiting segmentation and context in deep neural networks for object detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Real-time coarse-to-fine topologically preserving segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning to segment under various forms of weak supervision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Holistic 3D scene understanding from a single geo-tagged image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Neuroaesthetics in fashion: Modeling the perception of fashionability.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Rent3D: Floor-plan priors for monocular layout estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Generating Multi-sentence Natural Language Descriptions of Indoor Scenes.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
3D Traffic Scene Understanding From Movable Platforms.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Human-Machine CRFs for Identifying Bottlenecks in Holistic Scene Understanding.
CoRR, 2014

Bayesian Filtering with Online Gaussian Process Latent Variable Models.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Message Passing Inference for Large Scale Graphical Models with High Order Potentials.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Efficient Inference of Continuous Markov Random Fields with Polynomial Potentials.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Globally Convergent Parallel MAP LP Relaxation Solver using the Frank-Wolfe Algorithm.
Proceedings of the 31th International Conference on Machine Learning, 2014

Transductive Gaussian processes for image denoising.
Proceedings of the 2014 IEEE International Conference on Computational Photography, 2014

Efficient Joint Segmentation, Occlusion Labeling, Stereo and Flow Estimation.
Proceedings of the Computer Vision - ECCV 2014, 2014

Tell Me What You See and I Will Show You Where It Is.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

The Role of Context for Object Detection and Semantic Segmentation in the Wild.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Visual Semantic Search: Retrieving Videos via Complex Textual Queries.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

What Are You Talking About? Text-to-Image Coreference.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Beat the MTurkers: Automatic Image Labeling from Weak 3D Supervision.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

A High Performance CRF Model for Clothes Parsing.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Vision meets robotics: The KITTI dataset.
Int. J. Robotics Res., 2013

Latent Structured Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Understanding High-Level Semantics by Modeling Traffic Patterns.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Box in the Box: Joint 3D Layout and Object Reasoning from Single Images.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Holistic Scene Understanding for 3D Object Detection with RGBD Cameras.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Robust Monocular Epipolar Flow Estimation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

A Sentence Is Worth a Thousand Pixels.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Bottom-Up Segmentation for Top-Down Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Lost! Leveraging the Crowd for Probabilistic Visual Self-Localization.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
A Family of MCMC Methods on Implicitly Defined Manifolds.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Efficient Learning of Structured Predictors in General Graphical Models
CoRR, 2012

Multi-View Learning in the Presence of View Disagreement
CoRR, 2012

Globally Convergent Dual MAP LP Relaxation Solvers using Fenchel-Young Margins.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

3D Object Detection and Viewpoint Estimation with a Deformable 3D Cuboid Model.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Efficient Structured Prediction with Latent Variables for General Graphical Models.
Proceedings of the 29th International Conference on Machine Learning, 2012

Continuous Markov Random Fields for Robust Stereo Estimation.
Proceedings of the Computer Vision - ECCV 2012, 2012

Efficient Exact Inference for 3D Indoor Scene Understanding.
Proceedings of the Computer Vision - ECCV 2012, 2012

Beyond Feature Points: Structured Prediction for Monocular Non-rigid 3D Reconstruction.
Proceedings of the Computer Vision - ECCV 2012, 2012

Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

A constrained latent variable model.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Efficient structured prediction for 3D indoor scene understanding.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Are we ready for autonomous driving? The KITTI vision benchmark suite.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Learning Probabilistic Non-Linear Latent Variable Models for Tracking Complex Activities.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Joint 3D Estimation of Objects and Scene Layout.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Convex Max-Product over Compact Sets for Protein Folding.
Proceedings of the 28th International Conference on Machine Learning, 2011

Physically-based motion models for 3D tracking: A convex formulation.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Data-driven animation of hand-object interactions.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Supervised hierarchical Pitman-Yor process for natural scene segmentation.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Distributed message passing for large scale graphical models.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

A generative model for 3D urban scene understanding from movable platforms.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Factorized Orthogonal Latent Spaces.
Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, 2010

Gaussian Processes for Object Categorization.
Int. J. Comput. Vis., 2010

Approximated Structured Prediction for Learning Large Scale Graphical Models
CoRR, 2010

Implicitly Constrained Gaussian Process Regression for Monocular Non-Rigid Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Sparse Coding for Learning Interpretable Spatio-Temporal Primitives.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

A Primal-Dual Message-Passing Algorithm for Approximated Large Scale Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Learning to Recognize Objects from Unseen Modalities.
Proceedings of the Computer Vision, 2010

Sufficient dimension reduction for visual sequence classification.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Combining discriminative and generative methods for 3D deformable surface and articulated pose reconstruction.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Efficient Large-Scale Stereo Matching.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Non-linear matrix factorization with Gaussian processes.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Rank priors for continuous non-linear dimensionality reduction.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Co-training with noisy perceptual observations.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Topologically-constrained latent variable models.
Proceedings of the Machine Learning, 2008

Sparse probabilistic regression for activity-independent human pose inference.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Local deformation models for monocular 3D shape recovery.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Unsupervised feature selection via distributed coding for multi-view object recognition.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Discriminative Gaussian process latent variable model for classification.
Proceedings of the Machine Learning, 2007

Active Learning with Gaussian Processes for Object Categorization.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Modeling Human Locomotion with Topologically Constrained Latent Variable Models.
Proceedings of the Human Motion, 2007

Patch-Based Pose Inference with a Mixture of Density Estimators.
Proceedings of the Analysis and Modeling of Faces and Gestures, 2007

2006
Temporal motion models for monocular and multiview 3D human body tracking.
Comput. Vis. Image Underst., 2006

3D People Tracking with Gaussian Process Dynamical Models.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Hierarchical implicit surface joint limits for human body tracking.
Comput. Vis. Image Underst., 2005

Priors for People Tracking from Small Training Sets.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Monocular 3D Tracking of the Golf Swing.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Monocular 3-D Tracking of the Golf Swing.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Style-Based Motion Synthesis.
Comput. Graph. Forum, 2004

3D Tracking for Gait Characterization and Recognition.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

3D Human Body Tracking Using Deterministic Temporal Motion Models.
Proceedings of the Computer Vision, 2004

Hierarchical Implicit Surface Joint Limits to Constrain Video-Based Motion Capture.
Proceedings of the Computer Vision, 2004

2003
opologically controlled segmentation of 3D magnetic resonance images of the head by using morphological operators.
Pattern Recognit., 2003

Automatic Determination of Shoulder Joint Limits Using Quaternion Field Boundaries.
Int. J. Robotics Res., 2003

2002
An Automatic Method For Determining Quaternion Field Boundaries for Ball-and-Socket Joint Limits.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2001
Segmentation of 3D head MR images using morphological reconstruction under constraints and automatic selection of markers.
Proceedings of the 2001 International Conference on Image Processing, 2001


  Loading...