Carl Doersch

According to our database, Carl Doersch authored at least 36 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation.
CoRR, 2024

TAPVid-3D: A Benchmark for Tracking Any Point in 3D.
CoRR, 2024

BootsTAP: Bootstrapped Training for Tracking-Any-Point.
CoRR, 2024

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Learning from One Continuous Video Stream.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Perceiver IO: A General Architecture for Structured Inputs & Outputs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Input-level Inductive Biases for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


2021
Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs.
CoRR, 2021

2020
Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CrossTransformers: spatially-aware few-shot transfer.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Sim2real transfer learning for 3D pose estimation: motion to the rescue.
CoRR, 2019

Data-Efficient Image Recognition with Contrastive Predictive Coding.
CoRR, 2019

Sim2real transfer learning for 3D human pose estimation: motion to the rescue.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Structured agents for physical construction.
Proceedings of the 36th International Conference on Machine Learning, 2019

Video Action Transformer Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exploiting Temporal Context for 3D Human Pose Estimation in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
The Visual QA Devil in the Details: The Impact of Early Fusion and Batch Norm on CLEVR.
CoRR, 2018

A Better Baseline for AVA.
CoRR, 2018

Kickstarting Deep Reinforcement Learning.
CoRR, 2018

Learning Visual Question Answering by Bootstrapping Hard Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Multi-task Self-Supervised Visual Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Supervision Beyond Manual Annotations for Learning Visual Representations.
PhD thesis, 2016

Data-dependent Initializations of Convolutional Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Tutorial on Variational Autoencoders.
CoRR, 2016

An Uncertain Future: Forecasting from Static Images Using Variational Autoencoders.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Mid-level Elements for Object Detection.
CoRR, 2015

What makes Paris look like Paris?
Commun. ACM, 2015

Unsupervised Visual Representation Learning by Context Prediction.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Context as Supervisory Signal: Discovering Objects with Predictable Context.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Mid-level Visual Element Discovery as Discriminative Mode Seeking.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012
Bounding the Probability of Error for High Precision Optical Character Recognition.
J. Mach. Learn. Res., 2012

2010
Improving state-of-the-art OCR through high-precision document-specific modeling.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

