João Carreira

Orcid: 0000-0002-0207-1254

Affiliations:
  • DeepMind, Google, London, UK


According to our database, João Carreira authored at least 43 papers between 2015 and 2024.

Bibliography

2024
BootsTAP: Bootstrapped Training for Tracking-Any-Point.
CoRR, 2024

2023
Transframer: Arbitrary Frame Prediction with Generative Models.
Trans. Mach. Learn. Res., 2023

Perception Test 2023: A Summary of the First Challenge And Outcome.
CoRR, 2023

Learning from One Continuous Video Stream.
CoRR, 2023

Zorro: the masked multimodal transformer.
CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-supervised video pretraining yields robust and more human-aligned visual representations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Self-supervised video pretraining yields strong image representations.
CoRR, 2022

Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods.
CoRR, 2022

Hierarchical Perceiver.
CoRR, 2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

General-purpose, long-context autoregressive modeling with Perceiver AR.
Proceedings of the International Conference on Machine Learning, 2022

Perceiver IO: A General Architecture for Structured Inputs & Outputs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Learning Universal Audio Representations.
Proceedings of the IEEE International Conference on Acoustics, 2022

Object Discovery and Representation Networks.
Proceedings of the Computer Vision - ECCV 2022, 2022

Input-level Inductive Biases for 3D Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compressed Vision for Efficient Video Understanding.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Perceiver: General Perception with Iterative Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Visual Pretraining with Contrastive Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Short Note on the Kinetics-700-2020 Human Action Dataset.
CoRR, 2020

The AVA-Kinetics Localized Human Actions Video Dataset.
CoRR, 2020

Visual Grounding in Video for Unsupervised Word Translation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sideways: Depth-Parallel Training of Video Models.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
A Short Note on the Kinetics-700 Human Action Dataset.
CoRR, 2019

Controllable Attention for Structured Layered Video Decomposition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Video Action Transformer Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Visual Centrifuge: Model-Free Layered Video Representations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
A Short Note about Kinetics-600.
CoRR, 2018

A Better Baseline for AVA.
CoRR, 2018

Massively Parallel Video Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Learning Category-Specific Deformable 3D Models for Object Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

The Kinetics Human Action Video Dataset.
CoRR, 2017

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
The three R's of computer vision: Recognition, reconstruction and reorganization.
Pattern Recognit. Lett., 2016

Human Pose Estimation with Iterative Error Feedback.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Shape and Symmetry Induction for 3D Objects.
CoRR, 2015

Pose Induction for Novel Object Categories.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Amodal Completion and Size Constancy in Natural Scenes.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to See by Moving.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Category-specific object reconstruction from a single image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Virtual view networks for object reconstruction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

