João Carreira

Orcid: 0000-0002-0207-1254

Affiliations:

DeepMind, Google, London, UK

According to our database¹, João Carreira authored at least 43 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

BootsTAP: Bootstrapped Training for Tracking-Any-Point.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Transframer: Arbitrary Frame Prediction with Generative Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Perception Test 2023: A Summary of the First Challenge And Outcome.

[BibT_eX]

[DOI]

CoRR, 2023

Learning from One Continuous Video Stream.

[BibT_eX]

[DOI]

CoRR, 2023

Zorro: the masked multimodal transformer.

[BibT_eX]

[DOI]

Jean-Baptiste Alayrac

CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-supervised video pretraining yields robust and more human-aligned visual representations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Self-supervised video pretraining yields strong image representations.

[BibT_eX]

[DOI]

CoRR, 2022

Where Should I Spend My FLOPS? Efficiency Evaluations of Visual Pre-training Methods.

[BibT_eX]

[DOI]

CoRR, 2022

Hierarchical Perceiver.

[BibT_eX]

[DOI]

CoRR, 2022

TAP-Vid: A Benchmark for Tracking Any Point in a Video.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

General-purpose, long-context autoregressive modeling with Perceiver AR.

[BibT_eX]

[DOI]

Jean-Baptiste Alayrac

João Carreira

Jesse H. Engel

Proceedings of the International Conference on Machine Learning, 2022

Perceiver IO: A General Architecture for Structured Inputs & Outputs.

[BibT_eX]

[DOI]

Andrew Jaegle

Sebastian Borgeaud

Jean-Baptiste Alayrac

Proceedings of the Tenth International Conference on Learning Representations, 2022

Towards Learning Universal Audio Representations.

[BibT_eX]

[DOI]

Jean-Baptiste Alayrac

Sander Dieleman

João Carreira

Aäron van den Oord

Proceedings of the IEEE International Conference on Acoustics, 2022

Object Discovery and Representation Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Input-level Inductive Biases for 3D Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compressed Vision for Efficient Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2022, 2022

2021

Perceiver: General Perception with Iterative Attention.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Visual Pretraining with Contrastive Detection.

[BibT_eX]

[DOI]

Olivier J. Hénaff

Skanda Koppula

Jean-Baptiste Alayrac

Aäron van den Oord

Oriol Vinyals

João Carreira

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Gradient Forward-Propagation for Large-Scale Temporal Video Modelling.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

A Short Note on the Kinetics-700-2020 Human Action Dataset.

[BibT_eX]

[DOI]

CoRR, 2020

The AVA-Kinetics Localized Human Actions Video Dataset.

[BibT_eX]

[DOI]

CoRR, 2020

Visual Grounding in Video for Unsupervised Word Translation.

[BibT_eX]

[DOI]

Gunnar A. Sigurdsson

Jean-Baptiste Alayrac

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Sideways: Depth-Parallel Training of Video Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

A Short Note on the Kinetics-700 Human Action Dataset.

[BibT_eX]

[DOI]

CoRR, 2019

Controllable Attention for Structured Layered Video Decomposition.

[BibT_eX]

[DOI]

Jean-Baptiste Alayrac

João Carreira

Relja Arandjelovic

Andrew Zisserman

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Video Action Transformer Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Visual Centrifuge: Model-Free Layered Video Representations.

[BibT_eX]

[DOI]

Jean-Baptiste Alayrac

João Carreira

Andrew Zisserman

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

A Short Note about Kinetics-600.

[BibT_eX]

[DOI]

CoRR, 2018

A Better Baseline for AVA.

[BibT_eX]

[DOI]

CoRR, 2018

Massively Parallel Video Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Learning Category-Specific Deformable 3D Models for Object Reconstruction.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2017

The Kinetics Human Action Video Dataset.

[BibT_eX]

[DOI]

Sudheendra Vijayanarasimhan

CoRR, 2017

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset.

[BibT_eX]

[DOI]

João Carreira

Andrew Zisserman

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

The three R's of computer vision: Recognition, reconstruction and reorganization.

[BibT_eX]

[DOI]

Jitendra Malik

Pablo Andrés Arbeláez

Pattern Recognit. Lett., 2016

Human Pose Estimation with Iterative Error Feedback.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Shape and Symmetry Induction for 3D Objects.

[BibT_eX]

[DOI]

CoRR, 2015

Pose Induction for Novel Object Categories.

[BibT_eX]

[DOI]

Shubham Tulsiani

João Carreira

Jitendra Malik

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Amodal Completion and Size Constancy in Natural Scenes.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to See by Moving.

[BibT_eX]

[DOI]

Pulkit Agrawal

João Carreira

Jitendra Malik

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Category-specific object reconstruction from a single image.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Virtual view networks for object reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

João Carreira

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...