Carl Vondrick

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning the Predictability of the Future.

[BibT_eX]

[DOI]

Didac Suris

Ruoshi Liu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generative Interventions for Causal Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Goals From Failure.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

The Boombox: Visual Reconstruction from Acoustic Vibrations.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Towards a Unifying Framework for Formal Theories of Novelty.

[BibT_eX]

[DOI]

Terrance E. Boult

Przemyslaw A. Grabowicz

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Moments in Time Dataset: One Million Videos for Event Understanding.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2020

Globetrotter: Unsupervised Multilingual Translation from Visual Alignment.

[BibT_eX]

[DOI]

Dídac Surís

CoRR, 2020

A Unifying Framework for Formal Theories of Novelty: Framework, Examples and Discussion.

[BibT_eX]

[DOI]

Terrance E. Boult

Przemyslaw A. Grabowicz

CoRR, 2020

Analogical Reasoning for Visually Grounded Language Acquisition.

[BibT_eX]

[DOI]

CoRR, 2020

Video Representations of Goals Emerge from Watching Failure.

[BibT_eX]

[DOI]

CoRR, 2020

Listening to Sounds of Silence for Speech Denoising.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Visual Hide and Seek.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Artificial Life, 2020

Learning to Learn Words from Visual Scenes.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Multitask Learning Strengthens Adversarial Robustness.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Oops! Predicting Unintentional Action in Video.

[BibT_eX]

[DOI]

Boyuan Chen

Sudheendra Vijayanarasimhan

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Bringing Engineering Rigor to Deep Learning.

[BibT_eX]

[DOI]

ACM SIGOPS Oper. Syst. Rev., 2019

Learning to Learn Words from Narrated Video.

[BibT_eX]

[DOI]

CoRR, 2019

DeepBase: Deep Inspection of Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 International Conference on Management of Data, 2019

Metric Learning for Adversarial Robustness.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

VideoBERT: A Joint Model for Video and Language Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Relational Action Forecasting.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Cross-Modal Scene Networks.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2018

The Sound of Pixels.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Tracking Emerges by Colorizing Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Actor-Centric Relation Network.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Predictive vision.

[BibT_eX]

[DOI]

PhD thesis, 2017

See, Hear, and Read: Deep Aligned Representations.

[BibT_eX]

[DOI]

Yusuf Aytar

CoRR, 2017

Following Gaze in Video.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Generating the Future with Adversarial Transformers.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Do We Need More Training Data?

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Visualizing Object Detection Features.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

Following Gaze Across Views.

[BibT_eX]

[DOI]

CoRR, 2016

Who is Mistaken?

[BibT_eX]

[DOI]

Benjamin Eysenbach

CoRR, 2016

Generating Videos with Scene Dynamics.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

SoundNet: Learning Sound Representations from Unlabeled Video.

[BibT_eX]

[DOI]

Yusuf Aytar

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Anticipating Visual Representations from Unlabeled Video.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Predicting Motivations of Actions by Leveraging Text.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Anticipating the future by watching unlabeled video.

[BibT_eX]

[DOI]

CoRR, 2015

Learning visual biases from human imagination.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Where are they looking?

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014

Acquiring Visual Classifiers from Human Imagination.

[BibT_eX]

[DOI]

CoRR, 2014

Inferring the Why in Images.

[BibT_eX]

[DOI]

CoRR, 2014

Assessing the Quality of Actions.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

2013

Efficiently Scaling up Crowdsourced Video Annotation - A Set of Best Practices for High Quality, Economical Video Labeling.

[BibT_eX]

[DOI]

Donald J. Patterson

Deva Ramanan

Int. J. Comput. Vis., 2013

HOGgles: Visualizing Object Detection Features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Inverting and Visualizing Features for Object Detection

[BibT_eX]

[DOI]

CoRR, 2012

Do We Need More Training Data or Better Models for Object Detection?.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2012

2011

Video Annotation and Tracking with Active Learning.

[BibT_eX]

[DOI]

Deva Ramanan

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

A large-scale benchmark dataset for event recognition in surveillance video.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010

Efficiently Scaling Up Video Annotation with Crowdsourced Marketplaces.

[BibT_eX]

[DOI]