Carl Vondrick

Orcid: 0000-0003-1139-9208

According to our database1, Carl Vondrick authored at least 115 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Task Bias in Contrastive Vision-Language Models.
Int. J. Comput. Vis., June, 2024

Self-Improving Autonomous Underwater Manipulation.
CoRR, 2024

Differentiable Robot Rendering.
CoRR, 2024

Dreamitate: Real-World Visuomotor Policy Learning via Video Generation.
CoRR, 2024

See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding.
CoRR, 2024

PaperBot: Learning to Design Real-World Tools Using Paper.
CoRR, 2024

MIRACLE: An Online, Explainable Multimodal Interactive Concept Learning System.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SelfIE: Self-Interpretation of Large Language Model Embeddings.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Raidar: geneRative AI Detection viA Rewriting.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

INViTE: INterpret and Control Vision-Language Models with Text Explanations.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Controlling the World by Sleight of Hand.
Proceedings of the Computer Vision - ECCV 2024, 2024

How Video Meetings Change Your Expression.
Proceedings of the Computer Vision - ECCV 2024, 2024

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

Evolving Interpretable Visual Classifiers with Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

EraseDraw: Learning to Insert Objects by Erasing Them from Images.
Proceedings of the Computer Vision - ECCV 2024, 2024

pix2gestalt: Amodal Segmentation by Synthesizing Wholes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GES: Generalized Exponential Splatting for Efficient Radiance Field Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Seeing Science: Inquiry-Based Learning at Home Through Mobile Messaging System.
Proceedings of the 23rd Annual ACM Interaction Design and Children Conference, 2024

2023
Interpreting and Controlling Vision Foundation Models via Text Explanations.
CoRR, 2023

ClimSim: An open large-scale dataset for training high-resolution physics emulators in hybrid multi-scale climate simulators.
CoRR, 2023

Affective Faces for Goal-Driven Dyadic Communication.
CoRR, 2023


Objaverse-XL: A Universe of 10M+ 3D Objects.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Robust Perception through Equivariance.
Proceedings of the International Conference on Machine Learning, 2023

Visual Classification via Description from Large Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Understanding Zero-shot Adversarial Robustness for Large-Scale Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ViperGPT: Visual Inference via Python Execution for Reasoning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SurfsUp: Learning Fluid Simulation for Novel Surfaces.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zero-1-to-3: Zero-shot One Image to 3D Object.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Landscape Learning for Neural Network Inversion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Muscles in Action.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FLEX: Full-Body Grasping Without Full-Body Grasps.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Doubly Right Object Recognition: A Why Prompt for Visual Rationales.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Humans as Light Bulbs: 3D Human Reconstruction from Thermal Reflection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

What You Can Reconstruct from a Shadow.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Tracking Through Containers and Occluders in the Wild.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Fully body visual self-modeling of robot morphologies.
Sci. Robotics, 2022

Adversarially Robust Video Perception by Seeing Motion.
CoRR, 2022

Task Bias in Vision-Language Models.
CoRR, 2022

Shadows Shed Light on 3D Objects.
CoRR, 2022

There is a Time and Place for Reasoning Beyond the Image.
CoRR, 2022

Forget-me-not! Contrastive critics for mitigating posterior collapse.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Representing Spatial Trajectories as Distributions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Private Multiparty Perception for Navigation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Discrete Representations Strengthen Vision Transformer Robustness.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Real-Time Neural Voice Camouflage.
Proceedings of the Tenth International Conference on Learning Representations, 2022

It's Time for Artistic Correspondence in Music and Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Globetrotter: Connecting Languages by Connecting Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

UnweaveNet: Unweaving Activity Stories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Causal Transportability for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Revealing Occlusions with 4D Neural Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

There's a Time and Place for Reasoning Beyond the Image.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Full-Body Visual Self-Modeling of Robot Morphologies.
CoRR, 2021

RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

Adversarial Attacks are Reversible with Natural Supervision.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dissecting Image Crops.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning the Predictability of the Future.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generative Interventions for Causal Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Goals From Failure.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

The Boombox: Visual Reconstruction from Acoustic Vibrations.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

Towards a Unifying Framework for Formal Theories of Novelty.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Moments in Time Dataset: One Million Videos for Event Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Globetrotter: Unsupervised Multilingual Translation from Visual Alignment.
CoRR, 2020

A Unifying Framework for Formal Theories of Novelty: Framework, Examples and Discussion.
CoRR, 2020

Analogical Reasoning for Visually Grounded Language Acquisition.
CoRR, 2020

Video Representations of Goals Emerge from Watching Failure.
CoRR, 2020

Listening to Sounds of Silence for Speech Denoising.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Visual Hide and Seek.
Proceedings of the 2020 Conference on Artificial Life, 2020

Learning to Learn Words from Visual Scenes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Multitask Learning Strengthens Adversarial Robustness.
Proceedings of the Computer Vision - ECCV 2020, 2020

We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

Oops! Predicting Unintentional Action in Video.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Bringing Engineering Rigor to Deep Learning.
ACM SIGOPS Oper. Syst. Rev., 2019

Learning to Learn Words from Narrated Video.
CoRR, 2019

DeepBase: Deep Inspection of Neural Networks.
Proceedings of the 2019 International Conference on Management of Data, 2019

Metric Learning for Adversarial Robustness.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

VideoBERT: A Joint Model for Video and Language Representation Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Relational Action Forecasting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Cross-Modal Scene Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

The Sound of Pixels.
Proceedings of the Computer Vision - ECCV 2018, 2018

Tracking Emerges by Colorizing Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Actor-Centric Relation Network.
Proceedings of the Computer Vision - ECCV 2018, 2018

AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Predictive vision.
PhD thesis, 2017

See, Hear, and Read: Deep Aligned Representations.
CoRR, 2017

Following Gaze in Video.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Generating the Future with Adversarial Transformers.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Do We Need More Training Data?
Int. J. Comput. Vis., 2016

Visualizing Object Detection Features.
Int. J. Comput. Vis., 2016

Following Gaze Across Views.
CoRR, 2016

Who is Mistaken?
CoRR, 2016

Generating Videos with Scene Dynamics.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

SoundNet: Learning Sound Representations from Unlabeled Video.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Anticipating Visual Representations from Unlabeled Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Predicting Motivations of Actions by Leveraging Text.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Anticipating the future by watching unlabeled video.
CoRR, 2015

Learning visual biases from human imagination.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Where are they looking?
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
Acquiring Visual Classifiers from Human Imagination.
CoRR, 2014

Inferring the Why in Images.
CoRR, 2014

Assessing the Quality of Actions.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Efficiently Scaling up Crowdsourced Video Annotation - A Set of Best Practices for High Quality, Economical Video Labeling.
Int. J. Comput. Vis., 2013

HOGgles: Visualizing Object Detection Features.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Inverting and Visualizing Features for Object Detection
CoRR, 2012

Do We Need More Training Data or Better Models for Object Detection?.
Proceedings of the British Machine Vision Conference, 2012

2011
Video Annotation and Tracking with Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011


AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Efficiently Scaling Up Video Annotation with Crowdsourced Marketplaces.
Proceedings of the Computer Vision, 2010


  Loading...