Aude Oliva

Affiliations:
  • MIT, Cambridge, MA, USA


According to our database1, Aude Oliva authored at least 99 papers between 1999 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs.
CoRR, 2024

Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning.
CoRR, 2024

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models.
CoRR, 2024

Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LangNav: Language as a Perceptual Representation for Navigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Artifact magnification on deepfake videos increases human detection and subjective confidence.
CoRR, 2023

The Algonauts Project 2023 Challenge: How the Human Brain Makes Sense of Natural Scenes.
CoRR, 2023

Learning Human Action Recognition Representations Without Real Humans.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Leveraging Temporal Context in Low Representational Power Regimes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines.
CoRR, 2022

How Transferable are Video Representations Based on Synthetic Data?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


Cross-Modal Discrete Representation Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

IA-RED<sup>2</sup>: Interpretability-Aware Redundancy Reduction for Vision Transformers.
CoRR, 2021

The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion.
CoRR, 2021

Memorability: An image-computable measure of information utility.
CoRR, 2021

Paint by Word.
CoRR, 2021

All at Once Network Quantization via Collaborative Knowledge Transfer.
CoRR, 2021

VA-RED<sup>2</sup>: Video Adaptive Redundancy Reduction.
CoRR, 2021

IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

VA-RED2: Video Adaptive Redundancy Reduction.
Proceedings of the 9th International Conference on Learning Representations, 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition.
Proceedings of the 9th International Conference on Learning Representations, 2021

Dynamic Network Quantization for Efficient Video Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Parsing and Summarizing Infographics with Synthetically Trained Icon Detection.
Proceedings of the 14th IEEE Pacific Visualization Symposium, 2021

2020
Moments in Time Dataset: One Million Videos for Event Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Experiences and Insights for Collaborative Industry-Academic Research in Artificial Intelligence.
AI Mag., 2020

Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability.
Proceedings of the Computer Vision - ECCV 2020, 2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

How Much Time Do You Have? Modeling Multi-Duration Saliency.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Comparing the Interpretability of Deep Networks via Network Dissection.
Proceedings of the Explainable AI: Interpreting, 2019

Interpreting Deep Visual Representations via Network Dissection.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

What Do Different Evaluation Metrics Tell Us About Saliency Models?
IEEE Trans. Pattern Anal. Mach. Intell., 2019

The Algonauts Project.
Nat. Mach. Intell., 2019

Lore Goetschalckx, Alex Andonian, Aude Oliva, Phillip Isola: GANalyze: Toward Visual Definitions of Cognitive Image Properties.
CoRR, 2019

Cross-view Semantic Segmentation for Sensing Surroundings.
CoRR, 2019

The Algonauts Project: A Platform for Communication between the Sciences of Biological and Artificial Intelligence.
CoRR, 2019

Reasoning About Human-Object Interactions Through Dual Attention Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

GANalyze: Toward Visual Definitions of Cognitive Image Properties.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Identifying Interpretable Action Concepts in Deep Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Places: A 10 Million Image Database for Scene Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Tracking the Spatiotemporal Neural Dynamics of Real-world Object Size and Animacy in the Human Brain.
J. Cogn. Neurosci., 2018

Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics.
CoRR, 2018

Temporal Relational Reasoning in Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
BubbleView: An Interface for Crowdsourcing Image Importance Maps and Tracking Visual Attention.
ACM Trans. Comput. Hum. Interact., 2017

Dynamics of scene representations in the human brain revealed by magnetoencephalography and deep neural networks.
NeuroImage, 2017

Memorability: A stimulus-driven perceptual neural signature distinctive from memory.
NeuroImage, 2017

Understanding Infographics through Textual and Visual Tag Prediction.
CoRR, 2017

BubbleView: an alternative to eye-tracking for crowdsourcing image importance.
CoRR, 2017

Network Dissection: Quantifying Interpretability of Deep Visual Representations.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Asynchronous Data Aggregation for Training End to End Visual Control Networks.
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017

Content-Dependent Fusion: Combining Human MEG and FMRI Data to Reveal Spatiotemporal Dynamics of Animacy and Real-world Object Size.
Proceedings of the 2017 AAAI Spring Symposia, 2017

2016
Cognitive Society.
Proceedings of the Handbook of Science and Technology Convergence, 2016

Beyond Memorability: Visualization Recognition and Recall.
IEEE Trans. Vis. Comput. Graph., 2016

SUN Database: Exploring a Large Collection of Scene Categories.
Int. J. Comput. Vis., 2016

Places: An Image Database for Deep Scene Understanding.
CoRR, 2016

Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition.
CoRR, 2016

Where Should Saliency Models Look Next?
Proceedings of the Computer Vision - ECCV 2016, 2016

Learning Deep Features for Discriminative Localization.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Interaction envelope: Local spatial representations of objects at all scales in scene-selective regions.
NeuroImage, 2015

Object Detectors Emerge in Deep Scene CNNs.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Learning visual biases from human imagination.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Understanding and Predicting Image Memorability at a Large Scale.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Eye Fixation Metrics for Large Scale Evaluation and Comparison of Information Visualizations.
Proceedings of the Eye Tracking and Visualization, 2015

A Crowdsourced Alternative to Eye-tracking for Visualization Understanding.
Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, 2015

2014
What Makes a Photograph Memorable?
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Acquiring Visual Classifiers from Human Imagination.
CoRR, 2014

Learning Deep Features for Scene Recognition using Places Database.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Recognizing City Identity via Attribute Analysis of Geo-tagged Images.
Proceedings of the Computer Vision - ECCV 2014, 2014

Predicting Actions from Static Scenes.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
What Makes a Visualization Memorable?
IEEE Trans. Vis. Comput. Graph., 2013

Modifying the Memorability of Face Photographs.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Basic level scene understanding: from labels to structure and beyond.
Proceedings of the SIGGRAPH Asia 2012 Technical Briefs, Singapore, November 28, 2012

Image memorability and visual inception.
Proceedings of the SIGGRAPH Asia 2012 Technical Briefs, Singapore, November 28, 2012

Memorability of Image Regions.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Recognizing scene viewpoint using panoramic place representation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Establishing a Database for Studying Human Face Photograph Memory.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

2011
Understanding the Intrinsic Memorability of Images.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

What makes an image memorable?
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Estimating scene typicality from human ratings and image features.
Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

Canonical views of scenes depend on the shape of the space.
Proceedings of the 33th Annual Meeting of the Cognitive Science Society, 2011

2010
SUN database: Large-scale scene recognition from abbey to zoo.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2006
Hybrid images.
ACM Trans. Graph., 2006

Ethics in image manipulation Video presentations from this panel session are available on the citation page.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006

2005
Human Learning of Contextual Priors for Object Search: Where does the time go?
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2005

2003
Top-down control of visual attention in object detection.
Proceedings of the 2003 International Conference on Image Processing, 2003

2002
Depth Estimation from Image Structure.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Scene-Centered Description from Spatial Envelope Properties.
Proceedings of the Biologically Motivated Computer Vision Second International Workshop, 2002

2001
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope.
Int. J. Comput. Vis., 2001

2000
Classification of scene photographs from local orientations features.
Pattern Recognit. Lett., 2000

1999
Semantic Organization of Scenes using Discriminant Structural Templates.
Proceedings of the International Conference on Computer Vision, 1999


  Loading...