Li Fei-Fei

Arnold Milstein

Nat., 2020

Automatic detection of hand hygiene using computer vision technology.

J. Am. Medical Informatics Assoc., 2020

Learning task-oriented grasping for tool manipulation from simulated self-supervision.

Int. J. Robotics Res., 2020

Human-in-the-Loop Imitation Learning using Remote Teleoperation.

Ajay Mandlekar

CoRR, 2020

iGibson, a Simulation Environment for Interactive Tasks in Large Realistic Scenes.

Bokui Shen

Fei Xia

Chengshu Li

CoRR, 2020

Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations.

Ajay Mandlekar

CoRR, 2020

GTI: Learning to Generalize across Long-Horizon Tasks from Human Demonstrations.

Ajay Mandlekar

Proceedings of the Robotics: Science and Systems XVI, 2020

Learning Physical Graph Representations from Visual Scenes.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Vision-Based Estimation of MDS-UPDRS Gait Scores for Assessing Parkinson's Disease Motor Severity.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

6-PACK: Category-level 6D Pose Tracker with Anchor-Based Keypoints.

Chen Wang

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

KETO: Learning Keypoint Representations for Tool Manipulation.

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

IRIS: Implicit Reinforcement without Interaction at Scale for Learning Control from Offline Robot Manipulation Data.

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Motion Reasoning for Goal-Based Imitation Learning.

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Towards fairer datasets: filtering and balancing the distribution of the people subtree in the ImageNet hierarchy.

Proceedings of the FAT* '20: Conference on Fairness, Accountability, and Transparency, 2020

RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

Procedure Planning in Instructional Videos.

Proceedings of the Computer Vision - ECCV 2020, 2020

Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

A computer vision system for deep learning-based detection of patient mobilization activities in the ICU.

npj Digit. Medicine, 2019

Automated abnormality detection in lower extremity radiographs using deep learning.

Nat. Mach. Intell., 2019

Action Genome: Actions as Composition of Spatio-temporal Scene Graphs.

CoRR, 2019

Deep Bayesian Active Learning for Multiple Correct Outputs.

CoRR, 2019

IRIS: Implicit Reinforcement without Interaction at Scale for Learning Control from Offline Robot Manipulation Data.

CoRR, 2019

Bias-Resilient Neural Network.

CoRR, 2019

Causal Induction from Visual Observations for Goal Directed Tasks.

CoRR, 2019

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs.

CoRR, 2019

SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning.

CoRR, 2019

D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation.

CoRR, 2019

HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Regression Planning Networks.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Scaling Robot Supervision to Hundreds of Hours with RoboTurk: Robotic Manipulation Dataset through Human Reasoning and Dexterity.

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Continuous Relaxation of Symbolic Planner for One-Shot Imitation Learning.

Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks.

Proceedings of the International Conference on Robotics and Automation, 2019

Eidetic 3D LSTM: A Model for Video Prediction and Beyond.

Proceedings of the 7th International Conference on Learning Representations, 2019

Visual Relationships as Functions: Enabling Few-Shot Scene Graph Prediction.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Scene Graph Prediction with Limited Labels.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Situational Fusion of Visual Representation for Visual Navigation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Audio-linguistic Embeddings for Spoken Sentences.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

AI-Based Request Augmentation to Increase Crowdsourcing Participation.

Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing, 2019

DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion.

Chen Wang

Roberto Martin Martin

Cewu Lu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Composing Text and Image for Image Retrieval - an Empirical Odyssey.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Peeking Into the Future: Predicting Future Person Activities and Locations in Videos.

Junwei Liang

Lu Jiang

Alexander G. Hauptmann

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Information Maximizing Visual Question Generation.

Ranjay Krishna

Michael S. Bernstein

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation.

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

Thoracic Disease Identification and Localization with Limited Supervision.

Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics, 2019

2018

Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos.

Int. J. Comput. Vis., 2018

Vision-Based Gait Analysis for Senior Care.

CoRR, 2018

Faster CryptoNets: Leveraging Sparsity for Real-World Encrypted Inference.

CoRR, 2018

A Fully Private Pipeline for Deep Learning on Electronic Health Records.

CoRR, 2018

Privacy-Preserving Action Recognition for Smart Hospitals using Low-Resolution Depth Images.

CoRR, 2018

Measuring Depression Symptom Severity from Spoken Language and 3D Facial Expressions.

CoRR, 2018

DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer.

Joseph Suarez

CoRR, 2018

Learning to Play with Intrinsically-Motivated Self-Aware Agents.

CoRR, 2018

Scaling Human-Object Interaction Recognition Through Zero-Shot Learning.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Engagement Learning: Expanding Visual Knowledge by Engaging Online Participants.

Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology Adjunct Proceedings, 2018

Flexible neural representation for physics prediction.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Decompose and Disentangle Representations for Video Prediction.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning to Play With Intrinsically-Motivated, Self-Aware Agents.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

3D Point Cloud-Based Visual Prediction of ICU Mobility Care Activities.

Proceedings of the Machine Learning for Healthcare Conference, 2018

Neural Task Programming: Learning to Generalize Across Hierarchical Tasks.

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go?

Zhengyuan Zhou

Panayotis Mertikopoulos

Proceedings of the 35th International Conference on Machine Learning, 2018

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels.

Proceedings of the 35th International Conference on Machine Learning, 2018

HiDDeN: Hiding Data With Deep Networks.

Proceedings of the Computer Vision - ECCV 2018, 2018

Graph Distillation for Action Detection with Privileged Modalities.

Proceedings of the Computer Vision - ECCV 2018, 2018

Progressive Neural Architecture Search.

Proceedings of the Computer Vision - ECCV 2018, 2018

Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos.

Proceedings of the Computer Vision - ECCV 2018, 2018

Dynamic Task Prioritization for Multitask Learning.

Proceedings of the Computer Vision - ECCV 2018, 2018

Neural Graph Matching Networks for Fewshot 3D Action Recognition.

Proceedings of the Computer Vision - ECCV 2018, 2018

Thoracic Disease Identification and Localization With Limited Supervision.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Referring Relationships.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Image Generation From Scene Graphs.

Agrim Gupta

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Iterative Visual Reasoning Beyond Convolutions.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark.

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Emergence of Structured Behaviors from Curiosity-Based Intrinsic Motivation.

Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

2017

Using deep learning and Google Street View to estimate the demographic makeup of neighborhoods across the United States.

Proc. Natl. Acad. Sci. USA, 2017

Deep Visual-Semantic Alignments for Generating Image Descriptions.

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Evidence for similar patterns of neural activity elicited by picture- and word-based representations of natural scenes.

NeuroImage, 2017

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.

Int. J. Comput. Vis., 2017

MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels.

CoRR, 2017

Progressive Neural Architecture Search.

CoRR, 2017

Label Efficient Learning of Transferable Representations across Domains and Tasks.

CoRR, 2017

Graph Distillation for Action Detection with Privileged Information.

CoRR, 2017

Tackling Over-pruning in Variational Autoencoders.

CoRR, 2017

Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the US.

CoRR, 2017

Label Efficient Learning of Transferable Representations across Domains and Tasks.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Towards Vision-Based Smart Hospitals: A System for Tracking and Monitoring Hand Hygiene Compliance.

Proceedings of the Machine Learning for Health Care Conference, 2017

AdaPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems.

Proceedings of the Robotics Research, The 18th International Symposium, 2017

Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations.

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Target-driven visual navigation in indoor scenes using deep reinforcement learning.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Unsupervised camera localization in crowded spaces.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Visual Semantic Planning Using Deep Successor Representations.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Dense-Captioning Events in Videos.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Inferring and Executing Programs for Visual Reasoning.

Bharath Hariharan

Laurens van der Maaten

Proceedings of the IEEE International Conference on Computer Vision, 2017

Characterizing and Improving Stability in Neural Style Transfer.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach.

Timnit Gebru

Judy Hoffman

Proceedings of the IEEE International Conference on Computer Vision, 2017

Knowledge Acquisition for Visual Question Answering via Iterative Querying.

Joseph J. Lim

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Learning to Learn from Noisy Web Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Scene Graph Generation by Iterative Message Passing.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Learning of Long-Term Motion Dynamics for Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

A Hierarchical Approach for Generating Descriptive Image Paragraphs.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning.

Bharath Hariharan

Laurens van der Maaten

C. Lawrence Zitnick

Ross B. Girshick

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality.

Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, 2017

Scalable Annotation of Fine-Grained Categories Without Experts.

Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos.

Proceedings of the British Machine Vision Conference 2017, 2017

Computer Vision-based Approach to Maintain Independent Living for Seniors.

Proceedings of the AMIA 2017, 2017

Fine-Grained Car Detection for Visual Census Estimation.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Learning to Predict Human Behavior in Crowded Scenes.

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

Tracking Millions of Humans in Crowded Spaces.

Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016

ITOP Dataset.

Dataset, October, 2016

Leveraging the Wisdom of the Crowd for Fine-Grained Recognition.

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Typicality sharpens category representations in object-selective cortex.

Marius Catalin Iordan

Michelle R. Greene

NeuroImage, 2016

Crowdsourcing in Computer Vision.

Found. Trends Comput. Graph. Vis., 2016

A Glimpse Far into the Future: Understanding Long-term Crowd Worker Accuracy.

CoRR, 2016

Viewpoint Invariant 3D Human Pose Estimation with Recurrent Error Feedback.

CoRR, 2016

Toward More Gender Diversity in CS through an Artificial Intelligence Summer Program for High School Girls.

Marie E. Vachovsky

Grace Wu

Sorathan Chaturapruek

Olga Russakovsky

Richard Sommer

Proceedings of the 47th ACM Technical Symposium on Computing Science Education, 2016

Vision-Based Classification of Developmental Disorders Using Eye-Movements.

Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Visual Relationship Detection with Language Priors.

Proceedings of the Computer Vision - ECCV 2016, 2016

The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition.

Proceedings of the Computer Vision - ECCV 2016, 2016

Perceptual Losses for Real-Time Style Transfer and Super-Resolution.

Proceedings of the Computer Vision - ECCV 2016, 2016

Connectionist Temporal Modeling for Weakly Supervised Action Labeling.

De-An Huang

Proceedings of the Computer Vision - ECCV 2016, 2016

Towards Viewpoint Invariant 3D Human Pose Estimation.

Proceedings of the Computer Vision - ECCV 2016, 2016

What's the Point: Semantic Segmentation with Point Supervision.

Proceedings of the Computer Vision - ECCV 2016, 2016

Visual7W: Grounded Question Answering in Images.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

End-to-End Learning of Action Detection from Frame Glimpses in Videos.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Detecting Events and Key Actors in Multi-person Videos.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DenseCap: Fully Convolutional Localization Networks for Dense Captioning.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recurrent Attention Models for Depth-Based Person Identification.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Social LSTM: Human Trajectory Prediction in Crowded Spaces.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Embracing Error to Enable Rapid Crowdsourcing.

Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016

Vision-Based Hand Hygiene Monitoring in Hospitals.

Proceedings of the AMIA 2016, 2016

2015

RGB-W Dataset.

Dataset, December, 2015

Basic Level Category Structure Emerges Gradually across Human Ventral Visual Cortex.

Marius Catalin Iordan

Michelle R. Greene

J. Cogn. Neurosci., 2015

ImageNet Large Scale Visual Recognition Challenge.

Int. J. Comput. Vis., 2015

Building a Large-scale Multimodal Knowledge Base for Visual Question Answering.

CoRR, 2015

Visualizing and Understanding Recurrent Networks.

CoRR, 2015

SentenceRacer: A Game with a Purpose for Image Sentence Annotation.

CoRR, 2015

Improving Image Classification with Location Context.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning Temporal Embeddings for Complex Video Analysis.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Love Thy Neighbors: Image Annotation by Exploiting Image Metadata.

Lamberto Ballan

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

RGB-W: When Vision Meets Wireless.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Best of both worlds: Human-machine collaboration for object annotation.

Olga Russakovsky

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning semantic relationships for better action retrieval in images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Fine-grained recognition without part annotations.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Image retrieval using scene graphs.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Generating Semantically Precise Scene Graphs from Textual Descriptions for Improved Image Retrieval.

Christopher D. Manning

Proceedings of the Fourth Workshop on Vision and Language, 2015

2014

Object Bank: An Object-Level Image Representation for High-Level Visual Recognition.

Int. J. Comput. Vis., 2014

VideoSET: Video Summary Evaluation through Text.

Serena Yeung

Alireza Fathi

CoRR, 2014

Affordances Provide a Fundamental Categorization Principle for Visual Scenes.

Michelle R. Greene

Christopher Baldassano

Andre Esteva

CoRR, 2014

Visual Noise from Natural Scene Statistics Reveals Human Scene Category Representations.

CoRR, 2014

Understanding the 3D layout of a cluttered room from multiple images.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Deep Fragment Embeddings for Bidirectional Image Sentence Mapping.

Armand Joulin

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Learning Features and Parts for Fine-Grained Recognition.

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Reasoning about Object Affordances in a Knowledge Base Representation.

Alireza Fathi

Proceedings of the Computer Vision - ECCV 2014, 2014

Linking People in Videos with "Their" Names Using Coreference Resolution.

Proceedings of the Computer Vision - ECCV 2014, 2014

Efficient Image and Video Co-localization with Frank-Wolfe Algorithm.

Armand Joulin

Kevin D. Tang

Proceedings of the Computer Vision - ECCV 2014, 2014

Co-localization in Real-World Images.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Large-Scale Video Classification with Convolutional Neural Networks.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Socially-Aware Large-Scale Crowd Forecasting.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Discovering the Signatures of Joint Attention in Child-Caregiver Interaction.

Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014

Scalable multi-label annotation.

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014

Social Role Recognition for Human Event Understanding.

Proceedings of the Human-Centered Social Media Analytics, 2014

Integrating Randomization and Discrimination for Classifying Human-Object Interaction Activities.

Aditya Khosla

Proceedings of the Human-Centered Social Media Analytics, 2014

2013

Differential connectivity within the Parahippocampal Place Area.

Christopher Baldassano

NeuroImage, 2013

TRECVID 2013 GENIE: Multimedia Event Detection and Recounting.

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Object discovery in 3D scenes via shape analysis.

Stephen D. Miller

Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

3D Object Representations for Fine-Grained Categorization.

Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Discovering Object Functionality.

Jiayuan Ma

Proceedings of the IEEE International Conference on Computer Vision, 2013

Combining the Right Features for Complex Event Recognition.

Proceedings of the IEEE International Conference on Computer Vision, 2013

Detecting Avocados to Zucchinis: What Have We Done, and Where Are We Going?

Proceedings of the IEEE International Conference on Computer Vision, 2013

Video Event Understanding Using Natural Language Descriptions.

Percy Liang

Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative Segment Annotation in Weakly Labeled Video.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Social Role Discovery in Human Events.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Fine-Grained Crowdsourcing for Fine-Grained Recognition.

Jia Deng

Jonathan Krause

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Free your Camera: 3D Indoor Scene Understanding from Arbitrary Camera Motion.

Proceedings of the British Machine Vision Conference, 2013

2012

Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses.

IEEE Trans. Pattern Anal. Mach. Intell., 2012

Voxel-level functional connectivity using spatial regularization.

Christopher Baldassano

Marius Catalin Iordan

NeuroImage, 2012

TRECVID 2012 GENIE: Multimedia Event Detection and Recounting.

Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Shifting Weights: Adapting Object Detectors from Image to Video.

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Web image prediction using multivariate point processes.

Gunhee Kim

Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Efficient Euclidean Projections onto the Intersection of Norm Balls.

Adams Wei Yu

Hao Su

Proceedings of the 29th International Conference on Machine Learning, 2012

Crowdsourcing Annotations for Visual Object Detection.

Hao Su

Jia Deng

Proceedings of the 4th Human Computation Workshop, 2012

Action Recognition with Exemplar Based 2.5D Graph Matching.

Proceedings of the Computer Vision - ECCV 2012, 2012

Object-Centric Spatial Pooling for Image Classification.

Proceedings of the Computer Vision - ECCV 2012, 2012

A codebook-free and annotation-free approach for fine-grained image categorization.

Gary R. Bradski

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Learning latent temporal structure for complex event detection.

Kevin D. Tang

Daphne Koller

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition.

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Multi-Level Structured Image Coding on High-Dimensional Image Representation.

Proceedings of the Computer Vision, 2012

2011

ReVision: automated classification, analysis and redesign of chart images.

Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, 2011

GENIE TRECVID 2011 Multimedia Event Detection: Late-Fusion Approaches to Combine Multiple Audio-Visual features.

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Large-Scale Category Structure Aware Image Categorization.

Bin Zhao

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Fast and Balanced: Efficient Label Tree Learning for Large Scale Object Recognition.

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Human action recognition by learning bases of action attributes and parts.

Proceedings of the IEEE International Conference on Computer Vision, 2011

Distributed cosegmentation via submodular optimization on anisotropic diffusion.

Proceedings of the IEEE International Conference on Computer Vision, 2011

Online detection of unusual events in videos via dynamic sparse coding.

Bin Zhao

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Combining randomization and discrimination for fine-grained image categorization.

Aditya Khosla

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Hierarchical semantic indexing for large scale image retrieval.

Jia Deng

Alexander C. Berg

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Multi-view Object Categorization and Pose Estimation.

Proceedings of the Computer Vision: Detection, Recognition and Reconstruction, 2010

What, Where and Who? Telling the Story of an Image by Activity Classification, Scene Recognition and Object Categorization.

Proceedings of the Computer Vision: Detection, Recognition and Reconstruction, 2010

Learning Object Categories From Internet Image Searches.

Proc. IEEE, 2010

OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning.

Int. J. Comput. Vis., 2010

Large Margin Learning of Upstream Scene Understanding Models.

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification.

Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Image Segmentation with Topic Random Field.

Bin Zhao

Proceedings of the Computer Vision - ECCV 2010, 2010

Attribute Learning in Large-Scale Datasets.

Olga Russakovsky

Proceedings of the Trends and Topics in Computer Vision, 2010

Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification.

Chih-Wei Chen

Proceedings of the Computer Vision, 2010

Objects as Attributes for Scene Classification.

Proceedings of the Trends and Topics in Computer Vision, 2010

What Does Classifying More Than 10,000 Image Categories Tell Us?

Proceedings of the Computer Vision - ECCV 2010, 2010

Modeling mutual context of object and human pose in human-object interaction activities.

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Grouplet: A structured image representation for recognizing human and object interactions.

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora.

Richard Socher

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Efficient extraction of human motion volumes by tracking.

Bohyung Han

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Building and using a semantivisual image hierarchy.

Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

Hierarchical Mixture of Classification Experts Uncovers Interactions between Brain Regions.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Exploring Functional Connectivities of the Human Brain using Multivariate Information Analysis.

Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories.

Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Mining discriminative adjectives and prepositions for natural scene recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Simultaneous image classification and annotation.

Chong Wang

David M. Blei

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

A multi-view probabilistic model for 3D object classes.

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Towards total scene understanding: Classification, annotation and segmentation in an automatic framework.

Richard Socher

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

ImageNet: A large-scale hierarchical image database.

Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008

Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words.

Hongcheng Wang

Int. J. Comput. Vis., 2008

Variational Transform Invariant Mixture of Probabilistic PCA.

Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

View Synthesis for Recognizing Unseen Poses of Object Classes.

Proceedings of the Computer Vision, 2008

Extracting Moving People from Internet Videos.

Proceedings of the Computer Vision, 2008

Towards Scalable Dataset Construction: An Active Learning Approach.

Proceedings of the Computer Vision, 2008

2007

Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories.

Robert Fergus

Comput. Vis. Image Underst., 2007

3D generic object categorization, localization and pose estimation.

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

What, where and who? Classifying events by scene and object recognition.

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes.

Liangliang Cao

Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

A Hierarchical Model of Shape and Appearance for Human Action Classification.

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

OPTIMOL: automatic Online Picture collecTion via Incremental MOdel Learning.

Gang Wang

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

OPTIMOL: A Framework for Online Picture Collection via Incremental Model Learning.

Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006

One-Shot Learning of Object Categories.

Robert Fergus

IEEE Trans. Pattern Anal. Mach. Intell., 2006

Variational Shift Invariant Probabilistic PCA for Face Recognition.

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Audio-Visual Speaker Localization Using Graphical Models.

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Using Dependent Regions for Object Categorization in a Generative Framework.

Gang Wang

Ye Zhang

Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005

Learning Object Categories from Google's Image Search.

Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A Bayesian Hierarchical Model for Learning Natural Scene Categories.

Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004

What do reflections tell us about the shape of a mirror?

Proceedings of the 1st Symposium on Applied Perception in Graphics and Visualization, 2004

2003

A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories.

Robert Fergus