Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

VLSlice: Interactive Vision-and-Language Slice Discovery.

[BibT_eX]

[DOI]

Eric Slyman

Minsuk Kahng

Stefan Lee

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Navigating to Objects Specified by Images.

[BibT_eX]

[DOI]

Devendra Singh Chaplot

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Behavioral Analysis of Vision-and-Language Navigation Agents.

[BibT_eX]

[DOI]

Zijiao Yang

Arjun Majumdar

Stefan Lee

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Iterative Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Instance-Specific Image Goal Navigation: Training Embodied Agents to Find Object Instances.

[BibT_eX]

[DOI]

Devendra Singh Chaplot

CoRR, 2022

Retrospectives on the Embodied AI Workshop.

[BibT_eX]

[DOI]

CoRR, 2022

PROMPT: Learning Dynamic Resource Allocation Policies for Edge-Network Applications.

[BibT_eX]

[DOI]

CoRR, 2022

Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments.

[BibT_eX]

[DOI]

Jacob Krantz

Stefan Lee

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Piecewise-constant Neural ODEs.

[BibT_eX]

[DOI]

Sam Greydanus

Stefan Lee

Alan Fern

CoRR, 2021

Deep Convolution for Irregularly Sampled Temporal Point Clouds.

[BibT_eX]

[DOI]

CoRR, 2021

Auxiliary Tasks for Efficient Learning of Point-Goal Navigation.

[BibT_eX]

[DOI]

Saurabh Satish Desai

Stefan Lee

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

THDA: Treasure Hunt Data Augmentation for Semantic Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Waypoint Models for Instruction-guided Navigation in Continuous Environments.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Multilingual Translation by Representation and Gradient Regularization.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2020

Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views.

[BibT_eX]

[DOI]

CoRR, 2020

Language-Conditioned Imitation Learning for Robot Manipulation Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

On the Sub-Layer Functionalities of Transformer Decoder.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Where Are You? Localization from Embodied Dialog.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

12-in-1: Multi-Task Vision and Language Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Sim-to-Real Transfer for Vision-and-Language Navigation.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

2019

Visual Dialog.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation.

[BibT_eX]

[DOI]

CoRR, 2019

Question-Conditioned Counterfactual Image Generation for VQA.

[BibT_eX]

[DOI]

Jingjing Pan

Yash Goyal

Stefan Lee

CoRR, 2019

Decentralized Distributed PPO: Solving PointGoal Navigation.

[BibT_eX]

[DOI]

CoRR, 2019

Emergence of Compositional Language with Deep Generational Transmission.

[BibT_eX]

[DOI]

CoRR, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.

[BibT_eX]

[DOI]

Ramprasaath R. Selvaraju

CoRR, 2019

EvalAI: Towards Better Evaluation Systems for AI Agents.

[BibT_eX]

[DOI]

Deshraj Yadav

Rishabh Jain

Harsh Agrawal

Prithvijit Chattopadhyay

CoRR, 2019

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Chasing Ghosts: Instruction Following as Bayesian State Tracking.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Trainable Decoding of Sets of Sequences for Neural Sequence Models.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Counterfactual Visual Explanations.

[BibT_eX]

[DOI]

Proceedings of the 36th International Conference on Machine Learning, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.

[BibT_eX]

[DOI]

Ramprasaath Ramasamy Selvaraju

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

nocaps: novel object captioning at scale.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Embodied Question Answering in Photorealistic Environments With Point Cloud Perception.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Audio Visual Scene-Aware Dialog.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Overcoming Language Priors in Visual Question Answering with Adversarial Regularization.

[BibT_eX]

[DOI]

Sainandan Ramakrishnan

Aishwarya Agrawal

Stefan Lee

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learn from Your Neighbor: Learning Multi-modal Mappings from Sparse Annotations.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Machine Learning, 2018

Graph R-CNN for Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance.

[BibT_eX]

[DOI]

Ramprasaath R. Selvaraju

Prithvijit Chattopadhyay

Proceedings of the Computer Vision - ECCV 2018, 2018

Embodied Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Neural Modular Control for Embodied Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Diverse Beam Search for Improved Description of Complex Scenes.

[BibT_eX]

[DOI]

Ashwin K. Vijayakumar

Michael Cogswell

Ramprasaath R. Selvaraju

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

Evaluating Visual Conversational Agents via Cooperative Human-AI Games.

[BibT_eX]

[DOI]

Prithvijit Chattopadhyay

Proceedings of the Fifth AAAI Conference on Human Computation and Crowdsourcing, 2017

The Promise of Premise: Harnessing Question Premises in Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning.

[BibT_eX]

[DOI]

Qing Sun

Stefan Lee

Dhruv Batra

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Recognizing Landmarks in Large-Scale Social Image Collections.

[BibT_eX]

[DOI]

David J. Crandall

Yunpeng Li

Stefan Lee

Daniel P. Huttenlocher

Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models.

[BibT_eX]

[DOI]

Ashwin K. Vijayakumar

Michael Cogswell

Ramprasaath R. Selvaraju

CoRR, 2016

Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks.

[BibT_eX]

[DOI]

CoRR, 2015

Predicting Geo-informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks.

[BibT_eX]

[DOI]

Stefan Lee

Haipeng Zhang

David J. Crandall

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Linking Past to Present: Discovering Style in Two Centuries of Architecture.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computational Photography, 2015

2014

Estimating bedrock and surface layer boundaries and confidence intervals in ice sheet radar imagery using MCMC.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

This Hand Is My Hand: A Probabilistic Approach to Hand Disambiguation in Egocentric Video.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

Stefan Lee

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...