Stefan Lee

Orcid: 0000-0001-5953-1963

According to our database1, Stefan Lee authored at least 79 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
You Never Know: Quantization Induces Inconsistent Biases in Vision-Language Foundation Models.
CoRR, 2024

Benchmarking Out-of-Distribution Detection in Visual Question Answering.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Point Cloud Models Improve Visual Robustness in Robotic Learners.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AUTOSGM: A Unified Lowpass Regularization Framework for Accelerated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Viewpoint-Aware Visual Grounding in 3D Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Language-Informed Beam Search Decoding for Multilingual Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
PROMPT: Learning dynamic resource allocation policies for network applications.
Future Gener. Comput. Syst., August, 2023

Emergence of Maps in the Memories of Blind Navigation Agents.
AI Matters, June, 2023

Effective Entity Augmentation By Querying External Data Sources.
Proc. VLDB Endow., 2023

Generating Data Augmentation Queries Using Large Language Models.
Proceedings of the Joint Proceedings of Workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28, 2023

VLSlice: Interactive Vision-and-Language Slice Discovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Navigating to Objects Specified by Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Behavioral Analysis of Vision-and-Language Navigation Agents.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Iterative Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Instance-Specific Image Goal Navigation: Training Embodied Agents to Find Object Instances.
CoRR, 2022

Retrospectives on the Embodied AI Workshop.
CoRR, 2022

PROMPT: Learning Dynamic Resource Allocation Policies for Edge-Network Applications.
CoRR, 2022

Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Piecewise-constant Neural ODEs.
CoRR, 2021

Deep Convolution for Irregularly Sampled Temporal Point Clouds.
CoRR, 2021

Auxiliary Tasks for Efficient Learning of Point-Goal Navigation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs.
Proceedings of the 9th International Conference on Learning Representations, 2021

THDA: Treasure Hunt Data Augmentation for Semantic Navigation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Waypoint Models for Instruction-guided Navigation in Continuous Environments.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Improving Multilingual Translation by Representation and Gradient Regularization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?
IEEE Robotics Autom. Lett., 2020

Semantic MapNet: Building Allocentric SemanticMaps and Representations from Egocentric Views.
CoRR, 2020

Language-Conditioned Imitation Learning for Robot Manipulation Tasks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames.
Proceedings of the 8th International Conference on Learning Representations, 2020

On the Sub-Layer Functionalities of Transformer Decoder.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Where Are You? Localization from Embodied Dialog.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web.
Proceedings of the Computer Vision - ECCV 2020, 2020

Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments.
Proceedings of the Computer Vision - ECCV 2020, 2020

12-in-1: Multi-Task Vision and Language Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents.
Proceedings of the 4th Conference on Robot Learning, 2020

Sim-to-Real Transfer for Vision-and-Language Navigation.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
Visual Dialog.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation.
CoRR, 2019

Question-Conditioned Counterfactual Image Generation for VQA.
CoRR, 2019

Decentralized Distributed PPO: Solving PointGoal Navigation.
CoRR, 2019

Emergence of Compositional Language with Deep Generational Transmission.
CoRR, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.
CoRR, 2019

EvalAI: Towards Better Evaluation Systems for AI Agents.
CoRR, 2019

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Chasing Ghosts: Instruction Following as Bayesian State Tracking.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Probabilistic Neural Symbolic Models for Interpretable Visual Question Answering.
Proceedings of the 36th International Conference on Machine Learning, 2019

Trainable Decoding of Sets of Sequences for Neural Sequence Models.
Proceedings of the 36th International Conference on Machine Learning, 2019

Counterfactual Visual Explanations.
Proceedings of the 36th International Conference on Machine Learning, 2019

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

nocaps: novel object captioning at scale.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Embodied Question Answering in Photorealistic Environments With Point Cloud Perception.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Overcoming Language Priors in Visual Question Answering with Adversarial Regularization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learn from Your Neighbor: Learning Multi-modal Mappings from Sparse Annotations.
Proceedings of the 35th International Conference on Machine Learning, 2018

Graph R-CNN for Scene Graph Generation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Choose Your Neuron: Incorporating Domain Knowledge Through Neuron-Importance.
Proceedings of the Computer Vision - ECCV 2018, 2018

Embodied Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Visual Curiosity: Learning to Ask Questions to Learn Visual Recognition.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Neural Modular Control for Embodied Question Answering.
Proceedings of the 2nd Annual Conference on Robot Learning, 2018

Diverse Beam Search for Improved Description of Complex Scenes.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Evaluating Visual Conversational Agents via Cooperative Human-AI Games.
Proceedings of the Fifth AAAI Conference on Human Computation and Crowdsourcing, 2017

The Promise of Premise: Harnessing Question Premises in Visual Question Answering.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Recognizing Landmarks in Large-Scale Social Image Collections.
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016

Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models.
CoRR, 2016

Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks.
CoRR, 2015

Predicting Geo-informative Attributes in Large-Scale Image Collections Using Convolutional Neural Networks.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Linking Past to Present: Discovering Style in Two Centuries of Architecture.
Proceedings of the 2015 IEEE International Conference on Computational Photography, 2015

2014
Estimating bedrock and surface layer boundaries and confidence intervals in ice sheet radar imagery using MCMC.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

This Hand Is My Hand: A Probabilistic Approach to Hand Disambiguation in Egocentric Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014


  Loading...