LocATe: End-to-end Localization of Actions in 3D with Transformers.
CoRR, 2022
Active Domain Adaptation via Clustering Uncertainty-weighted Embeddings.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
How much coffee was consumed during EMNLP 2019? Fermi Problems: A New Reasoning Challenge for AI.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
BABEL: Bodies, Action and Behavior With English Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Towards Natural Human-AI Interactions in Vision and Language.
PhD thesis, 2020
A Computational Model of Early Word Learning from the Infant's Point of View.
Proceedings of the 42th Annual Meeting of the Cognitive Science Society, 2020
Punny Captions: Witty Wordplay in Image Descriptions.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Do explanations make VQA models more predictable to a human?
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
It Takes Two to Tango: Towards Theory of AI's Mind.
CoRR, 2017
Evaluating Visual Conversational Agents via Cooperative Human-AI Games.
Proceedings of the Fifth AAAI Conference on Human Computation and Crowdsourcing, 2017
Sort Story: Sorting Jumbled Images and Captions into Stories.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
We are Humor Beings: Understanding and Predicting Visual Humor.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016