Douwe Kiela

Affiliations:
  • Facebook


According to our database1, Douwe Kiela authored at least 119 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment.
CoRR, 2024

Lynx: An Open Source Hallucination Evaluation Model.
CoRR, 2024

Generative Representational Instruction Tuning.
CoRR, 2024

KTO: Model Alignment as Prospect Theoretic Optimization.
CoRR, 2024

Model Alignment as Prospect Theoretic Optimization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Nearest Neighbor Normalization Improves Multimodal Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Anchor Points: Benchmarking Models with Much Fewer Examples.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

I am a Strange Dataset: Metalinguistic Tests for Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
FinanceBench: A New Benchmark for Financial Question Answering.
CoRR, 2023

OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
CoRR, 2023

Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language.
CoRR, 2023


OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Investigating Multi-source Active Learning for Natural Language Inference.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

2022
Measuring Data.
CoRR, 2022

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements.
CoRR, 2022

DataPerf: Benchmarks for Data-Centric AI Development.
CoRR, 2022

Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Grounding, Meaning and Foundation Models: Adventures in Multimodal Machine Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Perturbation Augmentation for Fairer NLP.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FLAVA: A Foundational Language And Vision Alignment Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Analyzing Dynamic Adversarial Training Data in the Limit.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021
Human-Adversarial Visual Question Answering.
CoRR, 2021

Quasi-Equivalence Discovery for Zero-Shot Emergent Communication.
CoRR, 2021

Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Human-Adversarial Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

True Few-Shot Learning with Language Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynabench: Rethinking Benchmarking in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Rissanen Data Analysis: Examining Dataset Characteristics via Description Length.
Proceedings of the 38th International Conference on Machine Learning, 2021

Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval.
Proceedings of the 9th International Conference on Learning Representations, 2021

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

What's Hidden in a One-layer Randomly Weighted Transformer?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Cross-Modal Retrieval Augmentation for Multi-Modal Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Gradient-based Adversarial Attacks against Text Transformers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Retrieval Augmentation Reduces Hallucination in Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

To what extent do human explanations of model behavior align with actual model behavior?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Reservoir Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

DynaSent: A Dynamic Benchmark for Sentiment Analysis.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Reservoir Transformer.
CoRR, 2020

Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations.
CoRR, 2020

ANLIzing the Adversarial Natural Language Inference Dataset.
CoRR, 2020

I love your chain mail! Making knights smile in a fantasy game world: Open-domain goal-oriented dialogue agents.
CoRR, 2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020


Learning Optimal Representations with the Decodable Information Bottleneck.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

On the interaction between supervision and self-play in emergent communication.
Proceedings of the 8th International Conference on Learning Representations, 2020

Unsupervised Question Decomposition for Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Dimensional Gender Bias Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Adversarial NLI: A New Benchmark for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Generating Interactive Worlds with Text.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Generalized Inner Loop Meta-Learning.
CoRR, 2019

Why Build an Assistant in Minecraft?
CoRR, 2019

The Second Conversational Intelligence Challenge (ConvAI2).
CoRR, 2019

Hyperbolic Graph Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Supervised Multimodal Bitransformers for Classifying Images and Text.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

What makes a good conversation? How controllable attributes affect human judgments.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

No Training Required: Exploring Random Encoders for Sentence Classification.
Proceedings of the 7th International Conference on Learning Representations, 2019

Learning to Speak and Act in a Fantasy Text Adventure Game.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Finding Generalizable Evidence by Learning to Convince Q&A Models.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Countering Language Drift via Visual Grounding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Seeded self-play for language learning.
Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge, 2019

Emergent Linguistic Phenomena in Multi-Agent Communication Games.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Inferring Concept Hierarchies from Text Corpora via Hyperbolic Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Analysis of Joint Multilingual Sentence Representations and Semantic K-Nearest Neighbor Graphs.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Talk the Walk: Navigating New York City through Grounded Dialogue.
CoRR, 2018

Context-Attentive Embeddings for Improved Sentence Representations.
CoRR, 2018

Learning Visually Grounded Sentence Representations.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

SentEval: An Evaluation Toolkit for Universal Sentence Representations.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Learning Continuous Hierarchies in the Lorentz Model of Hyperbolic Geometry.
Proceedings of the 35th International Conference on Machine Learning, 2018

Mastering the Dungeon: Grounded Language Learning by Mechanical Turker Descent.
Proceedings of the 6th International Conference on Learning Representations, 2018

Emergent Translation in Multi-Agent Communication.
Proceedings of the 6th International Conference on Learning Representations, 2018

Emergent Communication in a Multi-Modal, Multi-Step Referential Game.
Proceedings of the 6th International Conference on Learning Representations, 2018

Dynamic Meta-Embeddings for Improved Sentence Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Jump to better conclusions: SCAN both left and right.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Personalizing Dialogue Agents: I have a dog, do you have pets too?
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Code-Switched Named Entity Recognition with Embedding Attention.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Efficient Large-Scale Multi-Modal Classification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Visually Grounded and Textual Semantic Models Differentially Decode Brain Activity Associated with Concrete and Abstract Nouns.
Trans. Assoc. Comput. Linguistics, 2017

Learning Neural Audio Embeddings for Grounding Semantics in Auditory Perception.
J. Artif. Intell. Res., 2017

Emergent Language in a Multi-Modal, Multi-Step Referential Game.
CoRR, 2017

HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment.
Comput. Linguistics, 2017

Poincaré Embeddings for Learning Hierarchical Representations.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Grasping the Finer Point: A Supervised Similarity Network for Metaphor Detection.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Supervised Learning of Universal Sentence Representations from Natural Language Inference Data.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Learning to Negate Adjectives with Bilinear Models.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Evaluation by Association: A Systematic Study of Quantitative Word Association Evaluation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Automatically Generating Rhythmic Verse with Neural Networks.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Virtual Embodiment: A Scalable Long-Term Strategy for Artificial Intelligence Research.
CoRR, 2016

Black Holes and White Rabbits: Metaphor Identification with Visual Features.
Proceedings of the NAACL HLT 2016, 2016

Vision and Feature Norms: Improving automatic feature norm learning through cross-modal maps.
Proceedings of the NAACL HLT 2016, 2016

Comparing Data Sources and Architectures for Deep Visual Representation Learning in Semantics.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Robust Text Classification for Sparsely Labelled Data Using Multi-level Embeddings.
Proceedings of the COLING 2016, 2016

Multi-Modal Representations for Improved Bilingual Lexicon Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

MMFeat: A Toolkit for Extracting Multi-Modal Features.
Proceedings of ACL-2016 System Demonstrations, Berlin, Germany, August 7-12, 2016, 2016

2015
Unsupervised discovery of information structure in biomedical documents.
Bioinform., 2015

Visual Bilingual Lexicon Induction with Transferred ConvNet Features.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Specializing Word Embeddings for Similarity or Relatedness.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Multi- and Cross-Modal Semantics Beyond Vision: Grounding in Auditory Perception.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Exploiting Image Generality for Lexical Entailment Detection.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Grounding Semantics in Olfactory Perception.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Learning Image Embeddings using Convolutional Neural Networks for Improved Multi-Modal Semantics.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Improving Multi-Modal Representations Using Image Dispersion: Why Less is Sometimes More.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

A Systematic Study of Semantic Vector Space Model Parameters.
Proceedings of the 2nd Workshop on Continuous Vector Space Models and their Compositionality, 2014

2013
UCAM-CORE: Incorporating structured distributional similarity into STS.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013

Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Concreteness and Corpora: A Theoretical and Practical Study.
Proceedings of the Fourth Annual Workshop on Cognitive Modeling and Computational Linguistics, 2013


  Loading...