Adina Williams

Orcid: 0000-0001-5281-3343

According to our database1, Adina Williams authored at least 70 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity.
Trans. Mach. Learn. Res., 2024

Improving Model Evaluation using SMART Filtering of Benchmark Datasets.
CoRR, 2024

Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality.
CoRR, 2024

Changing Answer Order Can Decrease MMLU Accuracy.
CoRR, 2024

Decomposed evaluations of geographic disparities in text-to-image models.
CoRR, 2024

The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More.
CoRR, 2024

The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models.
CoRR, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

[Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus.
CoRR, 2024

Improving Text-to-Image Consistency via Automatic Prompt Optimization.
CoRR, 2024

Compositional learning of functions in humans and machines.
CoRR, 2024

Towards Geographic Inclusion in the Evaluation of Text-to-Image Models.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective.
CoRR, 2023

Llama 2: Open Foundation and Fine-Tuned Chat Models.
CoRR, 2023

Weisfeiler and Lehman Go Measurement Modeling: Probing the Validity of the WL Test.
CoRR, 2023

Call for Papers - The BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus.
CoRR, 2023

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023


Robustness of Named-Entity Replacements for In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ROBBIE: Robust Bias Evaluation of Large Generative Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

The Validity of Evaluation Results: Assessing Concurrence Across Compositionality Benchmarks.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

Language model acceptability judgements are not always robust to context.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Latent-Variable Model for Intrinsic Probing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning Transductions to Test Systematic Compositionality.
CoRR, 2022

DataPerf: Benchmarks for Data-Centric AI Development.
CoRR, 2022

"I'm sorry to hear that": finding bias in language models with a holistic descriptor dataset.
CoRR, 2022

On the Machine Learning of Ethical Judgments from Natural Language.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The Curious Case of Absolute Position Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Perturbation Augmentation for Fairer NLP.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Benchmarking Compositionality with Formal Languages.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Evaluating locality in NMT models.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Analyzing Dynamic Adversarial Training Data in the Limit.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Investigating Failures of Automatic Translationin the Case of Unambiguous Gender.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
On the Relationships Between the Grammatical Genders of Inanimate Nouns and Their Co-Occurring Adjectives and Verbs.
Trans. Assoc. Comput. Linguistics, 2021

A Word on Machine Ethics: A Response to Jiang et al. (2021).
CoRR, 2021

Hi, my name is Martha: Using names to measure and mitigate bias in generative dialogue models.
CoRR, 2021

Investigating Failures of Automatic Translation in the Case of Unambiguous Gender.
CoRR, 2021

Sometimes We Want Translationese.
CoRR, 2021

Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynabench: Rethinking Benchmarking in NLP.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Sometimes We Want Ungrammatical Translations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Generalising to German Plural Noun Classes, from the Perspective of a Recurrent Neural Network.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

To what extent do human explanations of model behavior align with actual model behavior?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

UnNatural Language Inference.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
ANLIzing the Adversarial Natural Language Inference Dataset.
CoRR, 2020


Pareto Probing: Trading Off Accuracy for Complexity.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Intrinsic Probing through Dimension Selection.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Dimensional Gender Bias Classification.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Predicting Declension Class from Form and Meaning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Information-Theoretic Probing for Linguistic Structure.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Adversarial NLI: A New Benchmark for Natural Language Understanding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Tale of a Probe and a Parser.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
On the Idiosyncrasies of the Mandarin Chinese Classifier System.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Quantifying the Semantic Core of Gender Systems.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
Do latent tree learning models identify meaningful structure in sentences?
Trans. Assoc. Comput. Linguistics, 2018

Verb Argument Structure Alternations in Word and Sentence Embeddings.
CoRR, 2018

A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

XNLI: Evaluating Cross-lingual Sentence Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Learning to parse from a semantic objective: It works. Is it syntax?
CoRR, 2017

The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations.
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017


  Loading...