Catherine Arnett

According to our database, Catherine Arnett authored at least 10 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Why do language models perform worse for morphologically complex languages?

Catherine Arnett

Benjamin Bergen

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Toxicity of the Commons: Curating Open-Source Pre-Training Data.

CoRR, 2024

Goldfish: Monolingual Language Models for 350 Languages.

CoRR, 2024

Revenge of the Fallen? Recurrent Models Match Transformers at Predicting Human Language Comprehension Metrics.

James A. Michaelov

Catherine Arnett

Benjamin K. Bergen

CoRR, 2024

Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement.

CoRR, 2024

A Bit of a Problem: Measurement Disparities in Dataset Sizes Across Languages.

Catherine Arnett

Tyler A. Chang

Benjamin K. Bergen

CoRR, 2024

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

Crosslingual Structural Priming and the Pre-Training Dynamics of Bilingual Language Models.

CoRR, 2023

Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
