Nathan Godey

According to our database¹, Nathan Godey authored at least 7 papers between 2022 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of five.

Bibliography

2024

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck.

Nathan Godey

Éric de la Clergerie

Benoît Sagot

CoRR, 2024

Headless Language Models: Learning without Predicting with Contrastive Weight Tying.

Nathan Godey

Éric Villemonte de la Clergerie

Benoît Sagot

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Anisotropy Is Inherent to Self-Attention in Transformers.

Nathan Godey

Éric Villemonte de la Clergerie

Benoît Sagot

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

On the Scaling Laws of Geographical Representation in Language Models.

Nathan Godey

Éric de la Clergerie

Benoît Sagot

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

2023

Is Anisotropy Inherent to Transformers?

Nathan Godey

Éric de la Clergerie

Benoît Sagot

CoRR, 2023

2022

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling.

CoRR, 2022

MANTa: Efficient Gradient-Based Tokenization for End-to-End Robust Language Modeling.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022