Nathan Godey

According to our database1, Nathan Godey authored at least 7 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck.
CoRR, 2024

Headless Language Models: Learning without Predicting with Contrastive Weight Tying.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Anisotropy Is Inherent to Self-Attention in Transformers.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

On the Scaling Laws of Geographical Representation in Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Is Anisotropy Inherent to Transformers?
CoRR, 2023

2022
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling.
CoRR, 2022

MANTa: Efficient Gradient-Based Tokenization for End-to-End Robust Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022


  Loading...