James Lee-Thorp

According to our database, James Lee-Thorp authored at least 9 papers between 2021 and 2024.

Bibliography

2024
Memory Augmented Language Models through Mixture of Word Experts.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
Scaling Up Models and Data with t5x and seqio.
J. Mach. Learn. Res., 2023

Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoLT5: Faster Long-Range Transformers with Conditional Computation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Scaling Up Models and Data with t5x and seqio.
CoRR, 2022

FNet: Mixing Tokens with Fourier Transforms.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
ShopTalk: A System for Conversational Faceted Search.
CoRR, 2021

