2022
Efficient Large Scale Language Modeling with Mixtures of Experts.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Efficient Large Scale Language Modeling with Mixtures of Experts.
CoRR, 2021