2022
Efficient Large Scale Language Modeling with Mixtures of Experts.
[DOI]
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giridharan Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeffrey Wang, Luke Zettlemoyer, Mona T. Diab, Zornitsa Kozareva, Veselin Stoyanov
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Efficient Large Scale Language Modeling with Mixtures of Experts.
[DOI]
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona T. Diab, Zornitsa Kozareva, Ves Stoyanov
CoRR, 2021