Alexandre Muzio

According to our database1, Alexandre Muzio authored at least 11 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SEER-MoE: Sparse Expert Efficiency through Regularization for Mixture-of-Experts.
CoRR, 2024

2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers.
CoRR, 2022

Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers.
Proceedings of the International Conference on Machine Learning, 2022

2021
Scalable and Efficient MoE Training for Multitask Multilingual Models.
CoRR, 2021

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders.
CoRR, 2021

Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Improving Multilingual Translation by Representation and Gradient Regularization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Discovering Representation Sprachbund For Multilingual Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders.
CoRR, 2020

Toward ML-centric cloud platforms.
Commun. ACM, 2020

2017
Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms.
Proceedings of the 26th Symposium on Operating Systems Principles, 2017


  Loading...