MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model.
CoRR, 2024
Toward Efficient Inference for Mixture of Experts.
Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Efficient Monotonic Multihead Attention.
CoRR, 2023
Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference.
CoRR, 2023
Efficiently Upgrading Multilingual Machine Translation Models to Support More Languages.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation.
Findings of the Association for Computational Linguistics: ACL 2023, 2023
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
stopes - Modular Machine Translation Pipelines.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Playing Codenames with Language Graphs and Word Embeddings.
J. Artif. Intell. Res., 2021