2024
Findings of the IWSLT 2024 Evaluation Campaign.
CoRR, 2024

SSR: Alignment-Aware Modality Connector for Speech Language Models.
CoRR, 2024

MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model.
CoRR, 2024

MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

Efficient Monotonic Multihead Attention.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

Findings of the IWSLT 2023 Evaluation Campaign.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Findings of the IWSLT 2022 Evaluation Campaign.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Direct simultaneous speech to speech translation.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

Findings of the IWSLT 2021 Evaluation Campaign.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Streaming Simultaneous Speech Translation with Augmented Memory Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
FINDINGS OF THE IWSLT 2020 EVALUATION CAMPAIGN.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Self-Training for End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Monotonic Multihead Attention.
Proceedings of the 8th International Conference on Learning Representations, 2020

SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

2019
Leveraging Out-of-Task Data for End-to-End Automatic Speech Translation.
CoRR, 2019

Robust Document Representations for Cross-Lingual Information Retrieval in Low-Resource Settings.
Proceedings of Machine Translation Summit XVII Volume 1: Research Track, 2019

Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Broad-Coverage Semantic Parsing as Transduction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AMR Parsing as Sequence-to-Graph Transduction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
NIST TAC SM-KBP 2018 System Description: JHU/UR Pipeline.
Proceedings of the 2018 Text Analysis Conference, 2018

Cross-lingual Decompositional Semantic Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Analysis of Source Context Dependency in Neural Machine Translation.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

2015
Beyond Quantum Computation Based on Classical Entanglement.
CoRR, 2015