Xutai Ma

According to our database, Xutai Ma authored at least 33 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
SSR: Alignment-Aware Modality Connector for Speech Language Models.
CoRR, 2024

MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model.
CoRR, 2024

Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

Efficient Monotonic Multihead Attention.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023


Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Direct Speech-to-Speech Translation With Discrete Units.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Direct simultaneous speech to speech translation.
CoRR, 2021

Incremental Speech Synthesis For Speech-To-Speech Translation.
CoRR, 2021

Direct speech-to-speech translation with discrete units.
CoRR, 2021

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Streaming Simultaneous Speech Translation with Augmented Memory Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Self-Training for End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fairseq S2T: Fast Speech-to-Text Modeling with Fairseq.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Monotonic Multihead Attention.
Proceedings of the 8th International Conference on Learning Representations, 2020

SIMULEVAL: An Evaluation Toolkit for Simultaneous Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

2019
Leveraging Out-of-Task Data for End-to-End Automatic Speech Translation.
CoRR, 2019

Robust Document Representations for Cross-Lingual Information Retrieval in Low-Resource Settings.
Proceedings of Machine Translation Summit XVII Volume 1: Research Track, 2019

Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Broad-Coverage Semantic Parsing as Transduction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AMR Parsing as Sequence-to-Graph Transduction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
NIST TAC SM-KBP 2018 System Description: JHU/UR Pipeline.
Proceedings of the 2018 Text Analysis Conference, 2018

Cross-lingual Decompositional Semantic Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Analysis of Source Context Dependency in Neural Machine Translation.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

2015
Beyond Quantum Computation Based on Classical Entanglement.
CoRR, 2015