Ming Tu
This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.
Bibliography
2024
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition.
CoRR, 2024
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing.
CoRR, 2024
Proceedings of the IEEE International Conference on Communications, 2024
The Hybrid Diagnosability of Hypercube Under the rmHMM<sup>*</sup> (Hybrid rmMM<sup>*</sup>) Model.
Proceedings of the Computing and Combinatorics - 30th International Conference, 2024
2023
Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Memory Augmented Lookup Dictionary Based Language Modeling for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2020
IEEE Access, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
EURASIP J. Audio Speech Music. Process., 2019
CoRR, 2019
Towards adversarial learning of speaker-invariant representation for speech emotion recognition.
CoRR, 2019
Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Simulating Dysarthric Speech for Training Data Augmentation in Clinical Speech Applications.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Neurocomputing, 2017
Interpretable Objective Assessment of Dysarthric Speech Based on Deep Neural Networks.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016
Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Models for objective evaluation of dysarthric speech from data annotated by multiple listeners.
Proceedings of the 50th Asilomar Conference on Signals, Systems and Computers, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Proceedings of the 49th Asilomar Conference on Signals, Systems and Computers, 2015
2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2012
OpenCDS ePHR: an Open-Source, Standards-Based Decision Support Platform for Electronic Public Health Reporting.
Proceedings of the AMIA 2012, 2012