Minglun Han
Orcid: 0000-0002-5120-069X
According to our database1,
Minglun Han
authored at least 12 papers
between 2021 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition.
CoRR, 2024
ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Proceedings of the IEEE International Conference on Acoustics, 2021