Minglun Han

Orcid: 0000-0002-5120-069X

According to our database1, Minglun Han authored at least 12 papers between 2021 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training.
CoRR, 2024

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition.
CoRR, 2024

ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
VLP: A Survey on Vision-language Pre-training.
Int. J. Autom. Comput., 2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition.
CoRR, 2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Cif-Based Collaborative Decoding for End-to-End Contextual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021


  Loading...