Minglun Han

Orcid: 0000-0002-5120-069X

According to our database¹, Minglun Han authored at least 12 papers between 2021 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training.

[BibT_eX]

[DOI]

CoRR, 2024

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2024

ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

VLP: A Survey on Vision-language Pre-training.

[BibT_eX]

[DOI]

Int. J. Autom. Comput., 2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Cif-Based Collaborative Decoding for End-to-End Contextual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Minglun Han

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...