Zengwei Yao

Orcid: 0000-0002-2331-2387

According to our database¹, Zengwei Yao authored at least 17 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

TransMLA: Multi-Head Latent Attention Is All You Need.

[BibT_eX]

[DOI]

Fanxu Meng

Zengwei Yao

Muhan Zhang

CoRR, February, 2025

2024

CR-CTC: Consistency regularization on CTC for improved speech recognition.

[BibT_eX]

[DOI]

CoRR, 2024

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.

[BibT_eX]

[DOI]

CoRR, 2024

Zipformer: A faster and better encoder for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PromptASR for Contextualized ASR with Controllable Style.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Libriheavy: A 50, 000 Hours ASR Corpus with Punctuation Casing and Context.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Delay-penalized CTC Implemented Based on Finite State Transducer.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Blank-regularized CTC for Frame Skipping in Neural Transducer.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Delay-Penalized Transducer for Low-Latency Streaming ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Fast and Parallel Decoding for Transducer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-Order Latent Domain.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Semantic-Aware Local-Global Vision Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Pruned RNN-T for fast, memory-efficient ASR training.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2020

Speech emotion recognition using fusion of three multi-task learning-based classifiers: HSF-DNN, MS-CNN and LLD-RNN.

[BibT_eX]

[DOI]

Speech Commun., 2020

Fingerprint restoration using cubic Bezier curve.

[BibT_eX]

[DOI]

BMC Bioinform., 2020

2019

Convolutional Two-Stream Network Using Multi-Facial Feature Fusion for Driver Fatigue Detection.

[BibT_eX]

[DOI]

Future Internet, 2019

Zengwei Yao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...