Zengwei Yao

Orcid: 0000-0002-2331-2387

According to our database1, Zengwei Yao authored at least 16 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CR-CTC: Consistency regularization on CTC for improved speech recognition.
CoRR, 2024

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.
CoRR, 2024

Zipformer: A faster and better encoder for automatic speech recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PromptASR for Contextualized ASR with Controllable Style.
Proceedings of the IEEE International Conference on Acoustics, 2024

Libriheavy: A 50, 000 Hours ASR Corpus with Punctuation Casing and Context.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Delay-penalized CTC Implemented Based on Finite State Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Blank-regularized CTC for Frame Skipping in Neural Transducer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Delay-Penalized Transducer for Low-Latency Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fast and Parallel Decoding for Transducer.
Proceedings of the IEEE International Conference on Acoustics, 2023

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Stepwise-Refining Speech Separation Network via Fine-Grained Encoding in High-Order Latent Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Semantic-Aware Local-Global Vision Transformer.
CoRR, 2022

Pruned RNN-T for fast, memory-efficient ASR training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2020
Speech emotion recognition using fusion of three multi-task learning-based classifiers: HSF-DNN, MS-CNN and LLD-RNN.
Speech Commun., 2020

Fingerprint restoration using cubic Bezier curve.
BMC Bioinform., 2020

2019
Convolutional Two-Stream Network Using Multi-Facial Feature Fusion for Driver Fatigue Detection.
Future Internet, 2019


  Loading...