Wei Zhou

Orcid: 0009-0006-3754-8872

Affiliations:
  • RWTH Aachen University, Computer Science Department, Human Language Technology and Pattern Recognition,, Germany
  • AppTek GmbH, Aachen, Germany


According to our database1, Wei Zhou authored at least 20 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition.
CoRR, 2024

On the Relation Between Internal Language Model and Sequence Discriminative Training for Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing and Adversarial: Improve ASR with Speaker Labels.
Proceedings of the IEEE International Conference on Acoustics, 2023

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

Investigating The Effect of Language Models in Sequence Discriminative Training For Neural Transducers.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Monotonic Segmental Attention for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Efficient Training of Neural Transducer for Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

On Language Model Integration for RNN Transducer Based Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Prediction of Listener Perception of Argumentative Speech in a Crowdsourced Dataset Using (Psycho-)Linguistic and Fluency Features.
CoRR, 2021

Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Phoneme Based Neural Transducer for Large Vocabulary Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Investigations on Phoneme-Based End-To-End Speech Recognition.
CoRR, 2020

Robust Beam Search for Encoder-Decoder Attention Based Speech Recognition Without Length Bias.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Full-Sum Decoding for Hybrid Hmm Based Speech Recognition Using LSTM Language Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

The Rwth Asr System for Ted-Lium Release 2: Improving Hybrid Hmm With Specaugment.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring.
CoRR, 2019


  Loading...