Kshitiz Kumar

Jian Wu

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Sequence-Level Confidence Classifier for ASR Utterance Accuracy and Application to Acoustic Models.

[BibT_eX]

[DOI]

Amber Afshan

Jian Wu

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Dialect Speech Recognition in English Using Attention on Ensemble of Experts.

[BibT_eX]

[DOI]

Amit Das

Jian Wu

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Fast and Slow Acoustic Model.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning Approaches for Streaming End-to-End Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Static and Dynamic State Predictions for Acoustic Model Combination.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier.

[BibT_eX]

[DOI]

Tasos Anastasakos

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Speaker Adaptation for End-to-End CTC Models.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

A Comparative Study of Spatial Speech Separation Techniques to Improve Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

2017

Extended low-rank plus diagonal adaptation for deep and recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Investigations on speaker adaptation of LSTM RNN models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.

[BibT_eX]

[DOI]

Chaojun Liu

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Delta-melspectra features for noise robustness to DNN-based ASR systems.

[BibT_eX]

[DOI]

Chaojun Liu

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Normalization of ASR confidence classifier scores via confidence mapping.

[BibT_eX]

[DOI]

Chaojun Liu

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Predicting speech recognition confidence using deep learning with word identity and score features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2011

A Spectro-Temporal Framework for Compensation of Reverberation for Speech Recognition.

[BibT_eX]

[DOI]

PhD thesis, 2011

Gammatone sub-band magnitude-domain dereverberation for ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

An iterative least-squares technique for dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Delta-spectral cepstral coefficients for robust speech recognition.

[BibT_eX]

[DOI]

Chanwoo Kim

Proceedings of the IEEE International Conference on Acoustics, 2011

Binaural sound source separation motivated by auditory processing.

[BibT_eX]

[DOI]

Chanwoo Kim

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Maximum-likelihood-based cepstral inverse filtering for blind speech dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Robust audio-visual speech synchrony detection by generalized bimodal linear prediction.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Audio-visual speech synchronization detection using a bimodal linear prediction model.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

Robust speech recognition using a Small Power Boosting algorithm.

[BibT_eX]

[DOI]

Chanwoo Kim

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Environment-invariant compensation for reverberation using linear post-filtering for minimum distortion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Noise robust speaker identification using Bhattacharyya distance in adapted Gaussian models space.

[BibT_eX]

[DOI]

Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007

Profile View Lip Reading.

[BibT_eX]

[DOI]

Tsuhan Chen