Yifan Gong

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Diarisation Using Location Tracking with Agglomerative Clustering.

[BibT_eX]

[DOI]

Igor Abramovski

Xiong Xiao

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Have Best of Both Worlds: Two-Pass Hybrid and E2E Cascading Framework for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Endpoint Detection for Streaming End-to-End Multi-Talker ASR.

[BibT_eX]

[DOI]

Liang Lu

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Speaker Separation Using Speaker Inventories and Estimated Speech.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Streaming End-to-End Multi-Talker Speech Recognition.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2021

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Sarangarajan Parthasarathy

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Streaming Multi-Talker Speech Recognition with Joint Speaker Identification.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Hidden Markov Model Diarisation with Speaker Location Information.

[BibT_eX]

[DOI]

Xiong Xiao

Proceedings of the IEEE International Conference on Acoustics, 2021

Ensemble Combination between Different Time Segmentations.

[BibT_eX]

[DOI]

Dimitrios Dimitriadis

Proceedings of the IEEE International Conference on Acoustics, 2021

Sequence-Level Self-Teaching Regularization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition.

[BibT_eX]

[DOI]

Sarangarajan Parthasarathy

Naoyuki Kanda

Yashesh Gaur

Proceedings of the IEEE International Conference on Acoustics, 2021

On Addressing Practical Challenges for RNN-Transducer.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Combination of End-to-End and Hybrid Models for Speech Recognition.

[BibT_eX]

[DOI]

Sarangarajan Parthasarathy

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Exploring Transformers for Large-Scale Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

L-Vector: Neural Label Embedding for Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model.

[BibT_eX]

[DOI]

Rui Zhao

Eric Sun

Amit Das

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exploring Pre-Training with Alignments for RNN Transducer Based End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Acoustic Model Adaptation for Presentation Transcription and Intelligent Meeting Assistant Systems.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Advancing Acoustic-to-Word CTC Model With Attention and Mixed-Units.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch.

[BibT_eX]

[DOI]

CoRR, 2019

Layer Trajectory BLSTM.

[BibT_eX]

[DOI]

Eric Sun

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Adaptation for Attention-Based End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Self-Teaching Networks.

[BibT_eX]

[DOI]

Liang Lu

Eric Sun

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic-to-Phrase Models for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Encrypted Speech Recognition Using Deep Polynomial Networks.

[BibT_eX]

[DOI]

Shi-Xiong Zhang

Dong Yu

Proceedings of the IEEE International Conference on Acoustics, 2019

Single-channel Speech Extraction Using Speaker Inventory and Attention Network.

[BibT_eX]

[DOI]

Dimitrios Dimitriadis

Jasha Droppo

Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Conditional Teacher-student Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Attentive Adversarial Learning for Domain-invariant Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Speaker Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Towards Code-switching ASR for End-to-end CTC Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Layer Trajectory LSTM with Future Context Frames.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Static and Dynamic State Predictions for Acoustic Model Combination.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier.

[BibT_eX]

[DOI]

Tasos Anastasakos

Proceedings of the IEEE International Conference on Acoustics, 2019

Universal Acoustic Modeling Using Neural Mixture Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

CNN with Phonetic Attention for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Advances in Online Audio-Visual Meeting Transcription.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Character-Aware Attention-Based End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Improving RNN Transducer Modeling for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Speaker-Invariant Training via Adversarial Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Speaker Adaptation for End-to-End CTC Models.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Exploring Layer Trajectory LSTM with Depth Processing Units and Attention.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Adversarial Feature-Mapping for Speech Enhancement.

[BibT_eX]

[DOI]

Biing-Hwang Fred Juang

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Cycle-Consistent Speech Enhancement.

[BibT_eX]

[DOI]

Biing-Hwang Fred Juang

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Layer Trajectory LSTM.

[BibT_eX]

[DOI]

Changliang Liu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Domain and Speaker Adaptation for Cortana Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker-Invariant Training Via Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Developing Far-Field Speaker System Via Teacher-Student Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Acoustic-to-Word CTC Model.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Connectionist Temporal Classification with Attention Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Large-Scale Domain Adaptation via Teacher-Student Learning.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Don't Count on ASR to Transcribe for You: Breaking Bias with Two Crowds.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Extended low-rank plus diagonal adaptation for deep and recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Unsupervised adaptation with domain separation networks for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Acoustic-to-word model without OOV.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Cracking the cocktail party problem by multi-beam deep attractor network.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

End-to-End attention based text-dependent speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Semi-Supervised Training in Deep Learning Acoustic Model.

[BibT_eX]

[DOI]

Yongqiang Wang

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Low-rank plus diagonal adaptation for deep neural networks.

[BibT_eX]

[DOI]

Yong Zhao

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Recurrent support vector machines for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Geo-location dependent deep neural network acoustic model for speech recognition.

[BibT_eX]

[DOI]

Guoli Ye

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Simplifying long short-term memory acoustic models for fast training and decoding.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Investigations on speaker adaptation of LSTM RNN models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring multidimensional lstms for large vocabulary ASR.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

SVD-based universal DNN modeling for multiple scenarios.

[BibT_eX]

[DOI]

Changliang Liu

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Delta-melspectra features for noise robustness to DNN-based ASR systems.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Regularized sequence-level deep neural network model adaptation.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep neural support vector machines for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Small-footprint high-performance deep neural network-based speech recognition using split-VQ.

[BibT_eX]

[DOI]

Yongqiang Wang

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Estimating confidence scores on ASR results using recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An analysis of convolutional neural networks for speech recognition.

[BibT_eX]

[DOI]

Jui-Ting Huang

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

LSTM time and frequency recurrence for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

An Overview of Noise-Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.

[BibT_eX]

[DOI]

Neurocomputing, 2014

Variable-activation and variable-input deep neural network for robust speech recognition.

[BibT_eX]

[DOI]

Rui Zhao

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Variable-component deep neural network for robust speech recognition.

[BibT_eX]

[DOI]

Rui Zhao

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Learning small-size DNN with output-distribution-based criteria.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Normalization of ASR confidence classifier scores via confidence mapping.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Factorized adaptation for deep neural network.

[BibT_eX]

[DOI]

Jui-Ting Huang

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Restructuring of deep neural network acoustic models with singular value decomposition.

[BibT_eX]

[DOI]

Jian Xue

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Predicting speech recognition confidence using deep learning with word identity and score features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Adaptation of context-dependent deep neural networks for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

A Feature Space Transformation Method for Personalization using Generalized I-Vector Clustering.

[BibT_eX]

[DOI]

Kaisheng Yao

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Efficient VTS Adaptation Using Jacobian Approximation.

[BibT_eX]

[DOI]

Michael L. Seltzer

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improvements to VTS feature enhancement.

[BibT_eX]

[DOI]

Michael L. Seltzer

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2010

Unscented transform with online distortion estimation for HMM adaptation.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2009

Cross-lingual speech recognition under runtime resource constraints.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

A study on multilingual acoustic modeling for large vocabulary ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discriminative training of variable-parameter HMMs for noise robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2005

2004

Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.

[BibT_eX]

[DOI]

Alexis Bernard

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Model-space compensation of microphone and noise for speaker-independent speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Noise-dependent Gaussian mixture classifiers for robust rejection decision.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2002

The effects of speech compression on speech recognition and text-to-speech synthesis.

[BibT_eX]

[DOI]

Yeshwant K. Muthusamy

Roshan Gupta

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Experiments on speaker-independent voice command recognition using in-vehicle hands free speech.

[BibT_eX]

[DOI]

Lorin Netsch

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A comparative study of approximations for parallel model combination of static and dynamic parameters.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2000

HMM adaptation and microphone array processing for distant speech recognition.

[BibT_eX]

[DOI]

Jim Kleban

Proceedings of the IEEE International Conference on Acoustics, 2000

Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP.

[BibT_eX]

[DOI]

Yu-Hung Kao

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

A minimum cross-entropy approach to hidden Markov model adaptation.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 1999

Speaker-dependent name dialing in a car environment with out-of-vocabulary rejection.

[BibT_eX]

[DOI]

Coimbatore S. Ramalingam

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Speech-enabled information retrieval in the automobile environment.

[BibT_eX]

[DOI]

Yeshwant K. Muthusamy

Rajeev Agarwal

Vishu Viswanathan

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Transforming HMMs for speaker-independent hands-free speech recognition in the car.

[BibT_eX]

[DOI]

John J. Godfrey

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1998

Assessing the importance of the segmentation probability in segment-based speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 1998

Environment normalization training and environment adaptation using mixture stochastic trajectory model.

[BibT_eX]

[DOI]

Speech Commun., 1998

1997

Stochastic trajectory modeling and sentence searching for continuous speech recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1997

Speaker normalization training for mixture stochastic trajectory model.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Source normalization training for HMM applied to noisy telephone speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

An acoustic subword unit approach to non-linguistic speech feature identification.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Correlation based predictive adaptation of hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

The importance of segmentation probability in segment based speech recognizers.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A unified maximum likelihood approach to acoustic mismatch compensation: application to noisy Lombard speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Comparative experiments of several adaptation approaches to noisy speech recognition using stochastic trajectory models.

[BibT_eX]

[DOI]

Speech Commun., 1996

Estimation of mixtures of stochastic dynamic trajectories: application to continuous speech recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1996

A study on continuous Chinese speech recognition based on stochastic trajectory models.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Improvement in n-best search for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Stochastic trajectory model with state-mixture for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Modelling long term variability information in mixture stochastic trajectory framework.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Probabilistic mapping networks for speaker recognition.

[BibT_eX]

[DOI]

Haizhou Li

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Speech recognition in noisy environments: A survey.

[BibT_eX]

[DOI]

Speech Commun., 1995

Noise adaptation using linear regression for continuous noisy speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speaker recognition with temporal transition models.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

On MMI learning of Gaussian mixture for speaker models.

[BibT_eX]

[DOI]

Haizhou Li

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory models for speech recognition: an extension to modelling time correlation.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory modeling for recognition of unconstrained handwritten words.

[BibT_eX]

[DOI]

George Saon

Abdel Belaïd

Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

1994

Off-line Handwriting Recognition by Statistical Correlation.

[BibT_eX]

[DOI]

George Saon

Abdel Belaïd

Proceedings of IAPR Workshop on Machine Vision Applications, 1994

A comparison of three noisy speech recognition approaches.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Nonlinear time alignment in stochastic trajectory models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Noise independent speech recognition for a variety of noise types.

[BibT_eX]

[DOI]

William C. Treurniet

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Stochastic trajectory modeling for speech recognition.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Plausibility functions in continuous speech recognition: The VINICS system.

[BibT_eX]

[DOI]

Speech Commun., 1993

A Bayesian approach to phone duration adaptation for lombard speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Use of explicit context-dependent phonemic model in continuous speech recognition.

[BibT_eX]

[DOI]

Feriel Mouria

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Duration of phones as function of utterance length and its use in automatic speech recognition.

[BibT_eX]

[DOI]

William C. Treurniet

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Iterative transformation and alignment for speech labeling.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Base transformation for environment adaptation in continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992

Minimization of speech alignment error by iterative transformation for speaker adaptation.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

DTW-based phonetic labeling using explicit phoneme duration constraints.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Hand-written text recognition based on a new formulation.

[BibT_eX]

[DOI]

Anne Boyer

Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

Nonlinear vectorial interpolation for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

Signal-to-String Conversion Based on High Likelihood Regions Using Embedded Dynamic Programming.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 1991

VINICS: a continuous speech recognizer based on a new robust formulation.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Comparing two phoneme identification methods using a continuous speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Continuous speech recognition based on high plausibility regions.

[BibT_eX]

[DOI]

Feriel Mouria

Proceedings of the 1991 International Conference on Acoustics, 1991

Non-linear vector interpolation by neural network for phoneme identification in continuous speech.

[BibT_eX]

[DOI]

Proceedings of the 1991 International Conference on Acoustics, 1991

Neural network coupled with IIR sequential adapter for phoneme recognition in continuous speech.

[BibT_eX]

[DOI]

Ying Cheng

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

A multiknowledge base system for continuous speech understanding.

[BibT_eX]

[DOI]

Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Towards a general signal interpretation system-signal-to-symbol conversion level.

[BibT_eX]

[DOI]

Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Text-independent speaker recognition by trajectory space comparison.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Parallel construction of syntactic structure for continuous speech recognition.

[BibT_eX]

[DOI]

Anne Boyer

Proceedings of the First European Conference on Speech Communication and Technology, 1989

1988

A specialist society for continuous speech understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1988

1987

Time domain harmonic matching pitch estimation using time-dependent speech modeling.

[BibT_eX]

[DOI]

IEEE Trans. Acoust. Speech Signal Process., 1987

Phoneme-based continuous speech recognition without pre-segmentation.

[BibT_eX]

[DOI]