Xiaodong Cui

Wei Zhang

Kailash Gopalakrishnan

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Challenging the Boundaries of Speech Recognition: The MALACH Corpus.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic Model Optimization Based on Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition.

Michael Picheny

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Distributed Deep Learning Strategies for Automatic Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2019

CycleGAN Bandwidth Extension Acoustic Modeling for Automatic Speech Recognition.

David Haws

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

MeTDiff: A Novel Differential RNA Methylation Analysis for MeRIP-Seq Data.

IEEE ACM Trans. Comput. Biol. Bioinform., 2018

Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017

Dilated Recurrent Neural Networks.

Mark A. Hasegawa-Johnson

Thomas S. Huang

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

English Conversational Telephone Speech Recognition by Humans and Machines.

Dimitrios Dimitriadis

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Embedding-Based Speaker Adaptive Training of Deep Neural Networks.

George Saon

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Network architectures for multilingual speech representation learning.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Maximum Likelihood Nonlinear Transformations Based on Deep Neural Networks.

IEEE ACM Trans. Audio Speech Lang. Process., 2016

A novel algorithm for calling mRNA m⁶A peaks by modeling biological variances in MeRIP-seq data.

Bioinform., 2016

Efficient non-linear feature adaptation using Maxout networks.

Steven J. Rennie

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Data Augmentation for Deep Neural Network Acoustic Modeling.

Brian Kingsbury

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Annealed dropout trained maxout networks for improved LVCSR.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Data augmentation for deep convolutional neural network acoustic modeling.

Brian Kingsbury

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Modeling of replicates variances for detecting RNA methylation site in MeRIP-Seq data.

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Sketching the distribution of transcriptomic features on RNA transcripts with Travis coordinates.

Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015

Multilingual representations for low resource speech recognition and keyword search.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Exploiting vocal-source features to improve ASR accuracy for low-resource languages.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recent improvements in neural network acoustic modeling for LVCSR in low resource languages.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Improving deep neural network acoustic modeling for audio corpus indexing under the IARPA Babel program.

Mohammad Sadegh Rasooli

Owen Rambow

Nizar Habash

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A family of discriminative training criteria based on the F-divergence for deep neural networks.

Proceedings of the IEEE International Conference on Acoustics, 2014

Differential analysis of RNA methylome with improved spatial resolution.

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Detecting differentially methylated mRNA from MeRIP-Seq with likelihood ratio test.

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

2013

The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks.

Comput. Speech Lang., 2013

Stereo hidden Markov modeling for noise robust speech recognition.

Comput. Speech Lang., 2013

Exome-based analysis for RNA epigenome sequencing data.

Bioinform., 2013

Adaptive stereo-based stochastic mapping.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Mixtures of Bayesian joint factor analyzers for noise robust automatic speech recognition.

Brian Kingsbury

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

System combination and score normalization for spoken term detection.

Proceedings of the IEEE International Conference on Acoustics, 2013

A high-performance Cantonese keyword search system.

Proceedings of the IEEE International Conference on Acoustics, 2013

Developing speech recognition systems for corpus indexing under the IARPA Babel program.

Proceedings of the IEEE International Conference on Acoustics, 2013

Differential analysis of RNA methylation sequencing data.

Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

An HMM-based Exome Peak-finding package for RNA epigenome sequencing data.

Proceedings of the 2013 IEEE International Workshop on Genomic Signal Processing and Statistics, 2013

Unveiling the dynamics in RNA epigenetic regulations.

Proceedings of the 2013 IEEE International Conference on Bioinformatics and Biomedicine, 2013

An empirical study of confusion modeling in keyword search for low resource languages.

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages.

IEEE Trans. Speech Audio Process., 2012

Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition.

Jing Huang

Jen-Tzung Chien

IEEE Trans. Speech Audio Process., 2012

Sparse Bayesian Factor Analysis for Stereo-based Stochastic Mapping.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Stereo-based stochastic mapping with context using probabilistic PCA for noise robust automatic speech recognition.

Bowen Zhou

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Towards High Performance LVCSR in Speech-to-Speech Translation System on Smart Phones.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Acoustic Modeling with Bootstrap and Restructuring Based on Full Covariance.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition.

Jing Huang

Jen-Tzung Chien

Proceedings of the IEEE International Conference on Acoustics, 2011

Clustering of bootstrapped acoustic model with full covariance.

Proceedings of the IEEE International Conference on Acoustics, 2011

An investigation of heuristic, manual and statistical pronunciation derivation for Pashto.

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech.

Wei Zhang

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Acoustic modeling with bootstrap and restructuring for low-resourced languages.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A comparative study on system combination schemes for LVCSR.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Stereo-Based Stochastic Mapping for Robust Speech Recognition.

IEEE Trans. Speech Audio Process., 2009

A study of bootstrapping with multiple acoustic features for improved automatic speech recognition.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Stereo-based stochastic mapping with discriminative training for noise robust speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2009

Improving online incremental speaker adaptation with eigen feature space MLLR.

Jian Xue

Bowen Zhou

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

High-performance low-latency speech recognition via multi-layered feature streaming and fast Gaussian computation.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

N-best based stochastic mapping on stereo HMM for noise robust speech recognition.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Developing high performance ASR in the IBM multilingual speech-to-speech translation system.

Proceedings of the IEEE International Conference on Acoustics, 2008

MMSE-based stereo feature stochastic mapping for noise robust speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Speaker Adaptation With Limited Data Using Regression-Tree-Based Spectral Peak Alignment.

Shizhen Wang

IEEE Trans. Speech Audio Process., 2007

A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.

IEEE Trans. Speech Audio Process., 2007

Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion.

IEEE Trans. Speech Audio Process., 2007

2006

Adaptation of children's speech with limited data based on formant-like peak alignment.

Comput. Speech Lang., 2006

Rapid speaker adaptation using regression-tree based spectral peak alignment.

Shizhen Wang

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR.

IEEE Trans. Speech Audio Process., 2005

TBALL data collection: the making of a young children's speech corpus.

Shrikanth S. Narayanan

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

MLLR-like speaker adaptation based on linearization of VTLN with MFCC features.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Combining feature compensation and weighted Viterbi decoding for noise robust speech recognition with limited adaptation data.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.

Alexis Bernard

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

A noise-robust ASR back-end technique based on weighted Viterbi recognition.

Alexis Bernard

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Evaluation of noise robust features on the Aurora databases.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Efficient adaptation text design based on the Kullback-Leibler measure.
