Erik McDermott

According to our database1, Erik McDermott authored at least 63 papers between 1989 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models.
CoRR, 2024

Optimizing Byte-level Representation for End-to-end ASR.
CoRR, 2024

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition.
CoRR, 2024

2023
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Neural Transducer Training: Reduced Memory Consumption with Sample-Wise Computation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation.
CoRR, 2022

2020
Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
A Density Ratio Approach to Language Model Fusion in End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Sampled Connectionist Temporal Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Modeling for Google Home.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2015
A Gaussian Mixture Model layer jointly optimized with discriminative features within a Deep Neural Network architecture.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Sequence discriminative distributed training of long short-term memory recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Deep neural networks for small footprint text-dependent speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Large scale deep neural network acoustic modeling with semi-supervised training data for YouTube video transcription.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
An integrated framework for "margin" based sequential discriminative training over lattices based on differenced maximum mutual information (dMMI).
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

2010
A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification.
IEEE J. Sel. Top. Signal Process., 2010

Minimum Error Classification with geometric margin control.
Proceedings of the IEEE International Conference on Acoustics, 2010

A discriminative model for continuous speech recognition based on Weighted Finite State Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative training based on an integrated view of MPE and MMI in margin and error space.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Margin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A unified view for discriminative objective functions based on negative exponential of difference measure between strings.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Flexible discriminative training based on equal error group scores obtained from an error-indexed forward-backward algorithm.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error.
IEEE Trans. Speech Audio Process., 2007

Inverting mappings from smooth paths through R<sup>n</sup> to paths through R<sup>m</sup>: A technique applied to recovering articulation from acoustics.
Speech Commun., 2007

String and lattice based discriminative training for the corpus of spontaneous Japanese lecture transcription task.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Discriminative MCE-based speaker adaptation of acoustic models for a spoken lecture processing task.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Production-Oriented Models for Speech Recognition.
IEICE Trans. Inf. Syst., 2006

Advanced computational models and learning theories for spoken language processing.
IEEE Comput. Intell. Mag., 2006

Discriminative training via minimization of risk estimates based on Parzen smoothing.
Appl. Intell., 2006

Training Conditional Random Fields with Multivariate Evaluation Measures.
Proceedings of the ACL 2006, 2006

2005
Optimization methods for discriminative training.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Minimum Classification Error for Large Scale Speech Recognition Tasks using Weighted Finite State Transducers.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A derivation of minimum classification error from the theoretical classification risk using Parzen estimation.
Comput. Speech Lang., 2004

A theoretical analysis of speech recognition based on feature trajectory models.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Minimum classification error training of landmark models for real-time continuous speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Blind inversion of multidimensional functions for speech enhancement.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Pervasive unsupervised adaptation for lecture speech transcription.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Recognition method with parametric trajectory generated from mixture distribution HMMs.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

A new formalization of minimum classification error using a Parzen estimate of classification chance.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Minimum classification error via a Parzen window based estimate of the theoretical Bayes classification risk.
Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing, 2002

Classification error from the theoretical Bayes classification risk.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A recognition method with parametric trajectory synthesized using direct relations between static and dynamic feature vector time series.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
An application of discriminative feature extraction to filter-bank-based speech recognition.
IEEE Trans. Speech Audio Process., 2001

Time and memory efficient viterbi decoding for LVCSR using a precompiled search network.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Discriminative training for large vocabulary telephone-based name recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
String-level MCE for continuous phoneme recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Efficient normalization based upon GPD [generalized probabilistic descent].
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A telephone-based directory assistance system adaptively trained using minimum classification error/generalized probabilistic descent.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
A discriminative filter bank model for speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Prototype-based minimum classification error/generalized probabilistic descent training for various speech units.
Comput. Speech Lang., 1994

Prototype-based minimum error training for speech recognition.
Appl. Intell., 1994

1993
Prototype-based MCE/GPD training for word spotting and connected word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Prototype-based discriminative training for various speech units.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
LVQ-based shift-tolerant phoneme recognition.
IEEE Trans. Signal Process., 1991

Speaker-independent large vocabulary word recognition using an LVQ/HMM hybrid algorithm.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
On the robustness of HMM and ANN speech recognition algorithms.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A hybrid speech recognition system using HMMs with an LVQ-trained codebook.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Shift-invariant, multi-category phoneme recognition using Kohonen's LVQ2.
Proceedings of the IEEE International Conference on Acoustics, 1989

A new algorithm for representing acoustic feature dynamics.
Proceedings of the IEEE International Conference on Acoustics, 1989


  Loading...