Patrick Nguyen

According to our database, Patrick Nguyen authored at least 73 papers between 1998 and 2021.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Enhancing the Natural Biological Control in the Thyroid Hormone Homeostasis As a First-Order Control System.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

StarNet: Targeted Computation for Object Detection in Point Clouds.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Model Unit Exploration for Sequence-to-Sequence Speech Recognition.
CoRR, 2019

On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Comparison of End-to-End Models for Long-Form Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Speech Recognition for Medical Conversations.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.
CoRR, 2017

From BigBench to TPCx-BB: Standardization of a Big Data Benchmark.
Proceedings of the Performance Evaluation and Benchmarking. Traditional - Big Data - Internet of Things, 2016

On rectified linear units for speech processing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multilingual acoustic models using distributed deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

Large Scale Language Modeling in Automatic Speech Recognition.
CoRR, 2012

Recurrent Neural Networks for Noise Reduction in Robust ASR.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Investigations on exemplar-based features for speech recognition towards thousands of hours of unsupervised, noisy data.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2011

MLP based phoneme detectors for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminative duration modeling for speech recognition with segmental conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speech Recognition With Flat Direct Models.
IEEE J. Sel. Top. Signal Process., 2010

Continuous speech recognition with a TF-IDF acoustic model.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

SCARF: a segmental conditional random field toolkit for speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

From flat direct models to segmental CRF models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative template extraction for direct modeling.
Proceedings of the IEEE International Conference on Acoustics, 2010

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination.
ACM Trans. Asian Lang. Inf. Process., 2009

Techware: Speech recognition software and resources on the web [Best of the Web].
IEEE Signal Process. Mag., 2009

Multi-scale Personalization for Voice Search Applications.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Maximum mutual information multi-phone units in direct modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Leveraging multiple query logs to improve language models for spoken query recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

A flat direct model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

A segmental CRF approach to large vocabulary continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Joint n-best rescoring for repeated utterances in spoken dialog systems.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Optimal Dialog in Consumer-Rating Systems using POMDP Framework.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Learning N-Best Correction Models from Implicit User Feedback in a Multi-Modal Local Search Application.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Scalable summaries of spoken conversations.
Proceedings of the 13th International Conference on Intelligent User Interfaces, 2008

Structured models for joint decoding of repeated utterances.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

An empirical study of automatic accent classification.
Proceedings of the IEEE International Conference on Acoustics, 2008

Live search for mobile: Web services by voice on the cellphone.
Proceedings of the IEEE International Conference on Acoustics, 2008

Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Training Non-Parametric Features for Statistical Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

The voice-rate dialog system for consumer ratings.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007

Finding Speaker Identities with a Conditional Maximum Entropy Model.
Proceedings of the IEEE International Conference on Acoustics, 2007

Uncertainty in training large vocabulary speech recognizers.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Speech technology for multimedia content management.
Proceedings of the 1st IEEE Consumer Communications and Networking Conference, 2004

Large vocabulary noise robustness on Aurora4.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Large corpus experiments for broadcast news recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Towards domain independent speaker clustering.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

LU factorization for feature transformation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Blind channel estimation based on speech correlation structure.
Proceedings of the IEEE International Conference on Acoustics, 2002

Piecewise linear constraints for model space adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2002

Separating speaker and environment variabilities for improved recognition in non-stationary conditions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Maximum-likelihood training of a bipartite acoustic model for speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Very fast adaptation with a compact context-dependent eigenvoice model.
Proceedings of the IEEE International Conference on Acoustics, 2001

Rapid speaker adaptation in eigenvoice space.
IEEE Trans. Speech Audio Process., 2000

Eigenvoices: A compact representation of speakers in model space.
Ann. des Télécommunications, 2000

Speaker identification and verification using eigenvoices.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

EWAVES: an efficient decoding algorithm for lexical tree based speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

N-best based supervised and unsupervised adaptation for native and non-native speakers in cars.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Fast speaker adaptation using a priori knowledge.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Eigenfaces and eigenvoices: dimensionality reduction for specialized pattern recognition.
Proceedings of the Second IEEE Workshop on Multimedia Signal Processing, 1998

Eigenvoices for speaker adaptation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
