Li Deng
ORCID: 0000-0002-1014-0790
Affiliations:
- Artificial Intelligence, Citadel, USA
- Microsoft Research, Redmond, WA, USA
- University of Waterloo, Department of Electrical and Computer Engineering, ON, Canada (1989 - 1999)
- University of Wisconsin-Madison, WI, USA (PhD 1986)
According to our database, Li Deng authored at least 325 papers between 1991 and 2023.
Bibliography
2023
2020
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications.
IEEE J. Sel. Top. Signal Process., 2020
Introduction to the Special Issue on Deep Learning for Multi-Modal Intelligence Across Speech, Language, Vision, and Heterogeneous Signals.
IEEE J. Sel. Top. Signal Process., 2020
Prediction model of the response to neoadjuvant chemotherapy in breast cancers by a Naive Bayes algorithm.
Comput. Methods Programs Biomed., 2020
2019
From Caesar Cipher to Unsupervised Learning: A New Method for Classifier Parameter Estimation.
CoRR, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Artificial Intelligence in the Rising Wave of Deep Learning: The Historical Path and Future Outlook [Perspectives].
IEEE Signal Process. Mag., 2018
CoRR, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Perspectives on predictive power of multimodal deep learning: surprises and future directions.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations, 2018
2017
IEEE Signal Process. Mag., 2017
Challenges and Open Problems in Signal Processing: Panel Discussion Summary from ICASSP 2017 [Panel and Forum].
IEEE Signal Process. Mag., 2017
Signal Process., 2017
Deep Learning of Grammatically-Interpretable Representations Through Question-Answering.
CoRR, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
IEEE Trans. Signal Process., 2016
Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Efficient Exploration for Dialog Policy Learning with Deep BBQ Networks & Replay Buffer Spiking.
CoRR, 2016
Proceedings of the 4th International Conference on Learning Representations, 2016
Deep Reinforcement Learning with a Combinatorial Action Space for Predicting and Tracking Popular Discussion Threads.
CoRR, 2016
CoRR, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the Workshop on Cognitive Computation: Integrating neural and symbolic approaches 2016 co-located with the 30th Annual Conference on Neural Information Processing Systems (NIPS 2016), 2016
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
End-to-End Memory Networks with Knowledge Carryover for Multi-Turn Spoken Language Understanding.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Exploiting correlations among channels in distributed compressive sensing with convolutional deep stacking networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Interpreting the prediction process of a deep network constructed from supervised topic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Reconstruction of sparse vectors in compressive sensing with multiple measurement vectors using bidirectional long short-term memory.
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016
Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends.
IEEE Signal Process. Mag., 2015
Proceedings of the 3rd International Conference on Learning Representations, 2015
Deep Sentence Embedding Using the Long Short Term Memory Network: Analysis and Application to Information Retrieval.
CoRR, 2015
End-to-end Learning of Latent Dirichlet Allocation by Mirror-Descent Back Propagation.
CoRR, 2015
End-to-end Learning of LDA by Mirror-Descent Back Propagation over a Deep Architecture.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015
Representation Learning Using Multi-Task Deep Neural Networks for Semantic Classification and Information Retrieval.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.
Neurocomputing, 2014
Proceedings of the 2nd International Conference on Learning Representations, 2014
Learning semantic representations using convolutional neural networks for web search.
Proceedings of the 23rd International World Wide Web Conference, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Achievements and challenges of deep learning - from speech analysis and recognition to language and multimodal processing.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Sequence classification using the high-level features extracted from deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
The Deep Tensor Neural Network With Applications to Large Vocabulary Speech Recognition.
IEEE Trans. Speech Audio Process., 2013
IEEE Trans. Speech Audio Process., 2013
Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis.
IEEE Trans. Speech Audio Process., 2013
IEEE Signal Process. Lett., 2013
Proc. IEEE, 2013
Proc. IEEE, 2013
IEEE Trans. Pattern Anal. Mach. Intell., 2013
Neurocomputing, 2013
Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Exploring convolutional neural network structures and optimization techniques for speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Using deep stacking network to improve structured compressed sensing with Multiple Measurement Vectors.
Proceedings of the IEEE International Conference on Acoustics, 2013
Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.
Proceedings of the IEEE International Conference on Acoustics, 2013
Predicting speech recognition confidence using deep learning with word identity and score features.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Multi-style adaptive training for robust cross-lingual spoken language understanding.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
New types of deep neural network learning for speech recognition and related applications: an overview.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013
2012
Inaugural Editorial: Riding the Tidal Wave of Human-Centric Information Processing - Innovate, Outreach, Collaborate, Connect, Expand, and Win.
IEEE Trans. Speech Audio Process., 2012
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition.
IEEE Trans. Speech Audio Process., 2012
The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web].
IEEE Signal Process. Mag., 2012
Pattern Recognit. Lett., 2012
Adaptation of context-dependent deep neural networks for automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Use of kernel deep convex networks and end-to-end learning for spoken language understanding.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Being deep and being dynamic - new-generation models and methodology for advancing speech technology.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Exploiting sparseness in deep neural networks for large vocabulary speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Towards deeper understanding: Deep convex networks for semantic utterance classification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
New methods and evaluation experiments on translating TED talks in the IWSLT benchmark.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Deep Learning and Its Applications to Signal and Information Processing [Exploratory DSP].
IEEE Signal Process. Mag., 2011
IEEE Signal Process. Mag., 2011
Speech Recognition, Machine Translation, and Speech Translation - A Unified Discriminative Learning Paradigm [Lecture Notes].
IEEE Signal Process. Mag., 2011
IEEE Signal Process. Mag., 2011
IEEE Signal Process. Mag., 2011
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
A novel decision function and the associated decision-feedback learning for speech translation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Why word error rate is not a good metric for speech recognizer training for the speech translation task?
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011
2010
Proceedings of the Handbook of Natural Language Processing, Second Edition., 2010
IEEE Signal Process. Mag., 2010
IEEE J. Sel. Top. Signal Process., 2010
Introduction to the Issue on Statistical Learning Methods for Speech and Language Processing.
IEEE J. Sel. Top. Signal Process., 2010
Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion.
Comput. Speech Lang., 2010
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Investigation of full-sequence training of deep belief networks for speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Speech Audio Process., 2009
IEEE Signal Process. Mag., 2009
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education].
IEEE Signal Process. Mag., 2009
Developments and directions in speech recognition and understanding, Part 1 [DSP Education].
IEEE Signal Process. Mag., 2009
Pattern Recognit. Lett., 2009
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Comput. Speech Lang., 2009
Hidden conditional random field with distribution constraints for phone classification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Rethinking of computation for future-generation, knowledge-rich speech recognition and understanding.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Synthesis Lectures on Speech and Audio Processing, Morgan & Claypool Publishers, ISBN: 978-3-031-02557-0, 2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008
IEEE Trans. Speech Audio Process., 2008
IEEE Signal Process. Mag., 2008
Large-margin minimum classification error training: A theoretical risk minimization perspective.
Comput. Speech Lang., 2008
Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Guest Editors' Introduction: Special Section on Emergent Systems, Algorithms and Architectures for Speech-Based Human-Machine Interaction.
IEEE Trans. Computers, 2007
Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model.
IEEE Trans. Speech Audio Process., 2007
Pattern Recognit. Lett., 2007
Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation.
Comput. Speech Lang., 2007
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007
Handling phonetic context and speaker variation in a structure-based speech recognizer.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Structure-based and template-based automatic speech recognition - comparing parametric and non-parametric approaches.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2007
A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification.
Proceedings of the IEEE International Conference on Acoustics, 2007
Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System.
Proceedings of the IEEE International Conference on Acoustics, 2007
Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007
High-performance HMM adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Synthesis Lectures on Speech and Audio Processing, Morgan & Claypool Publishers, ISBN: 978-3-031-02555-6, 2006
A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
IEEE Trans. Speech Audio Process., 2006
Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint.
IEEE Trans. Speech Audio Process., 2006
A lattice search technique for a long-contextual-span hidden trajectory model of speech.
Speech Commun., 2006
A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from Mel-cepstral coefficients.
Speech Commun., 2006
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006
Use of incrementally regulated discriminative margins in MCE training for speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A time-synchronous phonetic decoder for a long-contextual-span hidden trajectory model.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
J. VLSI Signal Process., 2005
Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion.
IEEE Trans. Speech Audio Process., 2005
Speech technology and systems in human-machine communication [from the Guest Editors].
IEEE Signal Process. Mag., 2005
IEEE Signal Process. Lett., 2005
Evaluation of a long-contextual-span hidden trajectory model and phonetic recognizer using A* lattice search.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Multi-sensory speech processing: incorporating automatically extracted hidden dynamic information.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
J. VLSI Signal Process., 2004
IEEE Trans. Speech Audio Process., 2004
Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features.
IEEE Trans. Speech Audio Process., 2004
Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise.
IEEE Trans. Speech Audio Process., 2004
Comput. Speech Lang., 2004
Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Use of neural network mapping and extended Kalman filter to recover vocal tract resonances from the MFCC parameters of speech.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
A multimodal variational approach to learning and inference in switching state space models [speech processing application].
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Joint state and parameter estimation for a target-directed nonlinear dynamic system model.
IEEE Trans. Signal Process., 2003
Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model.
IEEE Trans. Speech Audio Process., 2003
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition.
IEEE Trans. Speech Audio Process., 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - model and training.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - MAP decoding and evaluation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Variational inference and learning for segmental switching state space models of hidden speech dynamics.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
An expectation maximization approach for formant tracking using a parameter-free non-linear predictor.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
A robust compensation strategy for extraneous acoustic variations in spontaneous speech recognition.
IEEE Trans. Speech Audio Process., 2002
IEEE Trans. Speech Audio Process., 2002
Nonstationary-state hidden Markov model representation of speech signals for speech enhancement.
Signal Process., 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Exploiting variances in robust feature extraction based on a parametric model of speech distortion.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
A new approach to speech enhancement by a microphone array using EM and mixture models.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
A mixture linear model with target-directed dynamics for spontaneous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002
2001
A Bayesian approach to the verification problem: applications to speaker verification.
IEEE Trans. Speech Audio Process., 2001
A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model.
IEEE Trans. Speech Audio Process., 2001
Parameter estimation of a target-directed dynamic system model with switching states.
Signal Process., 2001
ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic], 2001
Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
ALGONQUIN: iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
An EKF-based algorithm for learning statistical hidden dynamic model parameters for phonetic recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Towards non-stationary model-based noise adaptation for large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech.
Comput. Speech Lang., 2000
Proceedings of the Advances in Neural Information Processing Systems 13, 2000
Annotation and Use of Speech Production Corpus for Building Language-Universal Speech Recognizers.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Data-driven model construction for continuous speech recognition using overlapping articulatory features.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
A robust training strategy against extraneous acoustic variations for spontaneous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
A dynamic system approach to speech enhancement using the H∞ filtering algorithm.
IEEE Trans. Speech Audio Process., 1999
A layered neural network interfaced with a cochlear model for the study of speech encoding in the auditory system.
Comput. Speech Lang., 1999
Optimization of dynamic regimes in a statistical hidden dynamic model for conversational speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Speech analysis and recognition using interval statistics generated from a composite auditory model.
IEEE Trans. Speech Audio Process., 1998
HMM-based strategies for enhancement of speech signals embedded in nonstationary noise.
IEEE Trans. Speech Audio Process., 1998
IEEE Trans. Speech Audio Process., 1998
A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition.
Speech Commun., 1998
Integrated-multilingual Speech Recognition and Its Impact on Chinese Spoken Language Processing.
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
IEEE Trans. Signal Process., 1997
HMM-based speech recognition using state-dependent, discriminatively derived transforms on mel-warped DFT features.
IEEE Trans. Speech Audio Process., 1997
IEEE Trans. Speech Audio Process., 1997
Speech Commun., 1997
Speech recognition using autosegmental representation of phonological units with interface to the trended HMM.
Speech Commun., 1997
Maximum likelihood in statistical estimation of dynamic systems: Decomposition algorithm and simulation results.
Signal Process., 1997
Speaker adaptation experiments using nonstationary-state hidden Markov models: a MAP approach.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Integrated-multilingual speech recognition using universal phonological features in a functional speech production model.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Transitional speech units and their representation by regressive Markov states: applications to speech recognition.
IEEE Trans. Speech Audio Process., 1996
Signal Process., 1996
Construction of state-dependent dynamic parameters using the maximum likelihood approach: Applications to speech recognition.
Signal Process., 1996
Transiems as dynamically defined, sub-phonemic units of speech: A computational model.
Signal Process., 1996
Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin Chinese.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Optimal filtering and smoothing for speech recognition using a stochastic target model.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Interaction of speech disorders with speech coders: effects on speech intelligibility.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Hierarchical partition of the articulatory state space for overlapping-feature based speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
HMM-based speech recognition using state-dependent, linear transforms on Mel-warped DFT features.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Tracking nonstationary targets using a dynamical system with Markov-modulated parameters.
IEEE Signal Process. Lett., 1995
A Markov model containing state-conditioned second-order non-stationarity: application to speech recognition.
Comput. Speech Lang., 1995
Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units.
Proceedings of the 1995 International Conference on Acoustics, 1995
Use of generalized dynamic feature parameters for speech recognition: maximum likelihood and minimum classification error approaches.
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Waveform-based speech recognition using hidden filter models: parameter selection and sensitivity to power normalization.
IEEE Trans. Speech Audio Process., 1994
A statistical model for formant-transition microsegments of speech incorporating locus equations.
Signal Process., 1994
Neural Parallel Sci. Comput., 1994
Analysis of the correlation structure for a neural predictive model with application to speech recognition.
Neural Networks, 1994
Nonstationary-state hidden Markov model with state-dependent time warping: application to speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Comparative performance of spectral subtraction and HMM-based speech enhancement strategies with application to hearing aid design.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Vowel classification using a neural predictive HMM: a discriminative training approach.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
IEEE Trans. Speech Audio Process., 1993
Hidden Markov model representation of quantized articulatory features for speech recognition.
Comput. Speech Lang., 1993
Speech recognition using the atomic speech units constructed from overlapping articulatory features.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
1992
A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal.
Signal Process., 1992
Processing of acoustic signals in a cochlear model incorporating laterally coupled suppressive elements.
Neural Networks, 1992
HMM representation of quantized articulatory features for recognition of highly confusable words.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1991
Microstructural speech units and their HMM representation for discrete utterance speech recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991