2025
The 1st Industry Forum on Large Language Models in Consumer Technology at IEEE ICCE 2025.
IEEE Consumer Electron. Mag., May, 2025
2021
DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter.
IEEE Trans. Signal Process., 2020
2016
We Need Your Help to Take the Society to New Heights [President's Message].
IEEE Signal Process. Mag., 2016
Siri's voice gets deep learning.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
2015
Signal Processing: The Science Behind Our Digital Life [President's Message].
IEEE Signal Process. Mag., 2015
Should We Experiment with New Peer-Review Models? [President's Message].
IEEE Signal Process. Mag., 2015
SigPort: A Paper Repository for Signal Processing [President's Message].
IEEE Signal Process. Mag., 2015
The IEEE Gives Our Society the "Thumbs Up" [President's Message].
IEEE Signal Process. Mag., 2015
The IEEE Signal Processing Cup: A Competition for Undergraduate Students [President's Message].
IEEE Signal Process. Mag., 2015
SigView: Video Tutorials in Emerging Signal Processing Topics [President's Message].
IEEE Signal Process. Mag., 2015
2014
Chapters? Role in Networking and Continuing Education [President's Message].
IEEE Signal Process. Mag., 2014
Where Does Your Conference Registration Fee Go? [President's Message].
IEEE Signal Process. Mag., 2014
At the Forefront in Technical Publications [President's Message].
IEEE Signal Process. Mag., 2014
2013
Recent advances in deep learning for speech research at Microsoft.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2013
Learning deep structured semantic models for web search using clickthrough data.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013
2012
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition.
IEEE Trans. Speech Audio Process., 2012
Factored adaptation using a combination of feature-space and model-space transforms.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
New methods and evaluation experiments on translating TED talks in the IWSLT benchmark.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Media Search in Mobile Devices [From the Guest Editors].
IEEE Signal Process. Mag., 2011
The MSR SYSTEM for IWSLT 2011 evaluation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011
Separating Speaker and Environmental Variability Using Factored Transforms.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
A novel decision function and the associated decision-feedback learning for speech translation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Lexicon modeling for query understanding.
Proceedings of the IEEE International Conference on Acoustics, 2011
Why word error rate is not a good metric for speech recognizer training for the speech translation task?
Proceedings of the IEEE International Conference on Acoustics, 2011
Joint encoding of the waveform and speech recognition features using a transform codec.
Proceedings of the IEEE International Conference on Acoustics, 2011
A new speaker identification algorithm for gaming scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2011
Large vocabulary continuous speech recognition with context-dependent DBN-HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2011
Factored adaptation for separable compensation of speaker and environmental variability.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
Speaker adaptation with an Exponential Transform.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Learning with click graph for query intent classification.
ACM Trans. Inf. Syst., 2010
Noise Adaptive Training for Robust Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2010
Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion.
Comput. Speech Lang., 2010
Continuous speech recognition with a TF-IDF acoustic model.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Binary coding of speech spectrograms using a deep auto-encoder.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Information retrieval methods for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
Reverberated speech signal separation based on regularized subband feedforward ICA and instantaneous direction of arrival.
Proceedings of the IEEE International Conference on Acoustics, 2010
Discriminative training methods for language models using conditional entropy criteria.
Proceedings of the IEEE International Conference on Acoustics, 2010
Context dependent phonetic string edit distance for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Speech Audio Process., 2009
Using continuous features in the maximum entropy model.
Pattern Recognit. Lett., 2009
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Comput. Speech Lang., 2009
Extracting structured information from user queries with semi-supervised conditional random fields.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009
Hidden conditional random field with distribution constraints for phone classification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009
Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009
Maximizing global entropy reduction for active learning in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Using collective information in semi-supervised learning for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Voice search of structured media data.
Proceedings of the IEEE International Conference on Acoustics, 2009
A study on multilingual acoustic modeling for large vocabulary ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009
Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Experimenting with a global decision tree for state clustering in automatic speech recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2009
Noise robust model adaptation using linear spline interpolation.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008
An Integrative and Discriminative Technique for Spoken Utterance Classification.
IEEE Trans. Speech Audio Process., 2008
An introduction to voice search.
IEEE Signal Process. Mag., 2008
Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling.
Speech Commun., 2008
Large-margin minimum classification error training: A theoretical risk minimization perspective.
Comput. Speech Lang., 2008
Learning query intent from regularized click graphs.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008
Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Inductive and example-based learning for text classification.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Sound capture system and spatial filter for small devices.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Automatic children's reading tutor on hand-held devices.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
Maximum a posteriori ICA: Applying prior knowledge to the separation of acoustic sources.
Proceedings of the IEEE International Conference on Acoustics, 2008
Robust design of wideband loudspeaker arrays.
Proceedings of the IEEE International Conference on Acoustics, 2008
AN EM-based probabilistic approach for Acoustic Echo Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2008
Language modeling for voice search: A machine translation approach.
Proceedings of the IEEE International Conference on Acoustics, 2008
Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
Speech enhancement using a pitch predictive model.
Proceedings of the IEEE International Conference on Acoustics, 2008
Live search for mobile: Web services by voice on the cellphone.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition.
IEEE Trans. Speech Audio Process., 2007
Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model.
IEEE Trans. Speech Audio Process., 2007
Automatic Removal of Typed Keystrokes From Speech Signals.
IEEE Signal Process. Lett., 2007
Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation.
Comput. Speech Lang., 2007
Soft indexing of speech content for search in spoken documents.
Comput. Speech Lang., 2007
Commute UX: Telephone Dialog System for Location-based Services.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007
Report on the NSF-sponsored Human Language Technology Workshop on Industrial Centers.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of Machine Translation Summit XI: Papers, 2007
The voice-rate dialog system for consumer ratings.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Automated directory assistance system - from theory to practice.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Handling phonetic context and speaker variation in a structure-based speech recognizer.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Confidence measures for voice search applications.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Voicepedia: towards speech-based access to unstructured information.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Robust location understanding in spoken dialog systems using intersections.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
A fine pitch model for speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2007
Robust Adaptive Beamforming Algorithm using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability.
Proceedings of the IEEE International Conference on Acoustics, 2007
A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification.
Proceedings of the IEEE International Conference on Acoustics, 2007
Maximum Entropy Confidence Estimation for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007
A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007
Microphone Array Post-Filter using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise.
Proceedings of the IEEE International Conference on Acoustics, 2007
Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System.
Proceedings of the IEEE International Conference on Acoustics, 2007
Maximum entropy model parameterization with TF∗IDF weighted vector space model.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Adapting grapheme-to-phoneme conversion for name recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Structured speech modeling.
IEEE Trans. Speech Audio Process., 2006
A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
IEEE Trans. Speech Audio Process., 2006
Tracking vocal tract resonances using a quantized nonlinear function embeddedin a temporal constraint.
IEEE Trans. Speech Audio Process., 2006
A lattice search technique for a long-contextual-span hidden trajectory model of speech.
Speech Commun., 2006
Rapid development of spoken language understanding grammars.
Speech Commun., 2006
Adaptation of maximum entropy capitalizer: Little data can help a lot.
Comput. Speech Lang., 2006
Integration of Metadata in spoken Document Search Using Position Specific Posterior latices.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
An effective and efficient utterance verification technology using word n-gram filler models.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Use of incrementally regulated discriminative margins in MCE training for speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Discriminative models for spoken language understanding.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Call analysis with classification using speech and non-speech features.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
N-Gram Based Filler Model for Robust Grammar Authoring.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Speech Utterance Classification Model Training without Manual Transcriptions.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Training Algorithms for Hidden Conditional Random Fields.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models.
Proceedings of the ACL 2006, 2006
2005
Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering.
IEEE Trans. Speech Audio Process., 2005
Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion.
IEEE Trans. Speech Audio Process., 2005
Spoken language understanding.
IEEE Signal Process. Mag., 2005
Analysis and comparison of two speech feature extraction/compensation algorithms.
IEEE Signal Process. Lett., 2005
Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
SGStudio: rapid semantic grammar development for spoken language understanding.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
A graphical model for multi-sensory speech processing in air-and-bone conductive microphones.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Robust bandwidth extension of noise-corrupted narrowband speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Hidden conditional random fields for phone classification.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Maximum mutual information SPLICE transform for seen and unseen conditions.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Indexing uncertainty for spoken document search.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Automatic Head-size Equalization in Panorama Images for Video Conferencing.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Maximum Entropy Based Generic Filter for Language Model Adaptation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Training Wideband Acoustic Models using Mixed-Bandwidth Training Data via Feature Bandwidth Extension.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Unsupervised Semantic Intent Discovery from Call Log Acoustics.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
SPEECH OGLE: Indexing Uncertainty for Spoken Document Search.
Proceedings of the ACL 2005, 2005
Position Specific Posterior Lattices for Indexing Speech.
Proceedings of the ACL 2005, 2005
2004
Speech and Language Processing for Multimodal Human-Computer Interaction.
J. VLSI Signal Process., 2004
Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features.
IEEE Trans. Speech Audio Process., 2004
Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise.
IEEE Trans. Speech Audio Process., 2004
Use and Acquisition of Semantic Language Model.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004
Direct filtering for air- and bone-conductive microphones.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004
Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004
Unsupervised learning from users' error correction in speech dictation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Multi-sensory microphones for robust speech detection, enhancement and recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Noise robust speech recognition with a switching linear dynamic model.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , 2004
2003
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition.
IEEE Trans. Speech Audio Process., 2003
Speech Recognition and Understanding.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003
Improved name recognition with user modeling.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Combination of CFG and n-gram modeling in semantic grammar learning.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A harmonic-model-based front end for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Adapting acoustic models to new domains and conditions using untranscribed data.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A comparison of three non-linear observation models for noisy speech features.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Estimating speech recognition error rate without acoustic test data.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Discriminative training of n-gram classifiers for speech and text routing.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Concept acquisition in example-based grammar authoring.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Speech utterance classification.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
An expectation maximization approach for formant tracking using a parameter-free non-linear predictor.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Distributed speech processing in miPad's multimodal user interface.
,
,
,
,
,
,
,
,
,
,
IEEE Trans. Speech Audio Process., 2002
Combination of statistical and rule-based approaches for spoken language understanding.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Evaluation of SPLICE on the Aurora 2 and 3 tasks.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Noise from corrupted speech log mel-spectral energies.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Exploiting variances in robust feature extraction based on a parametric model of speech distortion.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Separating colorred signals distorted by convolutive channels using diagonal constrained decorrelation.
Proceedings of the IEEE International Conference on Acoustics, 2002
Evaluation of spoken language grammar learning in the ATIS domain.
Proceedings of the IEEE International Conference on Acoustics, 2002
Uncertainty decoding with SPLICE for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
A Bayesian approach to speech feature enhancement using the dynamic cepstral prior.
Proceedings of the IEEE International Conference on Acoustics, 2002
A speech-centric perspective for human-computer interface.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002
2001
ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001
ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Evaluation of the SPLICE algorithm on the Aurora2 database.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Experimental investigation of delayed instantaneous demixer for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2001
Towards non-stationary model-based noise adaptation for large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001
MiPad: a multimodal interaction prototype.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2001
Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2001
High-performance robust speech recognition using stereo training data.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Speech Denoising and Dereverberation Using Probabilistic Models.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000
Automatically extracting highlights for TV Baseball programs.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000
Mipad: a next generation PDA prototype.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Large-vocabulary speech recognition under adverse acoustic environments.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
HMM adaptation using vector taylor series for noisy speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Speech/noise separation using two microphones and a VQ model of speech signals.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Improvements on speech recognition for fast talkers.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Formant analysis and synthesis using hidden Markov models.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Speech Research: Near and Not-so-near Results and What They Might Mean for IUI (Panel).
Proceedings of the 3rd International Conference on Intelligent User Interfaces, 1998
HMM-based smoothing for concatenative speech synthesis.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Maximum a posteriori pitch tracking.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Automatic generation of synthesis units for trainable text-to-speech systems.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Source-filter models for time-scale pitch-scale modification of speech.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1997
Recent improvements on Microsoft's trainable text-to-speech system-Whistler.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Whistler: a trainable text-to-speech system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Speaker and gender normalization for continuous-density hidden Markov models.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Microsoft Windows highly intelligent speech recognizer: Whisper.
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Discriminative training of garbage model for non-vocabulary utterance rejection.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
The VESTEL telephone speech database.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Signal processing for robust speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Environment normalization for robust speech recognition using direct cepstral comparison.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
Efficient Cepstral Normalization For Robust Speech Recognition.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993
Robust HMM-based endpoint detector.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Rejection techniques for digit recognition in telecommunication applications.
Proceedings of the IEEE International Conference on Acoustics, 1993
1992
Multiple approaches to robust speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Efficient joint compensation of speech for the effects of additive noise and linear filtering.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1991
Robust speech recognition by normalization of the acoustic space.
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Towards Environment-Independent Spoken Language Systems.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Acoustical pre-processing for robust spoken language systems.
Proceedings of the First International Conference on Spoken Language Processing, 1990
Environmental robustness in automatic speech recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
ACOUSTICAL PRE-PROCESSING FOR ROBUST SPEECH RECOGNITION.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989