Yifan Gong

Affiliations:
  • Microsoft Corporation, Redmond, WA, USA
  • Texas Instruments Inc., Dallas, TX, USA
  • INRIA-Lorraine, Nancy, France
  • Henri Poincaré University, Department of Mathematics and Computer Science, Nancy, France (PhD)


According to our database1, Yifan Gong authored at least 214 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription.
CoRR, 2024

Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Value.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Twenty-Five Years of Evolution in Speech and Language Processing.
IEEE Signal Process. Mag., July, 2023

Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Values.
CoRR, 2023

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation.
CoRR, 2023

Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Multi Transcription-Style Speech Transcription Using Attention-Based Encoder-Decoder Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition.
CoRR, 2022

Joint Speaker Diarisation and Tracking in Switching State-Space Model.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Diarisation Using Location Tracking with Agglomerative Clustering.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Have Best of Both Worlds: Two-Pass Hybrid and E2E Cascading Framework for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Endpoint Detection for Streaming End-to-End Multi-Talker ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Speaker Separation Using Speaker Inventories and Estimated Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Streaming End-to-End Multi-Talker Speech Recognition.
IEEE Signal Process. Lett., 2021

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Streaming Multi-Talker Speech Recognition with Joint Speaker Identification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.
Proceedings of the IEEE International Conference on Acoustics, 2021

Hidden Markov Model Diarisation with Speaker Location Information.
Proceedings of the IEEE International Conference on Acoustics, 2021

Ensemble Combination between Different Time Segmentations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Sequence-Level Self-Teaching Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Addressing Practical Challenges for RNN-Transducer.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Combination of End-to-End and Hybrid Models for Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Exploring Transformers for Large-Scale Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Bandpass Noise Generation and Augmentation for Unified ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Adaptation of RNN Transducer with Text-To-Speech Technology for Keyword Spotting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

L-Vector: Neural Label Embedding for Domain Adaptation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Exploring Pre-Training with Alignments for RNN Transducer Based End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Acoustic Model Adaptation for Presentation Transcription and Intelligent Meeting Assistant Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Advancing Acoustic-to-Word CTC Model With Attention and Mixed-Units.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

PyKaldi2: Yet another speech toolkit based on Kaldi and PyTorch.
CoRR, 2019

Layer Trajectory BLSTM.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker Adaptation for Attention-Based End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Self-Teaching Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic-to-Phrase Models for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Encrypted Speech Recognition Using Deep Polynomial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Single-channel Speech Extraction Using Speaker Inventory and Attention Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Conditional Teacher-student Learning.
Proceedings of the IEEE International Conference on Acoustics, 2019

Attentive Adversarial Learning for Domain-invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2019

Adversarial Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Towards Code-switching ASR for End-to-end CTC Models.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Layer Trajectory LSTM with Future Context Frames.
Proceedings of the IEEE International Conference on Acoustics, 2019

Static and Dynamic State Predictions for Acoustic Model Combination.
Proceedings of the IEEE International Conference on Acoustics, 2019

Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier.
Proceedings of the IEEE International Conference on Acoustics, 2019

Universal Acoustic Modeling Using Neural Mixture Models.
Proceedings of the IEEE International Conference on Acoustics, 2019

CNN with Phonetic Attention for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019


Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Character-Aware Attention-Based End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Improving RNN Transducer Modeling for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Speaker-Invariant Training via Adversarial Learning.
CoRR, 2018

Speaker Adaptation for End-to-End CTC Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Exploring Layer Trajectory LSTM with Depth Processing Units and Attention.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Adversarial Feature-Mapping for Speech Enhancement.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Cycle-Consistent Speech Enhancement.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Layer Trajectory LSTM.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Domain and Speaker Adaptation for Cortana Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker-Invariant Training Via Adversarial Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Developing Far-Field Speaker System Via Teacher-Student Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Acoustic-to-Word CTC Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Connectionist Temporal Classification with Attention Modeling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Large-Scale Domain Adaptation via Teacher-Student Learning.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Don't Count on ASR to Transcribe for You: Breaking Bias with Two Crowds.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Extended low-rank plus diagonal adaptation for deep and recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Unsupervised adaptation with domain separation networks for robust speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Acoustic-to-word model without OOV.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Cracking the cocktail party problem by multi-beam deep attractor network.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
End-to-End attention based text-dependent speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Semi-Supervised Training in Deep Learning Acoustic Model.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Low-rank plus diagonal adaptation for deep neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Recurrent support vector machines for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Geo-location dependent deep neural network acoustic model for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Simplifying long short-term memory acoustic models for fast training and decoding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Investigations on speaker adaptation of LSTM RNN models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring multidimensional lstms for large vocabulary ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
SVD-based universal DNN modeling for multiple scenarios.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Delta-melspectra features for noise robustness to DNN-based ASR systems.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Regularized sequence-level deep neural network model adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep neural support vector machines for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Small-footprint high-performance deep neural network-based speech recognition using split-VQ.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Estimating confidence scores on ASR results using recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An analysis of convolutional neural networks for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

LSTM time and frequency recurrence for automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
An Overview of Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.
Neurocomputing, 2014

Variable-activation and variable-input deep neural network for robust speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Variable-component deep neural network for robust speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Learning small-size DNN with output-distribution-based criteria.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Normalization of ASR confidence classifier scores via confidence mapping.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014

Factorized adaptation for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Restructuring of deep neural network acoustic models with singular value decomposition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.
Proceedings of the IEEE International Conference on Acoustics, 2013

Predicting speech recognition confidence using deep learning with word identity and score features.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Adaptation of context-dependent deep neural networks for automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

A Feature Space Transformation Method for Personalization using Generalized I-Vector Clustering.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Efficient VTS Adaptation Using Jacobian Approximation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improvements to VTS feature enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2010
Unscented transform with online distortion estimation for HMM adaptation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Speech Audio Process., 2009

A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Comput. Speech Lang., 2009

Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009

A study on multilingual acoustic modeling for large vocabulary ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition.
IEEE Trans. Speech Audio Process., 2005

2004
Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Model-space compensation of microphone and noise for speaker-independent speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Noise-dependent Gaussian mixture classifiers for robust rejection decision.
IEEE Trans. Speech Audio Process., 2002

The effects of speech compression on speech recognition and text-to-speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Experiments on speaker-independent voice command recognition using in-vehicle hands free speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A comparative study of approximations for parallel model combination of static and dynamic parameters.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
HMM adaptation and microphone array processing for distant speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A minimum cross-entropy approach to hidden Markov model adaptation.
IEEE Signal Process. Lett., 1999

Speaker-dependent name dialing in a car environment with out-of-vocabulary rejection.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Speech-enabled information retrieval in the automobile environment.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Transforming HMMs for speaker-independent hands-free speech recognition in the car.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition.
IEEE Trans. Speech Audio Process., 1998

Assessing the importance of the segmentation probability in segment-based speech recognition.
Speech Commun., 1998

Environment normalization training and environment adaptation using mixture stochastic trajectory model.
Speech Commun., 1998

1997
Stochastic trajectory modeling and sentence searching for continuous speech recognition.
IEEE Trans. Speech Audio Process., 1997

Speaker normalization training for mixture stochastic trajectory model.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Source normalization training for HMM applied to noisy telephone speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

An acoustic subword unit approach to non-linguistic speech feature identification.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Correlation based predictive adaptation of hidden Markov models.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

The importance of segmentation probability in segment based speech recognizers.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A unified maximum likelihood approach to acoustic mismatch compensation: application to noisy Lombard speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Comparative experiments of several adaptation approaches to noisy speech recognition using stochastic trajectory models.
Speech Commun., 1996

Estimation of mixtures of stochastic dynamic trajectories: application to continuous speech recognition.
Comput. Speech Lang., 1996

A study on continuous Chinese speech recognition based on stochastic trajectory models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Improvement in n-best search for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Stochastic trajectory model with state-mixture for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Modelling long term variability information in mixture stochastic trajectory framework.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Probabilistic mapping networks for speaker recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Speech recognition in noisy environments: A survey.
Speech Commun., 1995

Noise adaptation using linear regression for continuous noisy speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speaker recognition with temporal transition models.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

On MMI learning of Gaussian mixture for speaker models.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory models for speech recognition: an extension to modelling time correlation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory modeling for recognition of unconstrained handwritten words.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

1994
Off-line Handwriting Recognition by Statistical Correlation.
Proceedings of IAPR Workshop on Machine Vision Applications, 1994

A comparison of three noisy speech recognition approaches.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Nonlinear time alignment in stochastic trajectory models for speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Noise independent speech recognition for a variety of noise types.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Stochastic trajectory modeling for speech recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Plausibility functions in continuous speech recognition: The VINICS system.
Speech Commun., 1993

A Bayesian approach to phone duration adaptation for lombard speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Use of explicit context-dependent phonemic model in continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Duration of phones as function of utterance length and its use in automatic speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Iterative transformation and alignment for speech labeling.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Base transformation for environment adaptation in continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Minimization of speech alignment error by iterative transformation for speaker adaptation.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

DTW-based phonetic labeling using explicit phoneme duration constraints.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Hand-written text recognition based on a new formulation.
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

Nonlinear vectorial interpolation for speaker recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Signal-to-String Conversion Based on High Likelihood Regions Using Embedded Dynamic Programming.
IEEE Trans. Pattern Anal. Mach. Intell., 1991

VINICS: a continuous speech recognizer based on a new robust formulation.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Comparing two phoneme identification methods using a continuous speech recognizer.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Continuous speech recognition based on high plausibility regions.
Proceedings of the 1991 International Conference on Acoustics, 1991

Non-linear vector interpolation by neural network for phoneme identification in continuous speech.
Proceedings of the 1991 International Conference on Acoustics, 1991

Neural network coupled with IIR sequential adapter for phoneme recognition in continuous speech.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
A multiknowledge base system for continuous speech understanding.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Towards a general signal interpretation system-signal-to-symbol conversion level.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Text-independent speaker recognition by trajectory space comparison.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Parallel construction of syntactic structure for continuous speech recognition.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1988
A specialist society for continuous speech understanding.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Time domain harmonic matching pitch estimation using time-dependent speech modeling.
IEEE Trans. Acoust. Speech Signal Process., 1987

Phoneme-based continuous speech recognition without pre-segmentation.
Proceedings of the European Conference on Speech Technology, 1987


  Loading...