Yifan Gong
Affiliations:- Microsoft Corporation, Redmond, WA, USA
- Texas Instruments Inc., Dallas, TX, USA
- INRIA-Lorraine, Nancy, France
- Henri Poincaré University, Department of Mathematics and Computer Science, Nancy, France (PhD)
According to our database1,
Yifan Gong
authored at least 214 papers
between 1987 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription.
CoRR, 2024
Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Value.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
IEEE Signal Process. Mag., July, 2023
Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Values.
CoRR, 2023
Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation.
CoRR, 2023
Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Multi Transcription-Style Speech Transcription Using Attention-Based Encoder-Decoder Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Have Best of Both Worlds: Two-Pass Hybrid and E2E Cascading Framework for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE Signal Process. Lett., 2021
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Exploring Pre-Training with Alignments for RNN Transducer Based End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Acoustic Model Adaptation for Presentation Transcription and Intelligent Meeting Assistant Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Unsupervised adaptation with domain separation networks for robust speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Small-footprint high-performance deep neural network-based speech recognition using split-VQ.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.
Neurocomputing, 2014
Variable-activation and variable-input deep neural network for robust speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Restructuring of deep neural network acoustic models with singular value decomposition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.
Proceedings of the IEEE International Conference on Acoustics, 2013
Predicting speech recognition confidence using deep learning with word identity and score features.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Adaptation of context-dependent deep neural networks for automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
A Feature Space Transformation Method for Personalization using Generalized I-Vector Clustering.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Speech Audio Process., 2009
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Comput. Speech Lang., 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008
Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.
IEEE Trans. Speech Audio Process., 2007
High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
A Method of Joint Compensation of Additive and Convolutive Distortions for Speaker-Independent Speech Recognition.
IEEE Trans. Speech Audio Process., 2005
2004
Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Model-space compensation of microphone and noise for speaker-independent speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
IEEE Trans. Speech Audio Process., 2002
The effects of speech compression on speech recognition and text-to-speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Experiments on speaker-independent voice command recognition using in-vehicle hands free speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
A comparative study of approximations for parallel model combination of static and dynamic parameters.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier.
Proceedings of the IEEE International Conference on Acoustics, 2002
2000
Proceedings of the IEEE International Conference on Acoustics, 2000
Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
IEEE Signal Process. Lett., 1999
Speaker-dependent name dialing in a car environment with out-of-vocabulary rejection.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition.
IEEE Trans. Speech Audio Process., 1998
Assessing the importance of the segmentation probability in segment-based speech recognition.
Speech Commun., 1998
Environment normalization training and environment adaptation using mixture stochastic trajectory model.
Speech Commun., 1998
1997
Stochastic trajectory modeling and sentence searching for continuous speech recognition.
IEEE Trans. Speech Audio Process., 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
A unified maximum likelihood approach to acoustic mismatch compensation: application to noisy Lombard speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Comparative experiments of several adaptation approaches to noisy speech recognition using stochastic trajectory models.
Speech Commun., 1996
Estimation of mixtures of stochastic dynamic trajectories: application to continuous speech recognition.
Comput. Speech Lang., 1996
A study on continuous Chinese speech recognition based on stochastic trajectory models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Modelling long term variability information in mixture stochastic trajectory framework.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Stochastic trajectory models for speech recognition: an extension to modelling time correlation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995
1994
Proceedings of IAPR Workshop on Machine Vision Applications, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
Speech Commun., 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Duration of phones as function of utterance length and its use in automatic speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
1992
Minimization of speech alignment error by iterative transformation for speaker adaptation.
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1991
Signal-to-String Conversion Based on High Likelihood Regions Using Embedded Dynamic Programming.
IEEE Trans. Pattern Anal. Mach. Intell., 1991
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
Proceedings of the 1991 International Conference on Acoustics, 1991
Non-linear vector interpolation by neural network for phoneme identification in continuous speech.
Proceedings of the 1991 International Conference on Acoustics, 1991
Neural network coupled with IIR sequential adapter for phoneme recognition in continuous speech.
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
1988
Proceedings of the IEEE International Conference on Acoustics, 1988
1987
IEEE Trans. Acoust. Speech Signal Process., 1987
Proceedings of the European Conference on Speech Technology, 1987