IEEE ACM Trans. Audio Speech Lang. Process., 2019

Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Methods for Audio Classification from Lecture Discussion Recordings.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On the Use of Pitch Features for Disordered Speech Recognition.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The CUHK Dysarthric Speech Recognition Systems for English and Cantonese.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Recurrent Neural Network Language Model Training Using Natural Gradient.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Emotion Recognition Using Capsule Networks.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

CNN-RNN-CTC Based End-to-end Mispronunciation Detection and Diagnosis.

[DOI]

Wai-Kim Leung

Helen Meng

Proceedings of the IEEE International Conference on Acoustics, 2019

Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Code-switched TTS with Mix of Monolingual Recordings.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

The HCCL-CUHK System for the Voice Conversion Challenge 2018.

[DOI]

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.

[DOI]

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription.

[DOI]

William D. Marslen-Wilson

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Gaussian Process Neural Networks for Speech Recognition.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Feature Based Adaptation for Speaking Style Synthesis.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Discovery of an Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Drawing-Based Automatic Dementia Screening Using Gaussian Process Markov Chains.

[DOI]

Proceedings of the 51st Hawaii International Conference on System Sciences, 2018

2017

Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem.

[DOI]

PLoS Comput. Biol., 2017

Future Word Contexts in Neural Network Language Models.

[DOI]

CoRR, 2017

RNN-LDA Clustering for Feature Based DNN Adaptation.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-task learning of structured output layer bidirectional LSTMs for speech synthesis.

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Recurrent neural network language models for keyword search.

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multimodal learning using 3D audio-visual data for audio-visual speech recognition.

[DOI]

Proceedings of the 2017 International Conference on Asian Language Processing, 2017

2016

Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information.

[DOI]

Xurong Xie

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Convolutional neural network bottleneck features for bi-directional generalized variable parameter HMMs.

[DOI]

Proceedings of the IEEE International Conference on Information and Automation, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

CUED-RNNLM - An open-source toolkit for efficient training and evaluation of recurrent neural network language models.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Generalized variable parameter HMMs based acoustic-to-articulatory inversion.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Recurrent neural network language model adaptation for multi-genre broadcast speech recognition.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigations of low resource multi-accent Mandarin speech recognition.

[DOI]

Proceedings of the IEEE International Conference on Information and Automation, 2015

Paraphrastic recurrent neural network language models.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recurrent neural network language model training with noise contrastive estimation for speech recognition.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving the training and evaluation efficiency of recurrent neural network language models.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cambridge University transcription systems for the multi-genre broadcast challenge.

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the Cambridge University alignment systems for the multi-genre broadcast challenge.

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Investigation of back-off based interpolation between recurrent neural network and n-gram language models.

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Paraphrastic language models.

[DOI]

Comput. Speech Lang., 2014

Deep neural network bottleneck features for generalized variable parameter HMMs.

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch.

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Efficient lattice rescoring using recurrent neural network language models.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Paraphrastic neural network language models.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Language model cross adaptation for LVCSR system combination.

[DOI]

Comput. Speech Lang., 2013

Use of contexts in language model interpolation and adaptation.

[DOI]

Mark John Francis Gales

Comput. Speech Lang., 2013

Improving lightly supervised training for broadcast transcription.

[DOI]

Matthew Stephen Seigel

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Cross-domain paraphrasing for improving language modelling using out-of-domain data.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Feature space generalized variable parameter HMMs for noise robust recognition.

[DOI]

Yang Li

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic Transcription of Multi-genre Media Archives.

[DOI]

Matthew Stephen Seigel

Pawel Swietojanski

Proceedings of the First Workshop on Speech, 2013

Paraphrastic language models and combination with neural network language models.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic model complexity control for generalized variable parameter HMMs.

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Transcription of multi-genre media archives using out-of-domain data.

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Structured modeling based on generalized variable parameter HMMs and speaker adaptation.

[DOI]

Yang Li

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011

A flexible framework for HMM based noise robust speech recognition using generalized parametric space polynomial regression.

[DOI]

Ning Cheng

Sci. China Inf. Sci., 2011

Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems.

[DOI]

Frank Diehl

Mark John Francis Gales

Marcus Tomalin

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.

[DOI]

Ning Cheng

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Investigation of acoustic units for LVCSR systems.

[DOI]

Mark John Francis Gales

Jim L. Hieronymus

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Improved neural network based language modelling and adaptation.

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Language model combination and adaptation using weighted finite state transducers.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Exploiting Chinese character models to improve speech recognition performance.

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Context dependent language model adaptation.

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions.

[DOI]

IEEE Trans. Speech Audio Process., 2007

Improving Speech Transcription for Mandarin-English Translation.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition System Combination for Machine Translation.

[DOI]

Abdelkhalek Messaoudi

Proceedings of the IEEE International Conference on Acoustics, 2007

Discriminative language model adaptation for Mandarin broadcast speech transcription and translation.

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Corrections to "Automatic Transcription of Conversational Telephone Speech".

[DOI]

IEEE Trans. Speech Audio Process., 2006

The CU-HTK Mandarin Broadcast News Transcription System.

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Automatic transcription of conversational telephone speech.

[DOI]

IEEE Trans. Speech Audio Process., 2005

Investigation of Acoustic Modeling Techniques for LVCSR Systems.

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System.

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Model complexity control and compression using discriminative growth functions.

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Development of the 2003 CU-HTK conversational telephone speech transcription system.

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Automatic complexity control for HLDA systems.

[DOI]