2024
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval.
CoRR, 2024
2023
Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
2020
Adaptation Algorithms for Speech Recognition: An Overview.
CoRR, 2020
Building Proactive Voice Assistants: When and How (not) to Interact.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2020
Static Visual Spatial Priors for DoA Estimation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Multi-Task Self-Supervised Learning for Robust Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
SLURP: A Spoken Language Understanding Resource Package.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
2019
Benchmarking Natural Language Understanding Services for Building Conversational Agents.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019
Teacher-student Training for Acoustic Event Detection Using Audioset.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Teacher-Student Training for Text-Independent Speaker Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Multi-Modal Sequence Fusion via Recursive Attention for Emotion Recognition.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018
2017
Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Distant Speech Recognition Experiments Using the AMI Corpus.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Learning representations for speech recognition using artificial neural networks.
PhD thesis, 2016
Differentiable Pooling for Unsupervised Acoustic Model Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Unsupervised Adaptation of Recurrent Neural Network Language Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
SAT-LHUC: Speaker adaptive training for learning hidden unit contributions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
A study of speaker adaptation for DNN-based speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Structured output layer with auxiliary targets for context-dependent acoustic modelling.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Differentiable pooling for unsupervised speaker adaptation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Convolutional Neural Networks for Distant Speech Recognition.
IEEE Signal Process. Lett., 2014
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
The UEDIN ASR systems for the IWSLT 2014 evaluation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014
Investigation of maxout networks for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Neural networks for distant speech recognition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
2013
Automatic Transcription of Multi-genre Media Archives.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the First Workshop on Speech, 2013
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A lecture transcription system combining neural network acoustic and language models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Revisiting hybrid and GMM-HMM system combination techniques.
Proceedings of the IEEE International Conference on Acoustics, 2013
Multilingual training of deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013
Multi-level adaptive networks in tandem and hybrid ASR systems.
Proceedings of the IEEE International Conference on Acoustics, 2013
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Transcription of multi-genre media archives using out-of-domain data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
The UEDIN systems for the IWSLT 2012 evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012
2011
Automatic Selection of Pareto-Optimal Topologies of Hidden Markov Models Using Multicriteria Evolutionary Algorithms.
Proceedings of the Applications of Evolutionary Computation, 2011
2007
Comparison of HMM and DTW methods in automatic recognition of pathological phoneme pronunciation.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007