Stefan Braun

Xiaodan Zhuang

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition.

[DOI]

Stefan Braun

Dogan Can

Thiago Fraga da Silva

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation.

[DOI]

Thien Nguyen

Nathalie Tran

Liuhui Deng

Thiago Fraga da Silva

CoRR, 2022

2020

Adaptation Algorithms for Speech Recognition: An Overview.

[DOI]

CoRR, 2020

Building Proactive Voice Assistants: When and How (not) to Interact.

[DOI]

CoRR, 2020

Static Visual Spatial Priors for DoA Estimation.

[DOI]

Ondrej Miksik

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multi-Task Self-Supervised Learning for Robust Speech Recognition.

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

SLURP: A Spoken Language Understanding Resource Package.

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019

Benchmarking Natural Language Understanding Services for Building Conversational Agents.

[DOI]

Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Teacher-student Training for Acoustic Event Detection Using Audioset.

[DOI]

Ruibo Shi

Raymond W. M. Ng

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Teacher-Student Training for Text-Independent Speaker Recognition.

[DOI]

Raymond W. M. Ng

Xuechen Liu

P. G. Keerthana Gopalakrishnan

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Modal Sequence Fusion via Recursive Attention for Emotion Recognition.

[DOI]

Rory Beard

Ritwik Das

Raymond W. M. Ng

Luka Eerens

Ondrej Miksik

Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

2017

Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Distant Speech Recognition Experiments Using the AMI Corpus.

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Learning representations for speech recognition using artificial neural networks.

[DOI]

PhD thesis, 2016

Differentiable Pooling for Unsupervised Acoustic Model Adaptation.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation.

[DOI]

Jinyu Li

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Unsupervised Adaptation of Recurrent Neural Network Language Models.

[DOI]

Siva Reddy Gangireddy

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

SAT-LHUC: Speaker adaptive training for learning hidden unit contributions.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

A study of speaker adaptation for DNN-based speech synthesis.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Structured output layer with auxiliary targets for context-dependent acoustic modelling.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Differentiable pooling for unsupervised speaker adaptation.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Convolutional Neural Networks for Distant Speech Recognition.

[DOI]

IEEE Signal Process. Lett., 2014

Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models.

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The UEDIN ASR systems for the IWSLT 2014 evaluation.

[DOI]

Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Investigation of maxout networks for speech recognition.

[DOI]

Jinyu Li

Jui-Ting Huang

Proceedings of the IEEE International Conference on Acoustics, 2014

Neural networks for distant speech recognition.

[DOI]

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

2013

Automatic Transcription of Multi-genre Media Archives.

[DOI]

Matthew Stephen Seigel

Philip C. Woodland

Proceedings of the First Workshop on Speech, 2013

Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A lecture transcription system combining neural network acoustic and language models.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Revisiting hybrid and GMM-HMM system combination techniques.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Multilingual training of deep neural networks.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-level adaptive networks in tandem and hybrid ASR systems.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Hybrid acoustic models for distant and multichannel large vocabulary speech recognition.

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR.

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Transcription of multi-genre media archives using out-of-domain data.

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

The UEDIN systems for the IWSLT 2012 evaluation.

[DOI]

Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

2011

Automatic Selection of Pareto-Optimal Topologies of Hidden Markov Models Using Multicriteria Evolutionary Algorithms.

[DOI]