Thomas Hain
Orcid: 0000-0003-0939-3464
Affiliations:
- University of Sheffield, England, UK
According to our database, Thomas Hain authored at least 204 papers between 1994 and 2024.
Bibliography
2024
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks.
CoRR, 2024
Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition.
CoRR, 2024
CoRR, 2024
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.
Proceedings of the IEEE International Conference on Acoustics, 2024
SCORE: Self-Supervised Correspondence Fine-Tuning for Improved Content Representations.
Proceedings of the IEEE International Conference on Acoustics, 2024
Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024
Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training.
Proceedings of the IEEE International Conference on Acoustics, 2024
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement.
Proceedings of the 32nd European Signal Processing Conference, 2024
Character Error Rate Estimation for Automatic Speech Recognition of Short Utterances.
Proceedings of the 32nd European Signal Processing Conference, 2024
Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis.
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text.
CoRR, 2023
Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations.
CoRR, 2023
Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition.
CoRR, 2023
The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023
Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023
Towards Domain Generalisation in ASR with Elitist Sampling and Ensemble Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
Proceedings of the 31st European Signal Processing Conference, 2023
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Simulation of Teacher-Learner Interaction in English Language Pronunciation Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
MUST: A Multilingual Student-Teacher Learning Approach for Low-Resource Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Comput. Speech Lang., 2022
Dynamic Kernels and Channel Attention with Multi-Layer Embedding Aggregation for Speaker Verification.
CoRR, 2022
Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition.
CoRR, 2022
Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Investigating the Impact of Crosslingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation.
Proceedings of the 30th European Signal Processing Conference, 2022
Insights of Neural Representations in Multi-Banded and Multi-Channel Convolutional Transformers for End-to-End ASR.
Proceedings of the 30th European Signal Processing Conference, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
2021
H-VECTORS: Improving the robustness in utterance-level speaker embeddings using a hierarchical attention model.
Neural Networks, 2021
Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection.
J. Big Data, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the Statistical Language and Speech Processing, 2021
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Towards Low-Resource Stargan Voice Conversion Using Weight Adaptive Instance Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2021
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
2020
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model.
CoRR, 2020
Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network.
CoRR, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness.
Proceedings of the 28th European Signal Processing Conference, 2020
2019
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
System-independent ASR error detection and classification using Recurrent Neural Network.
Comput. Speech Lang., 2019
CoRR, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Unsupervised Adaptation of Acoustic Models for ASR Using Utterance-Level Embeddings from Squeeze and Excitation Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Multim. Tools Appl., 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Towards a generic approach for automatic speech recognition error detection and classification.
Proceedings of the 4th International Conference on Advanced Technologies for Signal and Image Processing, 2018
2017
Acoustic adaptation to dynamic background conditions with asynchronous transformations.
Comput. Speech Lang., 2017
Comput. Speech Lang., 2017
Analysing acoustic model changes for active learning in automatic speech recognition.
Proceedings of the International Conference on Systems, Signals and Image Processing, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Semi-Supervised Adaptation of RNNLMs by Fine-Tuning with Domain-Specific Auxiliary Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Statistical normalisation of phase-based feature representation for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Using phone features to improve dialogue state tracking generalisation to unseen states.
Proceedings of the SIGDIAL 2016 Conference, 2016
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 13th IEEE/ACS International Conference of Computer Systems and Applications, 2016
2015
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition.
Proceedings of the Statistical Language and Speech Processing, 2015
Proceedings of the SIGDIAL 2015 Conference, 2015
A study on the stability and effectiveness of features in quality estimation for spoken language translation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Unsupervised domain discovery using latent Dirichlet allocation for acoustic modelling in speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Automatic assessment of English learner pronunciation using discriminative classifiers.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 8th International Conference on Educational Data Mining, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system.
Comput. Speech Lang., 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2014
Using neural network front-ends on far field multiple microphones based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013
Asynchronous factorisation of speaker and background with feature transforms in speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the First Workshop on Speech, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web (Impact of the level of supervision on Web-based language model domain adaptation) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
Resource development and experiments in automatic South African broadcast news transcription.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
An alignment matching method to explore pseudosyllable properties across different corpora.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Application of SVM-based correctness predictions to unsupervised discriminative speaker adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
IEEE Trans. Speech Audio Process., 2010
IEEE Signal Process. Lett., 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008
Discriminative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the Multimodal Technologies for Perception of Humans, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Proceedings of the Machine Learning for Multimodal Interaction, 2006
Proceedings of the Machine Learning for Multimodal Interaction, 2006
Proceedings of the Machine Learning for Multimodal Interaction, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
The segmentation of multi-channel meeting recordings for automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
IEEE Trans. Speech Audio Process., 2005
Speech Commun., 2005
Proceedings of the Machine Learning for Multimodal Interaction, 2005
Proceedings of the Machine Learning for Multimodal Interaction, 2005
Proceedings of the Machine Learning for Multimodal Interaction, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
A Fast, Practical Algorithm for the Trapezoidation of Simple Polygons.
Proceedings of The 2005 International Conference on Imaging Science, 2005
Fast Floating Point Square Root.
Proceedings of the 2005 International Conference on Algorithmic Mathematics and Computer Science, 2005
2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2001
New features in the CU-HTK system for transcription of conversational telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994