Thomas Hain

Orcid: 0000-0003-0939-3464

Affiliations:
  • University of Sheffield, England, UK


According to our database1, Thomas Hain authored at least 204 papers between 1994 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks.
CoRR, 2024

Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition.
CoRR, 2024

EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark.
CoRR, 2024

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Combining Conformer and Dual-Path-Transformer Networks for Single Channel Noisy Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users Using Intermediate ASR Features and Human Memory Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

SCORE: Self-Supervised Correspondence Fine-Tuning for Improved Content Representations.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training.
Proceedings of the IEEE International Conference on Acoustics, 2024

Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement.
Proceedings of the 32nd European Signal Processing Conference, 2024

Character Error Rate Estimation for Automatic Speech Recognition of Short Utterances.
Proceedings of the 32nd European Signal Processing Conference, 2024

Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis.
Proceedings of the 32nd European Signal Processing Conference, 2024

Hallucination in Perceptual Metric-Driven Speech Enhancement Networks.
Proceedings of the 32nd European Signal Processing Conference, 2024

Methods of Automatic Matrix Language Determination for Code-Switched Speech.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Automatic Speech Recognition System-Independent Word Error Rate Estimation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text.
CoRR, 2023

Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations.
CoRR, 2023

Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition.
CoRR, 2023

The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Exploring Speech Representations for Proficiency Assessment in Language Learning.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Domain Adaptive Self-supervised Training of Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Perceive and Predict: Self-Supervised Speech Representation Based Loss Functions for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards Domain Generalisation in ASR with Elitist Sampling and Ensemble Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adapting Pretrained Models for Adult to Child Voice Conversion.
Proceedings of the 31st European Signal Processing Conference, 2023

On Data Sampling Strategies for Training Neural Network Speech Separation Models.
Proceedings of the 31st European Signal Processing Conference, 2023

Probing Statistical Representations for End-to-End ASR.
Proceedings of the 31st European Signal Processing Conference, 2023

On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Deriving Translational Acoustic Sub-Word Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Simulation of Teacher-Learner Interaction in English Language Pronunciation Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

MUST: A Multilingual Student-Teacher Learning Approach for Low-Resource Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Automatic detection of behavioural codes in team interactions.
Comput. Speech Lang., 2022

Dynamic Kernels and Channel Attention with Multi-Layer Embedding Aggregation for Speaker Verification.
CoRR, 2022

Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition.
CoRR, 2022

Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating the Impact of Crosslingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Model for Assessor Bias in Automatic Pronunciation Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2022

Unsupervised Data Selection for Speech Recognition with Contrastive Loss Ratios.
Proceedings of the IEEE International Conference on Acoustics, 2022

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation.
Proceedings of the 30th European Signal Processing Conference, 2022

Insights of Neural Representations in Multi-Banded and Multi-Channel Convolutional Transformers for End-to-End ASR.
Proceedings of the 30th European Signal Processing Conference, 2022

MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
H-VECTORS: Improving the robustness in utterance-level speaker embeddings using a hierarchical attention model.
Neural Networks, 2021

Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection.
J. Big Data, 2021

Supervised Speaker Embedding De-Mixing in Two-Speaker Environment.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Contextual Joint Factor Acoustic Embeddings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Use of Speaker Metadata for Improving Automatic Pronunciation Assessment.
Proceedings of the Statistical Language and Speech Processing, 2021

WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Insights on Neural Representations for End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Low-Resource Stargan Voice Conversion Using Weight Adaptive Instance Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Networks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Attention Based Model for Segmental Pronunciation Error Detection.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Uncertainty Aware Review Hallucination for Science Article Classification.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model.
CoRR, 2020

Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network.
CoRR, 2020

Robust Speaker Recognition Using Speech Enhancement And Attention Model.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker Re-Identification with Speaker Dependent Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multilingual Speech Recognition Using Language-Specific Phoneme Recognition as Auxiliary Task for Indian Languages.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Empirical Interpretation of Speech Emotion Perception with Attention Based Model for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Acoustic Unit Representation Learning for Voice Conversion Using WaveNet Auto-Encoders.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

H-Vectors: Utterance-Level Speaker Embedding Using a Hierarchical Attention Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

System-independent ASR error detection and classification using Recurrent Neural Network.
Comput. Speech Lang., 2019

Contextual Joint Factor Acoustic Embeddings.
CoRR, 2019

Improving Robustness In Speaker Identification Using A Two-Stage Attention Model.
CoRR, 2019

Learning Temporal Clusters Using Capsule Routing for Speech Emotion Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Adaptation of Acoustic Models for ASR Using Utterance-Level Embeddings from Squeeze and Excitation Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A Cross-Corpus Study on Speech Emotion Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Spatio-Temporal Context Modelling for Speech Emotion Classification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Lightly supervised alignment of subtitles on multi-genre broadcasts.
Multim. Tools Appl., 2018

Improving ASR Error Detection with RNNLM Adaptation.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improved Acoustic Modelling for Automatic Literacy Assessment of Children.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

On the Usefulness of the Speech Phase Spectrum for Pitch Extraction.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploring the Use of Group Delay for Generalised VTS Based Noise Compensation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards a generic approach for automatic speech recognition error detection and classification.
Proceedings of the 4th International Conference on Advanced Technologies for Signal and Image Processing, 2018

2017
Acoustic adaptation to dynamic background conditions with asynchronous transformations.
Comput. Speech Lang., 2017

Unsupervised crosslingual adaptation of tokenisers for spoken language recognition.
Comput. Speech Lang., 2017

Analysing acoustic model changes for active learning in automatic speech recognition.
Proceedings of the International Conference on Systems, Signals and Image Processing, 2017

Robust Source-Filter Separation of Speech Signal in the Phase Domain.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Semi-Supervised Adaptation of RNNLMs by Fine-Tuning with Domain-Specific Auxiliary Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Shefce: A Cantonese-English bilingual speech corpus for pronunciation assessment.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

DNN approach to speaker diarisation using speaker channels.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Statistical normalisation of phase-based feature representation for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Exploring the use of acoustic embeddings in neural machine translation.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Using phone features to improve dialogue state tracking generalisation to unseen states.
Proceedings of the SIGDIAL 2016 Conference, 2016

The Sheffield language recognition system in NIST LRE 2015.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The OpenCourseWare Metadiscourse (OCWMD) Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Error Correction in Lightly Supervised Alignment of Broadcast Subtitles.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining Weak Tokenisers for Phonotactic Language Recognition in a Resource-Constrained Setting.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

DNN-Based Speaker Clustering for Speaker Diarisation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The Sheffield Wargame Corpus - Day Two and Day Three.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

webASR 2 - Improved Cloud Based Speech Technology.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Genre and Show Identification of Broadcast Media.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improving Generalisation to New Speakers in Spoken Dialogue State Tracking.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Colloquialising Modern Standard Arabic Text for Improved Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Groupwise learning for ASR k-best list reranking in spoken language translation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Segment-oriented evaluation of speaker diarisation performance.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Automatic speech recognition errors detection using supervised learning techniques.
Proceedings of the 13th IEEE/ACS International Conference of Computer Systems and Applications, 2016

2015
The USFD Spoken Language Translation System for IWSLT 2014.
CoRR, 2015

Annotating meta-discourse in academic lectures from different disciplines.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition.
Proceedings of the Statistical Language and Speech Processing, 2015

Knowledge transfer between speakers for personalised dialogue management.
Proceedings of the SIGDIAL 2015 Conference, 2015

A study on the stability and effectiveness of features in quality estimation for spoken language translation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Source-filter separation of speech signal in the phase domain.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Noise-matched training of CRF based sentence end detection models.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Unsupervised domain discovery using latent dirichlet allocation for acoustic modelling in speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Data-selective transfer learning for multi-domain speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Automatic assessment of English learner pronunciation using discriminative classifiers.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Quality estimation for asr k-best list rescoring in spoken language translation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An investigation into speaker informed DNN front-end for LVCSR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Using Topic Segmentation Models for the Automatic Organisation of MOOCs resources.
Proceedings of the 8th International Conference on Educational Data Mining, 2015

The 2015 sheffield system for transcription of Multi-Genre Broadcast media.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The 2015 sheffield system for longitudinal diarisation of broadcast media.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Latent Dirichlet Allocation based organisation of broadcast media archives for deep neural network adaptation.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Capitalising on North American speech resources for the development of a South African English large vocabulary speech recognition system.
Comput. Speech Lang., 2014

Semi-supervised DNN training in meeting recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Background-tracking acoustic features for genre identification of broadcast shows.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The USFD SLT system for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Multi-pass sentence-end detection of lecture speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Extending Limabeam with discrimination and coarse gradients.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker dependent bottleneck layer training for speaker adaptation in automatic speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Adaptive speech recognition and dialogue management for users with speech disorders.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Using contextual information in joint factor eigenspace MLLR for speech recognition in diverse scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2014

Using neural network front-ends on far field multiple microphones based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Asynchronous factorisation of speaker and background with feature transforms in speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic Transcription of Multi-genre Media Archives.
Proceedings of the First Workshop on Speech, 2013

The sheffield wargames corpus.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Learning speaker-specific pronunciations of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Adaptation of lecture speech recognition system with machine translation output.
Proceedings of the IEEE International Conference on Acoustics, 2013

Lightly supervised learning from a damaged natural speech corpus.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Transcribing Meetings With the AMIDA Systems.
IEEE Trans. Speech Audio Process., 2012

Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation.
IEEE Trans. Speech Audio Process., 2012

Impact du degré de supervision sur l'adaptation à un domaine d'un modèle de langage à partir du Web (Impact of the level of supervision on Web-based language model domain adaptation) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Resource development and experiments in automatic south african broadcast news transcription.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Automatic transcription of academic lectures from diverse disciplines.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

An alignment matching method to explore pseudosyllable properties across different corpora.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Supervised and unsupervised Web-based language model domain adaptation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A comparative study of adaptive, automatic recognition of disordered speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Application of SVM-based correctness predictions to unsupervised discriminative speaker adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Making an Automatic Speech Recognition Service Freely Available on the Web.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Web-Based Automatic Speech Recognition Service - webASR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Extending Audio Notetaker to Browse WebASR Transcriptions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Analysis of Automatic Speech Recognition with Multiple Microphones.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Cross-Language Phone Recognition when the Target Language Phoneme Inventory is not Known.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Investigation in Speech Recognition for Colloquial Arabic.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Error Approximation and Minimum Phone Error Acoustic Model Estimation.
IEEE Trans. Speech Audio Process., 2010

Automatic Optimization of Speech Decoder Parameters.
IEEE Signal Process. Lett., 2010

The AMIDA 2009 meeting transcription system.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Real-time ASR from meetings.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Bob: A lexicon and pronunciation dictionary generator.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Discrimininative training of narrow band - wide band adapted systems for meeting recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Automatic speech recognition for scientific purposes - webASR.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Application of CMLLR in narrow band wide band adapted systems.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Temporal masking for unsupervised minimum Bayes risk speaker adaptation.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The AMI System for the Transcription of Speech in Meetings.
Proceedings of the IEEE International Conference on Acoustics, 2007

The 2007 AMI(DA) System for Meeting Transcription.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Recognition and understanding of meetings the AMI and AMIDA projects.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Corrections to "Automatic Transcription of Conversational Telephone Speech".
IEEE Trans. Speech Audio Process., 2006

Juicer: A Weighted Finite-State Transducer Speech Decoder.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The AMI Meeting Transcription System: Progress and Performance.
Proceedings of the Machine Learning for Multimodal Interaction, 2006


Automatic speech recognition experiments with articulatory data.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

The segmentation of multi-channel meeting recordings for automatic speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Strategies for Language Model Web-Data Collection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Automatic transcription of conversational telephone speech.
IEEE Trans. Speech Audio Process., 2005

Implicit modelling of pronunciation variation in automatic speech recognition.
Speech Commun., 2005

The Development of the AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

The 2005 AMI System for the Transcription of Speech in Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2005


Transcription of conference room meetings: an investigation.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Applying vocal tract length normalization to meeting recordings.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A Fast, Practical Algorithm for the Trapezoidation of Simple Polygons.
Proceedings of The 2005 International Conference on Imaging Science, 2005

Fast Floating Point Square Root.
Proceedings of the 2005 International Conference on Algorithmic Mathematics and Computer Science, 2005

2004
Using VTLN for broadcast news transcription.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Development of the 2003 CU-HTK conversational telephone speech transcription system.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2001
New features in the CU-HTK system for transcription of conversational telephone speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Modelling sub-phone insertions and deletions in continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Improvements in accuracy and speed in the HTK broadcast news transcription system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Dynamic HMM selection for continuous speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

The 1998 HTK system for transcription of conversational telephone speech.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Segmentation and classification of broadcast news audio.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Experiments in broadcast news transcription.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1994
On the convergence of fractal transforms.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994


  Loading...