Peter Bell

Orcid: 0000-0002-9597-9615

Affiliations:
  • University of Edinburgh, Centre for Speech Technology Research, UK


According to our database, Peter Bell authored at least 135 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Comparison and analysis of new curriculum criteria for end-to-end ASR.
Speech Commun., 2024

Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR.
CoRR, 2024

Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling.
CoRR, 2024

Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction.
CoRR, 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition.
CoRR, 2024

TTSDS - Text-to-Speech Distribution Score.
CoRR, 2024

Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques.
CoRR, 2024

Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs.
CoRR, 2024

Explainable Attribute-Based Speaker Verification.
CoRR, 2024

Crossmodal ASR Error Correction with Discrete Speech Units.
CoRR, 2024

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots.
CoRR, 2024

1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Regarding Topology and Adaptability in Differentiable WFST-Based E2E ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding.
Proceedings of the IEEE International Conference on Acoustics, 2024

Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Bootstrap Predictive Coding: Investigating a Non-Contrastive Self-Supervised Learning Approach.
Proceedings of the IEEE International Conference on Acoustics, 2024

Analyzing the Role of Part-of-Speech in Code-Switching: A Corpus-Based Study.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

UnMute Toolkit: Speech Interactions Designed With Minoritised Language Speakers.
Proceedings of the ACM Conversational User Interfaces 2024, 2024

Cultivating Spoken Language Technologies for Unwritten Languages.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023
Multi-Stream Acoustic Modelling Using Raw Real and Imaginary Parts of the Fourier Transform.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Phonetic Error Analysis Beyond Phone Error Rate.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Cross-Attention is Not Enough: Incongruity-Aware Multimodal Sentiment Analysis and Emotion Recognition.
CoRR, 2023

Regarding Topology and Variant Frame Rates for Differentiable WFST-based End-to-End ASR.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Quantifying the perceptual value of lexical and non-lexical channels in speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating and reducing the distance between synthetic and real speech distributions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transfer Learning for Personality Perception via Speech Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Unsupervised Code-switched Text Generation from Parallel Text.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Capturing Formality in Speech Across Domains and Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Explanations for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multimodal Dyadic Impression Recognition via Listener Adaptive Cross-Domain Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Do dialogue representations align with perception? An empirical study.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
Exploration of a Self-Supervised Speech Model: A Study on Emotional Corpora.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

AVSE Challenge: Audio-Visual Speech Enhancement Challenge.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Investigating perception of spoken dialogue acceptability through surprisal.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating the contribution of speaker attributes to speaker separability using disentangled speaker representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Investigating Sequence-Level Normalisation For CTC-Like End-to-End ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022

Fusing ASR Outputs in Joint Training for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Code-switched ASR with Linguistic Information.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
Mask-combine Decoding and Classification Approach for Punctuation Prediction with real-time Inference Constraints.
CoRR, 2021

On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Stochastic Attention Head Removal: A Simple and Effective Method for Improving Transformer Based ASR Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

On the Learning Dynamics of Semi-Supervised Training for ASR.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

It's Not What You Said, it's How You Said it: Discriminative Perception of Speech as a Multichannel Communication System.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Leveraging Speaker Attribute Information Using Multi Task Learning for Speaker Verification and Diarization.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech Acoustic Modelling Using Raw Source and Filter Components.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The CSTR System for Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers.
Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Acoustic Modelling from Raw Phase Spectrum.
Proceedings of the IEEE International Conference on Acoustics, 2021

Segmenting Subtitles for Correcting ASR Segmentation Errors.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Leveraging Linguistic Knowledge for Accent Robustness of End-to-End Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers.
CoRR, 2020

Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines.
CoRR, 2020

Adaptation Algorithms for Speech Recognition: An Overview.
CoRR, 2020

When Can Self-Attention Be Replaced by Feed Forward Layers?
CoRR, 2020

DropClass and DropAdapt: Dropping classes for deep speaker representation learning.
CoRR, 2020

Dropping Classes for Deep Speaker Representation Learning.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines.
Proceedings of the workshop on Cross-Language Search and Summarization of Text and Speech, 2020

A Deep 2D Convolutional Network for Waveform-Based Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Raw Sign and Magnitude Spectra for Multi-Head Acoustic Modelling.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

On the Robustness and Training Dynamics of Raw Waveform Models.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Scattering Power Spectrum Features for Robust Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Neural Network Driven Binaural Audio Visual Speech Separation.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Multi-Scale Octave Convolutions for Robust Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Channel Adversarial Training for Speaker Verification and Diarization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models.
CoRR, 2019

Trainable Dynamic Subsampling for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On Learning Interpretable CNNs with Parametric Modulated Kernel-Based Filters.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Lattice-Based Lightly-Supervised Acoustic Model Training.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Untranscribed Web Audio for Low Resource Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Windowed Attention Mechanisms for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

On the Usefulness of Statistical Normalisation of Bottleneck Features for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Embeddings for DNN Speaker Adaptive Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker Adaptive Training Using Model Agnostic Meta-Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Acoustic Model Adaptation from Raw Waveforms with Sincnet.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Few-shot learning with attention-based sequence-to-sequence models.
CoRR, 2018

Analyzing Deep CNN-Based Utterance Embeddings for Acoustic Model Adaptation.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Learning to Adapt: A Meta-learning Approach for Speaker Adaptation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Multitask Learning of Context-Dependent Targets in Deep Neural Network Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Hierarchical Recurrent Neural Network for Story Segmentation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Factorised Representations for Neural Network Adaptation to Diverse Acoustic Environments.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A System for Real Time Collaborative Transcription Correction.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Simplifying very deep convolutional neural network architectures for robust speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

WERD: Using social text spelling variants for evaluating dialectal speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
ALISA: An automatic lightly supervised speech segmentation and alignment tool.
Comput. Speech Lang., 2016

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

The MGB-2 challenge: Arabic multi-dialect broadcast media recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Unsupervised Adaptation of Recurrent Neural Network Language Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improving Children's Speech Recognition Through Out-of-Domain Data Augmentation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Dialect Detection in Arabic Broadcast Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Automatic Dialect Detection in Arabic Broadcast Speech.
CoRR, 2015

Structured output layer with auxiliary targets for context-dependent acoustic modelling.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards automatic detection of reported speech in dialogue using prosodic cues.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Complementary tasks for context-dependent deep neural network acoustic models.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A system for automatic broadcast news summarisation, geolocation and translation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Regularization of context-dependent deep neural networks with context-independent multi-task training.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multi-reference WER for evaluating ASR for languages with no orthographic rules.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

A system for automatic alignment of broadcast media captions using weighted finite-state transducers.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
The UEDIN ASR systems for the IWSLT 2014 evaluation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

A semi-Markov model for speech segmentation with an utterance-break prior.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Cross-lingual adaptation with multi-task adaptive networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Using adaptation to improve speech transcription alignment in noisy and reverberant environments.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Description of the UEDIN system for German ASR.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

The UEDIN English ASR system for the IWSLT 2013 evaluation.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic Transcription of Multi-genre Media Archives.
Proceedings of the First Workshop on Speech, 2013

Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project.
Proceedings of the First Workshop on Speech, 2013

A lecture transcription system combining neural network acoustic and language models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-level adaptive networks in tandem and hybrid ASR systems.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A grapheme-based method for automatic alignment of speech and text data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Transcription of multi-genre media archives using out-of-domain data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

The UEDIN systems for the IWSLT 2012 evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

A tutorial dialogue system with unrestricted spoken input.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Designing a spoken language interface for a tutorial dialogue system.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Evaluating language understanding accuracy with respect to objective outcomes in a dialogue system.
Proceedings of the EACL 2012, 2012

2011
Beetle II: an adaptable tutorial dialogue system.
Proceedings of the SIGDIAL 2011 Conference, 2011

Adaptive Intelligent Tutorial Dialogue in the BEETLE II System.
Proceedings of the Artificial Intelligence in Education - 15th International Conference, 2011

2010
Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Term-dependent confidence for out-of-vocabulary term detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Diagonal priors for full covariance speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Covariance updates for discriminative training by constrained line search.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A shrinkage estimator for speech recognition with full covariance HMMs.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Sparse Gaussian graphical models for speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

