Simon King
ORCID: 0000-0002-2694-2843
Affiliations:
- University of Edinburgh, Centre for Speech Technology Research, Scotland, UK
According to our database, Simon King authored at least 272 papers between 1996 and 2025.
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023.
Comput. Speech Lang., 2025
Comput. Speech Lang., March, 2024
Natural language guidance of high-fidelity text-to-speech with synthetic annotations.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Differentiable Grey-box Modelling of Phaser Effects using Frame-based Spectral Processing.
CoRR, 2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Spell4TTS: Acoustically-informed spellings for improving text-to-speech pronunciations.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the International Workshop on Cognitive AI 2023 co-located with the 3rd International Conference on Learning & Reasoning (IJCLR 2023), 2023
Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Combining conversational speech with read speech to improve prosody in Text-to-Speech synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Comparing acoustic and textual representations of previous linguistic context for improving Text-to-Speech.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0.
CoRR, 2020
Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Hider-Finder-Combiner: An Adversarial Architecture for General Speech Signal Modification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
An Unsupervised Method to Select a Speaker Subset from Large Multi-Speaker Speech Synthesis Datasets.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Testing the Limits of Representation Mixing for Pronunciation Correction in End-to-End Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2nd Conference on Conversational User Interfaces, 2020
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
A Comparison of Letters and Phones as Input to Sequence-to-Sequence Models for Speech Synthesis.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Using Pupil Dilation to Measure Cognitive Load When Listening to Text-to-Speech in Quiet and in Noise.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Investigating the Robustness of Sequence-to-Sequence Text-to-Speech Models to Imperfectly-Transcribed Training Data.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Speech Waveform Reconstruction Using Convolutional Neural Networks with Noise and Periodic Inputs.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
Exploring the robustness of features and enhancement on speech recognition systems in highly-reverberant real environments.
CoRR, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Learning Interpretable Control Dimensions for Speech Synthesis by Using External Data.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018
Using Eigenvoices and Nearest-Neighbors in HMM-Based Cross-Lingual Speaker Adaptation With Limited Data.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Locally Normalized Filter Banks Applied to Deep Neural-Network-Based Robust Speech Recognition.
IEEE Signal Process. Lett., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Comput. Speech Lang., 2016
Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training.
CoRR, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Median-based generation of synthetic speech durations using a non-parametric approach.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
The Use of Locally Normalized Cepstral Coefficients (LNCC) to Improve Speaker Recognition Accuracy in Highly Reverberant Rooms.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Blizzard Challenge 2016, Cupertino, CA, USA, September 16, 2016, 2016
EURASIP J. Adv. Signal Process., 2015
A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification.
Comput. Speech Lang., 2015
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015
A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Deep neural network context embeddings for model selection in rich-context HMM synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Robustness to additive noise of locally-normalized cepstral coefficients in speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
IEEE J. Sel. Top. Signal Process., 2014
Context-dependent acoustic modeling based on hidden maximum entropy model for statistical parametric speech synthesis.
EURASIP J. Audio Speech Music. Process., 2014
Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion.
Comput. Speech Lang., 2014
Comput. Speech Lang., 2014
Introduction to the Special Issue on The listening talker: context-dependent speech production and perception.
Comput. Speech Lang., 2014
The listening talker: A review of human and algorithmic context-induced modifications of speech.
Comput. Speech Lang., 2014
Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Development of a genre-dependent TTS system with cross-speaker speaking-style transplantation.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A comparison of open-source segmentation architectures for dealing with imperfect data from the media in speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Neural net word representations for phrase-break prediction without a part of speech tagger.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Voice source modelling using deep neural networks for statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2013
Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup.
J. Phonetics, 2013
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.
Comput. Speech Lang., 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Using adaptation to improve speech transcription alignment in noisy and reverberant environments.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Towards Personalised Synthesised Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013
The voice bank corpus: Design, collection and data analysis of a large regional accent speech database.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
TUNDRA: a multilingual corpus of found data for TTS research created with light supervision.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Lightly supervised discriminative training of grapheme models for improved sentence-level alignment of speech and text data.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013
Proceedings of the Blizzard Challenge 2013, 2013
ACM Trans. Inf. Syst., 2012
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping.
Speech Commun., 2012
Speech Commun., 2012
J. Comput. Sci. Technol., 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Using HMM-based Speech Synthesis to Reconstruct the Voice of Individuals with Degenerative Speech Disorders.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Mel cepstral coefficient modification based on the Glimpse Proportion measure for improving the intelligibility of HMM-generated synthetic speech in noise.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Using Bayesian Networks to find relevant context features for HMM-based speech synthesis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Cepstral analysis based on the glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012
IEEE Trans. Speech Audio Process., 2011
IEEE Signal Process. Lett., 2011
The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate.
Speech Commun., 2011
Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis.
Speech Commun., 2011
Unsupervised Continuous-Valued Word Features for Phrase-Break Prediction without a Part-of-Speech Tagger.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Can Objective Measures Predict the Intelligibility of Modified HMM-Based Synthetic Speech in Noise?
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise.
Proceedings of the IEEE International Conference on Acoustics, 2011
An analysis of machine translation and speech synthesis in speech-to-speech translation system.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011
Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility, 2011
Thousands of Voices for HMM-Based Speech Synthesis - Analysis and Application of TTS Systems Built on Various ASR Corpora.
IEEE Trans. Speech Audio Process., 2010
IEEE Trans. Speech Audio Process., 2010
Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech.
Speech Commun., 2010
IEEE J. Sel. Top. Signal Process., 2010
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Direct posterior confidence for out-of-vocabulary spoken term detection.
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
CRF-based stochastic pronunciation modeling for out-of-vocabulary spoken term detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A classifier-based target cost for unit selection speech synthesis trained on perceptual data.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Stochastic pronunciation modelling and soft match for out-of-vocabulary spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
Proceedings of the ACL 2010, 2010
IEEE Trans. Speech Audio Process., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A posterior probability-based system hybridisation and combination for spoken term detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Analysis of Unsupervised and Noise-Robust Speaker-Adaptive HMM-Based Speech Synthesis Systems toward a Unified ASR and TTS Framework.
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
Speech Commun., 2008
Proceedings of the First Workshop on Child, Computer and Interaction, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the Blizzard Challenge 2008, 2008
IEEE Trans. Speech Audio Process., 2007
Speech Commun., 2007
Pattern Recognit. Lett., 2007
Comput. Speech Lang., 2007
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Single speaker segmentation and inventory selection using dynamic time warping self organization and joint multigram mapping.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007
Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis.
IEEE Trans. Speech Audio Process., 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006, 2006
Inductive String Template-Based Learning of Spoken Language.
Proceedings of the Pattern Recognition in Information Systems, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Detection of Symbolic Gestural Events in Articulatory Data for Use in Structural Representations of Continuous Speech.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Subjective evaluation of join cost functions used in unit selection speech synthesis.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 17th International Conference on Pattern Recognition, 2004
J. Phonetics, 2003
Comput. Speech Lang., 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Estimation of voice source and vocal tract characteristics based on multi-frame analysis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Objective distance measures for spectral discontinuities in concatenative speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Comput. Speech Lang., 2000
An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 4th International Conference on Spoken Language Processing, 1996