Thomas Drugman
Orcid: 0000-0002-1491-7878
According to our database, Thomas Drugman authored at least 111 papers between 2007 and 2024.
Bibliography
2024
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data.
CoRR, 2024
2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
eCat: An End-to-End Model for Multi-Speaker TTS & Many-to-Many Fine-Grained Prosody Transfer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Speech Commun., 2022
CoRR, 2022
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
CopyCat2: A Single Model for Multi-Speaker TTS and Many-to-Many Fine-Grained Prosody Transfer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Weakly-Supervised Word-Level Pronunciation Error Detection in Non-Native English Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Dynamic Prosody Generation for Speech Synthesis Using Linguistics-Driven Acoustic Embedding Selection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Interpretable Deep Learning Model for the Detection and Reconstruction of Dysarthric Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Parameter Generation Algorithms for Text-To-Speech Synthesis with Recurrent Neural Networks.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
2017
Phrase Break Prediction for Long-Form Reading TTS: Exploiting Text Structure Information.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Proceedings of the Recent Advances in Nonlinear Speech Processing, 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE Signal Process. Lett., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Active and Semi-Supervised Learning in ASR: Benefits on the Acoustic and Language Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Comput. Speech Lang., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Fast Inter-Harmonic Reconstruction for Spectral Envelope Estimation in High-Pitched Voices.
IEEE Signal Process. Lett., 2014
IEEE Signal Process. Lett., 2014
IEEE J. Sel. Top. Signal Process., 2014
Neurocomputing, 2014
Context-dependent acoustic modeling based on hidden maximum entropy model for statistical parametric speech synthesis.
EURASIP J. Audio Speech Music. Process., 2014
Comput. Speech Lang., 2014
Comput. Speech Lang., 2014
Using mutual information in supervised temporal event detection: Application to cough detection.
Biomed. Signal Process. Control., 2014
Speech synthesis in various communicative situations: impact of pronunciation variations.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE J. Biomed. Health Informatics, 2013
IEEE Signal Process. Lett., 2013
HMM-based speech synthesis of live sports commentaries: integration of a two-layer prosody annotation.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
On the Importance of Pre-emphasis and Window Shape in Phase-Based Speech Recognition.
Proceedings of the Advances in Nonlinear Speech Processing - 6th International Conference, 2013
Proceedings of the Advances in Nonlinear Speech Processing - 6th International Conference, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
A quantitative comparison of glottal closure instant estimation algorithms on a large variety of singing sounds.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
A comparative study of pitch extraction algorithms on a large variety of singing sounds.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
Comput. Speech Lang., 2012
Automatic Phone Alignment - A Comparison between Speaker-Independent Models and Models Trained on the Corpus to Align.
Proceedings of the Advances in Natural Language Processing, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation.
Speech Commun., 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
2009
Proceedings of the Advances in Nonlinear Speech Processing, 2009
On the mutual information of glottal source estimation techniques for the automatic detection of speech pathologies.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009
A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
On the mutual information between source and filter contributions for voice pathology detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 17th European Signal Processing Conference, 2009
2008
Glottal Source Estimation Robustness - A Comparison of Sensitivity of Voice Source Estimation Techniques.
Proceedings of the SIGMAP 2008, 2008
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008
Voice source parameters estimation by fitting the glottal formant and the inverse filtering open phase.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007