Tuomo Raitio

According to our database1, Tuomo Raitio authored at least 52 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Dialog Modeling in Audiobook Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Audiobook synthesis with long-form neural text-to-speech.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

2022
Emphasis Control for Parallel Neural TTS.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Hierarchical Prosody Modeling and Control in Non-Autoregressive Parallel Neural TTS.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Whispered and Lombard Neural Speech Synthesis.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

On-Device Neural Speech Synthesis.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Controllable Neural Text-to-Speech Synthesis Using Intuitive Prosodic Features.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2017
Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis.
Speech Commun., 2016

2015
Toward a Universal Synthetic Speech Spoofing Detection Using Phase Information.
IEEE Trans. Inf. Forensics Secur., 2015

A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Phase perception of the glottal excitation of vocoded speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Noise robust estimation of the voice source using a deep neural network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise.
Comput. Speech Lang., 2014

Automatic glottal inverse filtering with the Markov chain Monte Carlo method.
Comput. Speech Lang., 2014

Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

DNN-based stochastic postfilter for HMM-based speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components.
Proceedings of the IEEE International Conference on Acoustics, 2014

COVAREP - A collaborative voice analysis repository for speech technologies.
Proceedings of the IEEE International Conference on Acoustics, 2014

A comparative evaluation of vocoding techniques for HMM-based laughter synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Parametric representation for singing voice synthesis: A comparative evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Voice source modelling using deep neural networks for statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014

Effect of MPEG audio compression on vocoders used in statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014

The Simple4All entry to the Blizzard Challenge 2014.
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013
Wavelets for intonation modeling in HMM speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Analysis and synthesis of shouted speech.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

HMM-based synthesis of creaky voice.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Effect of MPEG audio compression on HMM-based speech synthesis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Comparing glottal-flow-excited statistical parametric speech synthesis methods.
Proceedings of the IEEE International Conference on Acoustics, 2013

Prediction of creaky voice from contextual factors.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Effect of noise type and level on focus related fundamental frequency changes.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Wideband Parametric Speech Synthesis Using Warped Linear Prediction.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Automatic Detection of High Vocal Effort in Telephone Speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Voice source analysis using biomechanical modeling and glottal inverse filtering.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Towards Glottal Source Controllability in Expressive Speech Synthesis.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Utilizing Markov Chain Monte Carlo (MCMC) Method for Improved Glottal Inverse Filtering.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

The GlottHMM Entry for Blizzard Challenge 2012: Hybrid Approach.
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012

2011
HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering.
IEEE Trans. Speech Audio Process., 2011

Analysis of HMM-Based Lombard Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Detection of Shouted Speech in the Presence of Ambient Noise.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011

The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010
Comparison of formant enhancement methods for HMM-based speech synthesis.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2010.
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

2009
New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
HMM-based Finnish text-to-speech system utilizing glottal inverse filtering.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008


  Loading...