We stand with Ukraine

We stand with Ukraine

Tuomo Raitio

According to our database¹, Tuomo Raitio authored at least 52 papers between 2008 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Dialog Modeling in Audiobook Synthesis.

[BibT_eX]

[DOI]

Cheng-chieh Yeh

,

Amirreza Shirani

,

,

,

Ramya Rasipuram

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Audiobook synthesis with long-form neural text-to-speech.

[BibT_eX]

[DOI]

,

Cheng-chieh Yeh

,

,

,

Ramya Rasipuram

,

,

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

2022

Emphasis Control for Parallel Neural TTS.

[BibT_eX]

[DOI]

Shreyas Seshadri

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise.

[BibT_eX]

[DOI]

,

,

,

P. V. Muhammed Shifas

,

,

Yannis Stylianou

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Hierarchical Prosody Modeling and Control in Non-Autoregressive Parallel Neural TTS.

[BibT_eX]

[DOI]

,

,

Shreyas Seshadri

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Whispered and Lombard Neural Speech Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

Varun Lakshminarasimhan

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

On-Device Neural Speech Synthesis.

[BibT_eX]

[DOI]

Sivanand Achanta

,

,

,

,

,

Ramya Rasipuram

,

Francesco Rossi

,

,

Jaimin Upadhyay

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Controllable Neural Text-to-Speech Synthesis Using Intuitive Prosodic Features.

[BibT_eX]

[DOI]

,

Ramya Rasipuram

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2017

Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System.

[BibT_eX]

[DOI]

,

,

Alistair Conkie

,

,

Abie Hadjitarkhani

,

,

Nancy Huddleston

,

,

,

Matthias Neeracher

,

Kishore Prahallad

,

,

Ramya Rasipuram

,

,

Becci Williamson

,

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Speech Commun., 2016

2015

Toward a Universal Synthetic Speech Spoofing Detection Using Phase Information.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Inf. Forensics Secur., 2015

A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

,

,

Cassia Valentini-Botinhao

,

,

Junichi Yamagishi

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Phase perception of the glottal excitation of vocoded speech.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Noise robust estimation of the voice source using a deep neural network.

[BibT_eX]

[DOI]

Manu Airaksinen

,

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction.

[BibT_eX]

[DOI]

Manu Airaksinen

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise.

[BibT_eX]

[DOI]

,

,

,

Comput. Speech Lang., 2014

Automatic glottal inverse filtering with the Markov chain Monte Carlo method.

[BibT_eX]

[DOI]

,

,

Manu Airaksinen

,

Samuli Siltanen

,

,

Comput. Speech Lang., 2014

Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

DNN-based stochastic postfilter for HMM-based speech synthesis.

[BibT_eX]

[DOI]

,

,

Cassia Valentini-Botinhao

,

Junichi Yamagishi

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components.

[BibT_eX]

[DOI]

,

Proceedings of the IEEE International Conference on Acoustics, 2014

COVAREP - A collaborative voice analysis repository for speech technologies.

[BibT_eX]

[DOI]

Gilles Degottex

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

A comparative evaluation of vocoding techniques for HMM-based laughter synthesis.

[BibT_eX]

[DOI]

Bajibabu Bollepalli

,

Jérôme Urbain

,

,

Joakim Gustafson

,

Hüseyin Çakmak

Proceedings of the IEEE International Conference on Acoustics, 2014

Parametric representation for singing voice synthesis: A comparative evaluation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Voice source modelling using deep neural networks for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 22nd European Signal Processing Conference, 2014

Effect of MPEG audio compression on vocoders used in statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Bajibabu Bollepalli

,

Proceedings of the 22nd European Signal Processing Conference, 2014

The Simple4All entry to the Blizzard Challenge 2014.

[BibT_eX]

[DOI]

,

,

Dhananjaya Gowda

,

,

,

Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013

Wavelets for intonation modeling in HMM speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Analysis and synthesis of shouted speech.

[BibT_eX]

[DOI]

,

,

Jouni Pohjalainen

,

Manu Airaksinen

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

HMM-based synthesis of creaky voice.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Effect of MPEG audio compression on HMM-based speech synthesis.

[BibT_eX]

[DOI]

Bajibabu Bollepalli

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Comparing glottal-flow-excited statistical parametric speech synthesis methods.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Prediction of creaky voice from contextual factors.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Effect of noise type and level on focus related fundamental frequency changes.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Juhani Järvikivi

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Wideband Parametric Speech Synthesis Using Warped Linear Prediction.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Automatic Detection of High Vocal Effort in Telephone Speech.

[BibT_eX]

[DOI]

Jouni Pohjalainen

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Voice source analysis using biomechanical modeling and glottal inverse filtering.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Towards Glottal Source Controllability in Expressive Speech Synthesis.

[BibT_eX]

[DOI]

Jaime Lorenzo-Trueba

,

Roberto Barra-Chicote

,

,

,

,

Junichi Yamagishi

,

Juan Manuel Montero

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Utilizing Markov Chain Monte Carlo (MCMC) Method for Improved Glottal Inverse Filtering.

[BibT_eX]

[DOI]

,

,

Samuli Siltanen

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment?

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

The GlottHMM Entry for Blizzard Challenge 2012: Hybrid Approach.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012

2011

HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering.

[BibT_eX]

[DOI]

,

,

Junichi Yamagishi

,

,

,

,

IEEE Trans. Speech Audio Process., 2011

Analysis of HMM-Based Lombard Speech Synthesis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Detection of Shouted Speech in the Presence of Ambient Noise.

[BibT_eX]

[DOI]

Jouni Pohjalainen

,

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2011

The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010

Comparison of formant enhancement methods for HMM-based speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2010.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

2009

New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis.

[BibT_eX]

[DOI]

,

,

,

,

Juhani Järvikivi

,

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

HMM-based Finnish text-to-speech system utilizing glottal inverse filtering.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Loading...