Martti Vainio
Orcid: 0000-0003-2570-0196Affiliations:
- University of Helsinki, Finland
According to our database1,
Martti Vainio
authored at least 72 papers
between 1996 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Sound symbolism in manual and vocal responses: phoneme-response interactions associated with grasping as well as vertical and size dimensions of keypresses.
Cogn. Process., August, 2024
High-Pitched Sound is Open and Low-Pitched Sound is Closed: Representing the Spatial Meaning of Pitch Height.
Cogn. Sci., August, 2024
2023
Investigating the Utility of Surprisal from Large Language Models for Speech Synthesis Prosody.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
2020
J. Phonetics, 2020
CoRR, 2020
2019
The Sound of Grasp Affordances: Influence of Grasp-Related Size of Categorized Objects on Vocalization.
Cogn. Sci., 2019
Predicting Prosodic Prominence from Text with Pre-trained Contextualized Word Representations.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Prosodic Representations of Prominence Classification Neural Networks and Autoencoders Using Bottleneck Features.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2017
Hierarchical representation and estimation of prosody using continuous wavelet transform.
Comput. Speech Lang., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis.
Speech Commun., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Digitala: An Augmented Test and Review Process Prototype for High-Stakes Spoken Foreign Language Examination.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
NeuroImage, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Different parts of the same elephant: A roadmap to disentangle and connect different perspectives on prosodic prominence.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
2014
J. Phonetics, 2014
Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise.
Comput. Speech Lang., 2014
An adaptive post-filtering method producing an artificial Lombard-like effect for intelligibility enhancement of narrowband telephone speech.
Comput. Speech Lang., 2014
Phonetics and Machine Learning: Hierarchical Modelling of Prosody in Statistical Speech Synthesis.
Proceedings of the Statistical Language and Speech Processing, 2014
Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Voice source modelling using deep neural networks for statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014
2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Acoustic and visual phonetic features in the mcgurk effect - an audiovisual speech illusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Lombard modified text-to-speech synthesis for improved intelligibility: submission for the hurricane challenge 2013.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Language background affects the strength of the pitch bias in a duration discrimination task.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Utilization of the Lombard effect in post-filtering for intelligibility enhancement of telephone speech.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Intonational speaker verification: A study on parameters and performance under noisy conditions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment?
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Comparison of post-filtering methods for intelligibility enhancement of telephone speech.
Proceedings of the 20th European Signal Processing Conference, 2012
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
Estimates for the Measurement and Articulatory Error in MRI Data from Sustained Vowel Production.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011
The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011
Recording Speech Sound and Articulation in MRI.
Proceedings of the BIODEVICES 2011, 2011
2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
2009
New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
IEEE Trans. Speech Audio Process., 2008
Proceedings of the Text, Speech and Dialogue, 11th International Conference, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
Proceedings of the Fifth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2007
2006
J. Phonetics, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999
Relational vs. object-oriented models for representing speech: a comparison using ANDOSL data.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Modeling the microprosody of pitch and loudness for speech synthesis with neural networks.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1996
Pitch, loudness, and segmental duration correlates: towards a model for the phonetic aspects of finnish prosody.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
A multilingual phonetic representation and analysis system for different speech databases.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996