Antonio Bonafonte

Proceedings of the Fifth International Conference, 2021

2019

Time-domain speech enhancement using generative adversarial networks.

[BibT_eX]

[DOI]

Speech Commun., 2019

Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN.

[BibT_eX]

[DOI]

David Álvarez

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Towards Generalized Speech Enhancement with Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Prosodic Phrase Alignment for Machine Dubbing.

[BibT_eX]

[DOI]

Alp Öktem

Mireia Farrús

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Visualizing Punctuation Restoration in Speech Transcripts with Prosograph.

[BibT_eX]

[DOI]

Alp Öktem

Mireia Farrús

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Expressive Speech Synthesis Using Sentiment Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder.

[BibT_eX]

[DOI]

Georgina Dorca

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks.

[BibT_eX]

[DOI]

José Andrés González López

Proceedings of the Fourth International Conference, 2018

Self-Attention Linguistic-Acoustic Decoder.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Bilingual Prosodic Dataset Compilation for Spoken Language Translation.

[BibT_eX]

[DOI]

Alp Öktem

Mireia Farrús

Proceedings of the Fourth International Conference, 2018

Corpus for Cyberbullying Prevention.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Multi-Speaker Neural Vocoder.

[BibT_eX]

[DOI]

Oriol Barbany

Proceedings of the Fourth International Conference, 2018

2017

SEGAN: Speech Enhancement Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model.

[BibT_eX]

[DOI]

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Prosodic and Spectral iVectors for Expressive Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Direct Expressive Voice Training Based on Semantic Selection.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Prosodic Break Prediction with RNNs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Acoustic feature prediction from semantic features for expressive speech using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

2015

Creating expressive synthetic voices by unsupervised clustering of audiobooks.

[BibT_eX]

[DOI]

Paula Lopez-Otero

Laura Docío Fernández

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2013

Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan.

[BibT_eX]

[DOI]

Juan María Garrido

Lourdes Aguilar

Valentín Cardeñoso-Payo

Emma Rodero

Carme de la Mota

César González Ferreras

Carlos Vivaracho-Pascual

Eva Estebas-Vilaplana

Mercedes Cabrera

Lang. Resour. Evaluation, 2013

Parametric decomposition of the spectral envelope.

[BibT_eX]

[DOI]

Anderson Fraiha Machado

Marcelo Queiroz

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence.

[BibT_eX]

[DOI]

Speech Commun., 2012

Building Synthetic Voices in the META-NET Framework.

[BibT_eX]

[DOI]

Emília Garcia Casademont

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

BUCEADOR, a multi-language search engine for digital libraries.

[BibT_eX]

[DOI]

Antonio Cardenal López

Eduardo Rodríguez Banga

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011

Introducing nativization to Spanish TTS systems.

[BibT_eX]

[DOI]

Speech Commun., 2011

Adding Glottal Source Information to Intra-Lingual Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Albayzín 2010: A Spanish Text to Speech Evaluation.

[BibT_eX]

[DOI]

Francisco Campillo

Francisco Méndez Pazó

Montserrat Arza

Laura Docío Fernández

Eva Navas

Iñaki Sainz

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Work in progress - Cooperative and competitive projects for engaging students in advanced ICT subjects.

[BibT_eX]

[DOI]

Proceedings of the 2011 Frontiers in Education Conference, 2011

BUCEADOR hybrid TTS for Blizzard Challenge 2011.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010

INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora.

[BibT_eX]

[DOI]

Daniel Erro

IEEE Trans. Speech Audio Process., 2010

Voice Conversion Based on Weighted Frequency Warping.

[BibT_eX]

[DOI]

Daniel Erro

IEEE Trans. Speech Audio Process., 2010

Nativization of English words in Spanish using analogy.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

TTS Evaluation Campaign with a Common Spanish Database.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Synthesis of filled pauses based on a disfluent speech model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Voice conversion using k-histograms and frame selection.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Towards robust glottal source modeling.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Determining intonational boundaries from the acoustic signal.

[BibT_eX]

[DOI]

Lourdes Aguilar

Francisco Campillo

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Improving consistence of phonetic transcription for text-to-speech.

[BibT_eX]

[DOI]

Juan Carlos Tulli

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

New strategies for pronunciation by analogy.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Corpus and Voices for Catalan Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Language Resources and Evaluation, 2008

Intonation modeling of Mandarin Chinese using a superpositional approach.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

On the generation of synthetic disfluent speech: local prosodic modifications caused by the insertion of editing terms.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A study of JEMA for intonation modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

The UPC TTS System Description for the 2008 Blizzard Challenge.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2008, 2008

2007

Filled Pauses in Speech Synthesis: Towards Conversational Speech.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Flexible harmonic/stochastic speech synthesis.

[BibT_eX]

[DOI]

Daniel Erro

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Statistical analysis of filled pauses<sup>2</sup> rhythm for disfluent speech synthesis.

[BibT_eX]

[DOI]

David Escudero

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

The UPC TTS system description for the 2007 Blizzard Challenge.

[BibT_eX]

[DOI]

Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007

2006

Spanish Synthesis Corpora.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

ECESS Inter-Module Interface Specification for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

GAIA: Common Framework for the Development of Speech Translation Technologies.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Acceptance Testing of a Spoken Language Translation System.

[BibT_eX]

[DOI]

Rafael E. Banchs

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Text-independent cross-language voice conversion.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Learning from errors in grapheme-to-phoneme conversion.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Text-Independent Voice Conversion Based on Unit Selection.

[BibT_eX]

[DOI]

Shrikanth S. Narayanan

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Residual Conversion Versus Prediction on Voice Morphing Systems.

[BibT_eX]

[DOI]

Helenca Duxans

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Prosody Generation for Speech-to-Speech Translation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Database Pruning for Unsupervised Building of Text-To-Speech Voices.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Main Issues in Grapheme-to-Phoneme Conversion for TTS.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2005

Analysis of prosodic features towards modelling of emotional and pragmatic attributes of speech.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2005

Evaluation of VTLN-based voice conversion for embedded speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Training the tilt intonation model using the JEMA methodology.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Automatic voice-source parameterization of natural speech.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A Study on Residual Prediction Techniques for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Comparative study of Automatic Phone Segmentation methods for TTS.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Rational characteristic functions and Markov chains: application to modeling probability density functions.

[BibT_eX]

[DOI]

Josep Vidal

Natalia Fernández

Signal Process., 2004

Voice Conversion Using Exclusively Unaligned Training Data.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2004

Including dynamic information in voice conversion systems.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2004

Intonation modeling for TTS using a joint extraction and prediction approach.

[BibT_eX]

[DOI]

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Towards phone segmentation for concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

A first step towards text-independent voice conversion.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Including dynamic and phonetic information in voice conversion systems.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Joint extraction and prediction of fujisaki's intonation model parameters.

[BibT_eX]

[DOI]

Klaus Wimmer

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Lexicon and Corpora for Speech to Speech Translation (LC-STAR).

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2003

Phrase break prediction: a comparative study.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2003

Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques.

[BibT_eX]

[DOI]

Valentín Cardeñoso-Payo

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Estimation of GMM in voice conversion including unaligned data.

[BibT_eX]

[DOI]

Helenca Duxans

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Lexica and corpora for speech-to-speech translation: a trilingual approach.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

HMM recognition of expressions in unrestrained video intervals.

[BibT_eX]

[DOI]

José Luis Landabaso

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Facial animation parameters extraction and expression recognition using Hidden Markov Models.

[BibT_eX]

[DOI]

Signal Process. Image Commun., 2002

Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Emotion recognition based on MPEG-4 Facial Animation Parameters.

[BibT_eX]

[DOI]

José Luis Landabaso

Proceedings of the IEEE International Conference on Acoustics, 2002

Corpus based extraction of quantitative prosodic parameters of stress groups in Spanish.

[BibT_eX]

[DOI]

Valentín Cardeñoso-Payo

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Speech emotion recognition using hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

The demiphone: An efficient contextual subword unit for continuous speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2000

1998

Modeling phone duration: application to Catalan TTS.

[BibT_eX]

[DOI]

Albert Febrer

Jaume Padrell

Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Using x-gram for efficient speech recognition.

[BibT_eX]

[DOI]