Antonio Bonafonte

Orcid: 0000-0002-6240-9915

According to our database1, Antonio Bonafonte authored at least 113 papers between 1992 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Controllable Emphasis with zero data for text-to-speech.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

2022
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue.
CoRR, 2022

Distribution Augmentation for Low-Resource Expressive Text-To-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech.
Proceedings of the 6th International Conference, 2022

2021
Corpora compilation for prosody-informed speech processing.
Lang. Resour. Evaluation, 2021

Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech Systems.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Diverse Conversational Spoken Language Generation.
Proceedings of the Fifth International Conference, 2021

2019
Time-domain speech enhancement using generative adversarial networks.
Speech Commun., 2019

Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Towards Generalized Speech Enhancement with Generative Adversarial Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Prosodic Phrase Alignment for Machine Dubbing.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Visualizing Punctuation Restoration in Speech Transcripts with Prosograph.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Expressive Speech Synthesis Using Sentiment Embeddings.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks.
Proceedings of the Fourth International Conference, 2018

Self-Attention Linguistic-Acoustic Decoder.
Proceedings of the Fourth International Conference, 2018

Bilingual Prosodic Dataset Compilation for Spoken Language Translation.
Proceedings of the Fourth International Conference, 2018

Corpus for Cyberbullying Prevention.
Proceedings of the Fourth International Conference, 2018

Multi-Speaker Neural Vocoder.
Proceedings of the Fourth International Conference, 2018

2017
SEGAN: Speech Enhancement Generative Adversarial Network.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Prosodic and Spectral iVectors for Expressive Speech Synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Direct Expressive Voice Training Based on Semantic Selection.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Prosodic Break Prediction with RNNs.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation.
Proceedings of the 24th European Signal Processing Conference, 2016

Acoustic feature prediction from semantic features for expressive speech using deep neural networks.
Proceedings of the 24th European Signal Processing Conference, 2016

2015
Creating expressive synthetic voices by unsupervised clustering of audiobooks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2013
Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan.
Lang. Resour. Evaluation, 2013

Parametric decomposition of the spectral envelope.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence.
Speech Commun., 2012

Building Synthetic Voices in the META-NET Framework.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

BUCEADOR, a multi-language search engine for digital libraries.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Introducing nativization to Spanish TTS systems.
Speech Commun., 2011

Adding Glottal Source Information to Intra-Lingual Voice Conversion.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Albayzín 2010: A Spanish Text to Speech Evaluation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Work in progress - Cooperative and competitive projects for engaging students in advanced ICT subjects.
Proceedings of the 2011 Frontiers in Education Conference, 2011

BUCEADOR hybrid TTS for Blizzard Challenge 2011.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010
INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora.
IEEE Trans. Speech Audio Process., 2010

Voice Conversion Based on Weighted Frequency Warping.
IEEE Trans. Speech Audio Process., 2010

Nativization of English words in Spanish using analogy.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

TTS Evaluation Campaign with a Common Spanish Database.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Synthesis of filled pauses based on a disfluent speech model.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Voice conversion using k-histograms and frame selection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Towards robust glottal source modeling.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Determining intonational boundaries from the acoustic signal.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Improving consistence of phonetic transcription for text-to-speech.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

New strategies for pronunciation by analogy.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Corpus and Voices for Catalan Speech Synthesis.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Intonation modeling of Mandarin Chinese using a superpositional approach.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

On the generation of synthetic disfluent speech: local prosodic modifications caused by the insertion of editing terms.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A study of JEMA for intonation modeling.
Proceedings of the IEEE International Conference on Acoustics, 2008

The UPC TTS System Description for the 2008 Blizzard Challenge.
Proceedings of the Blizzard Challenge 2008, 2008

2007
Filled Pauses in Speech Synthesis: Towards Conversational Speech.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Flexible harmonic/stochastic speech synthesis.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Statistical analysis of filled pauses<sup>2</sup> rhythm for disfluent speech synthesis.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

The UPC TTS system description for the 2007 Blizzard Challenge.
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007

2006
Spanish Synthesis Corpora.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

ECESS Inter-Module Interface Specification for Speech Synthesis.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

GAIA: Common Framework for the Development of Speech Translation Technologies.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

TC-STAR: Specifications of Language Resources and Evaluation for Speech Synthesis.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Acceptance Testing of a Spoken Language Translation System.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Text-independent cross-language voice conversion.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Learning from errors in grapheme-to-phoneme conversion.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Text-Independent Voice Conversion Based on Unit Selection.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Residual Conversion Versus Prediction on Voice Morphing Systems.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Prosody Generation for Speech-to-Speech Translation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Database Pruning for Unsupervised Building of Text-To-Speech Voices.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Main Issues in Grapheme-to-Phoneme Conversion for TTS.
Proces. del Leng. Natural, 2005

Analysis of prosodic features towards modelling of emotional and pragmatic attributes of speech.
Proces. del Leng. Natural, 2005

Evaluation of VTLN-based voice conversion for embedded speech synthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Training the tilt intonation model using the JEMA methodology.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Automatic voice-source parameterization of natural speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A Study on Residual Prediction Techniques for Voice Conversion.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Comparative study of Automatic Phone Segmentation methods for TTS.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Rational characteristic functions and Markov chains: application to modeling probability density functions.
Signal Process., 2004

Voice Conversion Using Exclusively Unaligned Training Data.
Proces. del Leng. Natural, 2004

Including dynamic information in voice conversion systems.
Proces. del Leng. Natural, 2004

Intonation modeling for TTS using a joint extraction and prediction approach.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

Towards phone segmentation for concatenative speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004

A first step towards text-independent voice conversion.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Including dynamic and phonetic information in voice conversion systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Joint extraction and prediction of fujisaki's intonation model parameters.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Lexicon and Corpora for Speech to Speech Translation (LC-STAR).
Proces. del Leng. Natural, 2003

Phrase break prediction: a comparative study.
Proces. del Leng. Natural, 2003

Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Estimation of GMM in voice conversion including unaligned data.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Lexica and corpora for speech-to-speech translation: a trilingual approach.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

HMM recognition of expressions in unrestrained video intervals.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Facial animation parameters extraction and expression recognition using Hidden Markov Models.
Signal Process. Image Commun., 2002

Interface Databases: Design and Collection of a Multilingual Emotional Speech Database.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Emotion recognition based on MPEG-4 Facial Animation Parameters.
Proceedings of the IEEE International Conference on Acoustics, 2002

Corpus based extraction of quantitative prosodic parameters of stress groups in Spanish.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Speech emotion recognition using hidden Markov models.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
The demiphone: An efficient contextual subword unit for continuous speech recognition.
Speech Commun., 2000

1998
Modeling phone duration: application to Catalan TTS.
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998

Using x-gram for efficient speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The UPC text-to-speech system for Spanish and catalan.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
The demiphone: an efficient subword unit for continuous speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A bilingual text-to-speech system in Spanish and catalan.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Duration modeling with expanded HMM applied to speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Explicit segmentation of speech using Gaussian models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Sethos: the UPC speech understanding system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Language modeling using x-grams.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Semantic decoding of speech in constrained domains.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Study of subword units for Spanish speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1993
Albayzin speech database: design of the phonetic corpus.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Out-of-vocabulary word modelling and rejection for keyword spotting.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

An efficient algorithm to find the best state sequence in HSMM.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Syllabic fillers for Spanish HMM keyword spotting.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Efficient integration of coarticulation and lexical information in a finite state grammar.
Proceedings of the Second International Conference on Spoken Language Processing, 1992


  Loading...