Manuel Sam Ribeiro

According to our database1, Manuel Sam Ribeiro authored at least 24 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multilingual context-based pronunciation learning for Text-to-Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Cross-Speaker Style Transfer for Text-to-Speech Using Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors.
Speech Commun., 2021

Automatic audiovisual synchronisation for ultrasound tongue imaging.
Speech Commun., 2021

Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Silent versus Modal Multi-Speaker Speech Recognition from Ultrasound and Video.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2019
Ultrasound Tongue Imaging for Diarization and Alignment of Child Speech Therapy Sessions.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Synchronising Audio and Ultrasound by Learning Cross-Modal Embeddings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speaker-independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

The CSTR entry to the 2018 Blizzard Challenge.
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018

2017
Learning Word Vector Representations Based on Acoustic Counts.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

The CSTR entry to the Blizzard Challenge 2017.
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017

2016
Parallel and cascaded deep neural networks for text-to-speech synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The SIWIS Database: A Multilingual Speech Database with Acted Emphasis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A multi-level representation of f0 using the continuous wavelet transform and the Discrete Cosine Transform.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015


  Loading...