Edresson Casanova

Ricardo M. Marcacini

Odilon Gonçalves

Rodrigo Lima

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference.

[BibT_eX]

[DOI]

CoRR, 2024

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model.

[BibT_eX]

[DOI]

CoRR, 2024

TTS applied to the generation of datasets for automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Computational Processing of Portuguese, 2024

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2024

2023

CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.

[BibT_eX]

[DOI]

Ricardo Corso Fernandes Junior

Lucas Oliveira

Bruno Baldissera Carlotto

Fernando Gorgulho Fayet

Lang. Resour. Evaluation, September, 2023

Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person.

[BibT_eX]

[DOI]

Ricardo M. Marcacini

CoRR, 2023

CML-TTS: A Multilingual Dataset for Speech Synthesis in Low-Resource Languages.

[BibT_eX]

[DOI]

Arlindo R. Galvão Filho

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Evaluation of Speech Representations for MOS Prediction.

[BibT_eX]

[DOI]

Lucas R. S. Gris

Arlindo R. Galvão Filho

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion.

[BibT_eX]

[DOI]

Alexander Korolev

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese.

[BibT_eX]

[DOI]

João Paulo Ramos Teixeira

Lang. Resour. Evaluation, 2022

Interpretability Analysis of Deep Models for COVID-19 Detection.

[BibT_eX]

[DOI]

Flaviane Romani Fernandes Svartman

Marcelo Finger

Beatriz Raposo de Medeiros

Marcus Vinícius Moreira Martins

Larissa Cristina Berti

João Paulo Teixeira

CoRR, 2022

A single speaker is almost all you need for automatic speech recognition.

[BibT_eX]

[DOI]

Alexander Korolev

CoRR, 2022

Overview of the Automatic Speech Recognition for Spontaneous and Prepared Speech & Speech Emotion Recognition in Portuguese (S&ER) Shared-tasks at PROPOR 2022.

[BibT_eX]

[DOI]

Ricardo M. Marcacini

Arnaldo Candido Jr.

Proceedings of the Workshop on Automatic Speech Recognition for Spontaneous and Prepared Speech & Speech Emotion Recognition in Portuguese co-located with 15th edition of the International Conference on the Computational Processing of Portuguese (PROPOR 2022), 2022

Brazilian Portuguese Speech Recognition Using Wav2vec 2.0.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone.

[BibT_eX]

[DOI]

Julian Weber

Christopher Dane Shulby

Eren Gölge

Moacir A. Ponti

Proceedings of the International Conference on Machine Learning, 2022

2021

CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese.

[BibT_eX]

[DOI]

Ricardo Corso Fernandes Junior

Lucas Oliveira

Bruno Baldissera Carlotto

Fernando Gorgulho Fayet

CoRR, 2021

Evaluating Semantic Similarity Methods to Build Semantic Predictability Norms of Reading Data.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model.

[BibT_eX]

[DOI]

Eren Gölge

Nicolas Michael Müller

Arnaldo Candido Jr.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Transfer Learning and Data Augmentation Techniques to the COVID-19 Identification Tasks in ComParE 2021.

[BibT_eX]

[DOI]

Ricardo Corso Fernandes Junior

Arnaldo Candido Jr.

Marcelo Finger

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models.

[BibT_eX]

[DOI]

Hamilton Pereira da Silva

Proceedings of the Intelligent Systems - 10th Brazilian Conference, 2021

Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese Speech.

[BibT_eX]

[DOI]

Lucas Gris

Augusto Camargo Neto

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

End-To-End Speech Synthesis Applied to Brazilian Portuguese.

[BibT_eX]

[DOI]

João Paulo Teixeira

CoRR, 2020

Speech2Phone: A Multilingual and Text Independent Speaker Identification Model.

[BibT_eX]

[DOI]

Hamilton Pereira da Silva

Pedro Luiz de Paula Filho

Alessandro Ferreira Cordeiro

Victor de Oliveira Guedes

Marco Antonio Sobrevilla Cabezudo

CoRR, 2020

Natural Language Inference for Portuguese Using BERT and Multilingual Information.

[BibT_eX]

[DOI]

Marcio Lima Inácio

Ana Carolina Rodrigues

Rogério Figueredo de Sousa

Proceedings of the Computational Processing of the Portuguese Language, 2020

Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian Portuguese.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019

NILC at ASSIN 2: Exploring Multilingual Approaches.

[BibT_eX]

[DOI]

Marco Antonio Sobrevilla Cabezudo

Marcio Lima Inácio

Ana Carolina Rodrigues