Clara Rivera

According to our database, Clara Rivera authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.
CoRR, 2023

MD3: The Multi-Dialect Dataset of Dialogues.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023


TaTA: A Multilingual Table-to-Text Dataset for African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets.
Trans. Assoc. Comput. Linguistics, 2022

FLEURS: Few-Shot Learning Evaluation of Universal Representations of Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Writing System and Speaker Metadata for 2,800+ Language Varieties.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

XTREME-S: Evaluating Cross-lingual Speech Representations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2020
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview.
CoRR, 2020

Open-Source High Quality Speech Datasets for Basque, Catalan and Galician.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Open-source Multi-speaker Corpora of the English Accents in the British Isles.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Developing an Open-Source Corpus of Yoruba Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multimodal Pretraining for Dense Video Captioning.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

