Marcely Zanon Boito
Orcid: 0000-0003-0134-6719
According to our database, Marcely Zanon Boito authored at least 25 papers between 2014 and 2024.
Timeline: publications per year, 2014 to 2024 (chart not reproduced).
Legend: Book, In proceedings, Article, PhD thesis, Dataset, Other
Links
Online presence: on orcid.org
Bibliography
2024
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.
Comput. Speech Lang., 2024
Multilingual DistilWhisper: Efficient Distillation of Multi-Task Speech Models Via Language-Specific Experts.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts.
CoRR, 2023
NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Models and resources for attention-based unsupervised word segmentation: an application to computational language documentation. (Modèles et ressources pour la segmentation non supervisée des mots basée sur l'attention).
PhD thesis, 2021
CoRR, 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
CoRR, 2021
Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Investigating Language Impact in Bilingual Approaches for Computational Language Documentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.
CoRR, 2019
How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages.
CoRR, 2019
Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-Resource Settings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Unwritten languages demand attention too! Word discovery with encoder-decoder models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2014
Size Does Not Matter. Frequency Does. A Study of Features for Measuring Lexical Complexity.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014