We stand with Ukraine

We stand with Ukraine

Marcely Zanon Boito

Orcid: 0000-0003-0134-6719

According to our database¹, Marcely Zanon Boito authored at least 25 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2024

LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2024

mHuBERT-147: A Compact Multilingual HuBERT Model.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

,

,

Laurent Besacier

,

Ioan Calapodescu

CoRR, 2024

Multilingual Distilwhisper: Efficient Distillation of Multi-Task Speech Models Via Language-Specific Experts.

[BibT_eX]

[DOI]

Thomas Palmeira Ferraz

,

Marcely Zanon Boito

,

,

Vassilina Nikoulina

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts.

[BibT_eX]

[DOI]

Thomas Palmeira Ferraz

,

Marcely Zanon Boito

,

,

Vassilina Nikoulina

CoRR, 2023

NAVER LABS Europe's Multilingual Speech Translation Systems for the IWSLT 2023 Low-Resource Track.

[BibT_eX]

[DOI]

Edward Gow-Smith

,

Alexandre Berard

,

Marcely Zanon Boito

,

Ioan Calapodescu

Proceedings of the 20th International Conference on Spoken Language Translation, 2023

2022

Speech Resources in the Tamasheq Language.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

,

Florentin Barbier

,

Souhir Gahbiche

,

,

Mickael Rouvier

,

Yannick Estève

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

,

,

Antoine Laurent

,

,

,

,

,

Florentin Barbier

,

Souhir Gahbiche

,

Yannick Estève

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Findings of the IWSLT 2022 Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Laurent Besacier

,

Natalia A. Tomashenko

,

Yannick Estève

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Models and resources for attention-based unsupervised word segmentation : an application to computational language documentation. (Modèles et ressources pour la segmentation non supervisée des mots basée sur l'attention).

[BibT_eX]

[DOI]

Marcely Zanon Boito

PhD thesis, 2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

,

,

Aline Villavicencio

,

Laurent Besacier

CoRR, 2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.

[BibT_eX]

[DOI]

,

,

,

Marcely Zanon Boito

,

Salima Mdhaffar

,

,

,

Natalia A. Tomashenko

,

Marco Dinarelli

,

Titouan Parcollet

,

Alexandre Allauzen

,

Yannick Estève

,

Benjamin Lecouteux

,

François Portet

,

Solange Rossato

,

Fabien Ringeval

,

,

Laurent Besacier

CoRR, 2021

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.

[BibT_eX]

[DOI]

,

,

,

Marcely Zanon Boito

,

Salima Mdhaffar

,

,

,

Natalia A. Tomashenko

,

Marco Dinarelli

,

Titouan Parcollet

,

Alexandre Allauzen

,

Yannick Estève

,

Benjamin Lecouteux

,

François Portet

,

Solange Rossato

,

Fabien Ringeval

,

,

Laurent Besacier

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

<i>LeBenchmark</i>: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.

[BibT_eX]

[DOI]

,

,

,

Marcely Zanon Boito

,

Salima Mdhaffar

,

,

,

Natalia A. Tomashenko

,

Marco Dinarelli

,

Titouan Parcollet

,

Alexandre Allauzen

,

Yannick Estève

,

Benjamin Lecouteux

,

François Portet

,

Solange Rossato

,

Fabien Ringeval

,

,

Laurent Besacier

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Investigating alignment interpretability for low-resource NMT.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Aline Villavicencio

,

Laurent Besacier

Mach. Transl., 2020

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Aline Villavicencio

,

Laurent Besacier

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

,

Mahault Garnerin

,

Éric Le Ferrand

,

Laurent Besacier

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019

ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.

[BibT_eX]

[DOI]

,

Natalia A. Tomashenko

,

Marcely Zanon Boito

,

Antoine Caubrière

,

,

Mickael Rouvier

,

Laurent Besacier

,

Yannick Estève

CoRR, 2019

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Aline Villavicencio

,

Laurent Besacier

CoRR, 2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-Resource Settings.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Aline Villavicencio

,

Laurent Besacier

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

A Small Griko-Italian Speech Translation Corpus.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Antonios Anastasopoulos

,

Aline Villavicencio

,

Laurent Besacier

,

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

A Very Low Resource Language Speech Corpus for Computational Language Documentation Experiments.

[BibT_eX]

[DOI]

,

,

Martine Adda-Decker

,

,

Laurent Besacier

,

Jamison Cooper-Leavitt

,

Guy-Noël Kouarata

,

,

Hélène Maynard

,

,

,

Sebastian Stüker

,

,

Marcely Zanon Boito

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Word Segmentation from Speech with Attention.

[BibT_eX]

[DOI]

,

Marcely Zanon Boito

,

,

Alexandre Berard

,

,

Aline Villavicencio

,

Laurent Besacier

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Unwritten languages demand attention too! Word discovery with encoder-decoder models.

[BibT_eX]

[DOI]

Marcely Zanon Boito

,

Alexandre Berard

,

Aline Villavicencio

,

Laurent Besacier

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2014

Size Does Not Matter. Frequency Does. A Study of Features for Measuring Lexical Complexity.

[BibT_eX]

[DOI]

Rodrigo Wilkens

,

Alessandro Dalla Vecchia

,

Marcely Zanon Boito

,

,

Aline Villavicencio

Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

Loading...