Jaume Zaragoza-Bernabeu

According to our database, Jaume Zaragoza-Bernabeu authored at least 8 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
HPLT's First Release of Data and Models.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

A New Massive Multilingual Dataset for High-Performance Language Technologies.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

FastSpell: The LangId Magic Spell.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models.
CoRR, 2023

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

2022
Bicleaner AI: Bicleaner Goes Neural.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2020
Bicleaner at WMT 2020: Universitat d'Alacant-Prompsit's submission to the parallel corpus filtering shared task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Bifixer and Bicleaner: two open-source tools to clean your parallel data.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020
