Thierry Etchegoyhen

ORCID: 0000-0001-7253-1693

According to our database, Thierry Etchegoyhen authored at least 44 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Automating Easy Read Text Segmentation.
CoRR, 2024

Vicomtech@WMT 2024: Shared Task on Translation into Low-Resource Languages of Spain.
Proceedings of the Ninth Conference on Machine Translation, 2024

STREAMS: Live Streaming and Micro-batching with Automated Multilingual Services.
Proceedings of the Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations (SEPLN-CEDI-PD 2024) co-located with the 7th Spanish Conference on Informatics (CEDI 2024), 2024

Does Context Help Mitigate Gender Bias in Neural Machine Translation?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Automating Easy Read Text Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Promoting Target Data in Context-aware Neural Machine Translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Split and Rephrase with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Split and Rephrase with Large Language Models.
CoRR, 2023

IRAZ: Easy-to-Read Content Generation via Automated Text Simplification.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing 2023: Projects and System Demonstrations (SEPLN-PD 2023) co-located with the International Conference of the Spanish Society for Natural Language Processing (SEPLN 2023), 2023

An Evaluation of Source Factors in Concatenation-Based Context-Aware Neural Machine Translation.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

What Works When in Context-aware Neural Machine Translation?
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

Unsupervised Subtitle Segmentation with Masked Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Making the most of comparable corpora in Neural Machine Translation: a case study.
Lang. Resour. Evaluation, 2022

iASSIST: Low-cost, portable and embedded assistants for on-premise automated transcription and translation services.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

TANDO: A Corpus for Document-level Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
ITAI: Adaptive Neural Machine Translation Platform.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2021), 2021

Online Learning over Time in Adaptive Neural Machine Translation.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

mintzai-ST: Corpus and Baselines for Basque-Spanish Speech Translation.
Proceedings of the Fifth International Conference, 2021

2020
MINTZAI: Sistemas de Aprendizaje Profundo E2E para Traducción Automática del Habla.
Proces. del Leng. Natural, 2020

ELRI: A Decentralised Network of National Relay Stations to Collect, Prepare and Share Language Resources.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020

Handle with Care: A Case Study in Comparable Corpora Exploitation for Neural Machine Translation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

To Case or not to case: Evaluating Casing Methods for Neural Machine Translation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The Multilingual Anonymisation Toolkit for Public Administrations (MAPA) Project.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

2019
Efficient document alignment across scenarios.
Mach. Transl., 2019

2018
QUALES: Estimación Automática de Calidad de Traducción Mediante Aprendizaje Automático Supervisado y No-Supervisado.
Proces. del Leng. Natural, 2018

Supervised and Unsupervised Minimalist Quality Estimators: Vicomtech's Participation in the WMT 2018 Quality Estimation Task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

STACC, OOV Density and N-gram Saturation: Vicomtech's Participation in the WMT 2018 Shared Task on Parallel Corpus Filtering.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Evaluating Domain Adaptation for Machine Translation Across Scenarios.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Extracting Parallel Sentences from Comparable Corpora with STACC Variants.
Proceedings of the 11th Workshop on Building and Using Comparable Corpora, 2018

Neural Machine Translation of Basque.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

2017
Exploiting Relative Frequencies for Data Selection.
Proceedings of Machine Translation Summit XVI, Volume 1: Research Track, 2017

Weighted Set-Theoretic Alignment of Comparable Sentences.
Proceedings of the 10th Workshop on Building and Using Comparable Corpora, 2017

2016
DOCAL - Vicomtech's Participation in the WMT16 Shared Task on Bilingual Document Alignment.
Proceedings of the First Conference on Machine Translation, 2016

Exploiting a Large Strongly Comparable Corpus.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

A Portable Method for Parallel and Comparable Document Alignment.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

Statistically-Guided Controlled Language Authoring.
Proceedings of the Controlled Natural Language - 5th International Workshop, 2016

Set-Theoretic Alignment for Comparable Corpora.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2014
Lexical Normalization of Spanish Tweets with Rule-Based Components and Language Models.
Proces. del Leng. Natural, 2014

Machine Translation for Subtitling: A Large-Scale Evaluation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Towards Customized Automatic Segmentation of Subtitles.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

2013
Lexical Normalization of Spanish Tweets with Preprocessing Rules, Domain-specific Edit Distances, and Language Models.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

1998
System Demonstration Overview of GBGEN.
Proceedings of the Ninth International Workshop on Natural Language Generation, 1998

