Xabier Saralegi

According to our database, Xabier Saralegi authored at least 34 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
XNLIeu: a dataset for cross-lingual NLI in Basque.
CoRR, 2024

Morphology Aware Source Term Masking for Terminology-Constrained NMT.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

How Well Can BERT Learn the Grammar of an Agglutinative and Flexible-Order Language? The Case of Basque.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Strategies for bilingual intent classification for small datasets scenarios.
Proces. del Leng. Natural, 2023

Not Enough Data to Pre-train Your Language Model? MT to the Rescue!
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Scaling Laws for BERT in Low-Resource Settings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Information retrieval and question answering: A case study on COVID-19 scientific literature.
Knowl. Based Syst., 2022

Gender Bias Mitigation for NMT Involving Genderless Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022

BasqueGLUE: A Natural Language Understanding Benchmark for Basque.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

TANDO: A Corpus for Document-level Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
GEPSA, a tool for monitoring social challenges in digital press.
Proceedings of the First Workshop on Language Technology for Equality, 2021

Fine-Tuning BERT for COVID-19 Domain Ad-Hoc IR by Using Pseudo-qrels.
Proceedings of the Advances in Information Retrieval, 2021

2020
Grammatical Error Correction for Basque through a seq2seq neural architecture and synthetic examples.
Proces. del Leng. Natural, 2020

Elhuyar submission to the Biomedical Translation Task 2020 on terminology and abstracts translation.
Proceedings of the Fifth Conference on Machine Translation, 2020

Building a Task-oriented Dialog System for Languages with no Training Data: the Case for Basque.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Give your Text Representation Models some Love: the Case for Basque.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2018
Talaia: a Real time Monitor of Social Media and Digital Press.
CoRR, 2018

2016
Polarity Lexicon Building: to what Extent Is the Manual Effort Worth?
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluating Translation Quality and CLIR Performance of Query Sessions.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
EliXa: A Modular and Flexible ABSA Platform.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

2013
Elhuyar at Tweet-Norm 2013.
Proceedings of the Tweet Normalization Workshop co-located with 29th Conference of the Spanish Society for Natural Language Processing (SEPLN 2013), 2013

Cross-Lingual Projections vs. Corpora Extracted Subjectivity Lexicons for Less-Resourced Languages.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

Analyzing the Sense Distribution of Concordances Obtained by Web as Corpus Approach.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

Automatic Comparable Web Corpora Collection and Bilingual Terminology Extraction for Specialized Dictionary Making.
Proceedings of the Building and Using Comparable Corpora, 2013

2012
Building a Basque-Chinese Dictionary by Using English as Pivot.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Analyzing Methods for Improving Precision of Pivot Based Bilingual Dictionaries.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Estimating Translation Probabilities from the Web for Structured Queries on CLIR.
Proceedings of the Advances in Information Retrieval, 2010

Document Expansion for Cross-Lingual Passage Retrieval.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

2009
Comparing Different Approaches to Treat Translation Ambiguity in CLIR: Structured Queries vs. Target Co-occurrence Based Selection.
Proceedings of the Database and Expert Systems Applications, 2009

Elhuyar-IXA: Semantic Relatedness and Cross-Lingual Passage Retrieval.
Proceedings of the Multilingual Information Access Evaluation I. Text Retrieval Experiments, 2009

2008
Mining Term Translation from Domain Restricted Comparable Corpora.
Proces. del Leng. Natural, 2008

2007
Similitud entre Documentos Multilingües de Carácter Científico-Técnico en un Entorno Web.
Proces. del Leng. Natural, 2007

2004
A XML-Based Term Extraction Tool for Basque.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

