Marta R. Costa-jussà
Orcid: 0000-0002-5703-520XAffiliations:
- Universitat Politècnica de Catalunya, Madrid, Spain
According to our database1,
Marta R. Costa-jussà
authored at least 224 papers
between 2005 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on scopus.com
-
on orcid.org
-
on d-nb.info
On csauthors.net:
Bibliography
2024
CoRR, 2024
Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation.
CoRR, 2024
2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset.
CoRR, 2024
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Multilingual Machine Translation: Deep Analysis of Language-Specific Encoder-Decoders.
J. Artif. Intell. Res., 2022
Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022
OccGen: Selection of Real-world Multilingual Parallel Data Balanced in Gender within Occupations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Neural Machine Translation for Kashmiri to English and Hindi using Pre-trained Embeddings.
Proceedings of the OITS International Conference on Information Technology, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022
Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Nat. Lang. Eng., 2021
Neural Comput. Appl., 2021
Mach. Transl., 2021
From bilingual to multilingual neural-based machine translation by incremental training.
J. Assoc. Inf. Sci. Technol., 2021
How to Write a Bias Statement: Recommendations for Submissions to the Workshop on Gender Bias in NLP.
CoRR, 2021
High Frequent In-domain Words Segmentation and Forward Translation for the WMT21 Biomedical Task.
Proceedings of the Sixth Conference on Machine Translation, 2021
The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach.
Proceedings of the Sixth Conference on Machine Translation, 2021
Proceedings of the Sixth Conference on Machine Translation, 2021
Enriching the Transformer with Linguistic Factors for Low-Resource Machine Translation.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021
Impact of COVID-19 in Natural Language Processing Publications: a Disaggregated Study in Gender, Contribution and Experience.
Proceedings of the First Workshop on Language Technology for Equality, 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Enabling Zero-Shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Proces. del Leng. Natural, 2020
CoRR, 2020
Training Multilingual Machine Translation by Alternately Freezing Language-Specific Encoders-Decoders.
CoRR, 2020
Enriching the Transformer with Linguistic and Semantic Factors for Low-Resource Machine Translation.
CoRR, 2020
Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction.
Comput. Linguistics, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT.
Proceedings of the Fifth Conference on Machine Translation, 2020
Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages.
Proceedings of the Fifth Conference on Machine Translation, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Abusive language in Spanish children and young teenager's conversations: data preparation and short text classification with contextual word embeddings.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Combining Subword Representations into Word-level Representations in the Transformer Architecture.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020
Proceedings of the Fourth Workshop on Structured Prediction for NLP@EMNLP 2020, 2020
2019
Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019
Nat. Mach. Intell., 2019
Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering.
CoRR, 2019
Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques.
CoRR, 2019
The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT.
Proceedings of the Fourth Conference on Machine Translation, 2019
Terminology-Aware Segmentation and Domain Feature for the WMT19 Biomedical Translation Task.
Proceedings of the Fourth Conference on Machine Translation, 2019
The TALP-UPC System for the WMT Similar Language Task: Statistical vs Neural Machine Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019
Proceedings of the Fourth Conference on Machine Translation, 2019
Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
J. Artif. Intell. Res., 2018
Computación y Sistemas, 2018
(Self-Attentive) Autoencoder-based Universal Language Representation for Machine Translation.
CoRR, 2018
English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach.
CoRR, 2018
Neural Machine Translation with the Transformer and Multi-Source Romance Languages for the Biomedical WMT 2018 task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
Proceedings of the Fourth International Conference, 2018
Proceedings of the Fourth International Conference, 2018
2017
Proces. del Leng. Natural, 2017
Generación morfológica con algoritmos de aprendizaje profundo integrada en un sistema de traducción automática estadística.
Proces. del Leng. Natural, 2017
DeepVoice: Tecnologías de Aprendizaje Profundo aplicadas al Procesado de Voz y Audio.
Proces. del Leng. Natural, 2017
Chinese-Spanish neural machine translation enhanced with character and word bitmap fonts.
Mach. Transl., 2017
Introduction to the special issue on deep learning approaches for machine translation.
Comput. Speech Lang., 2017
Tradares: A Tool for the Automatic Evaluation of Human Translation Quality within a MOOC Environment.
Appl. Artif. Intell., 2017
The TALP-UPC Neural Machine Translation System for German/Finnish-English Using the Inverse Direction Model in Rescoring.
Proceedings of the Second Conference on Machine Translation, 2017
Why Catalan-Spanish Neural Machine Translation? Analysis, comparison and combination with standard Rule and Phrase-based technologies.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017
Proceedings of the 25th European Symposium on Artificial Neural Networks, 2017
Proceedings of the First Workshop on Subword and Character Level Models in NLP, 2017
2016
Description of the Chinese-to-Spanish Rule-Based Machine Translation System Developed Using a Hybrid Combination of Human Annotation and Statistical Techniques.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016
A deep source-context feature for lexical selection in statistical machine translation.
Pattern Recognit. Lett., 2016
Selection of correction candidates for the normalization of Spanish user-generated content.
Nat. Lang. Eng., 2016
J. Artif. Intell. Res., 2016
Morphology Generation for Statistical Machine Translation using Deep Learning Techniques.
CoRR, 2016
WMT 2016 Multimodal Translation System Description based on Bidirectional Recurrent Neural Networks with Double-Embeddings.
Proceedings of the First Conference on Machine Translation, 2016
The TALP-UPC Spanish-English WMT Biomedical Task: Bilingual Embeddings and Char-based Neural Language Model Rescoring in a Phrase-based System.
Proceedings of the First Conference on Machine Translation, 2016
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016
Proceedings of the Sixth Named Entity Workshop, 2016
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Digit. Scholarsh. Humanit., 2015
Knowl. Eng. Rev., 2015
J. Assoc. Inf. Sci. Technol., 2015
Segmentation Strategies to Face Morphology Challenges in Brazilian-Portuguese/English Statistical Machine Translation and Its Integration in Cross-Language Information Retrieval.
Computación y Sistemas, 2015
Comput. Speech Lang., 2015
Proceedings of the Fourth Workshop on Hybrid Approaches to Translation, 2015
2014
Is there Hope for Interlingua methods? A CLIR Comparison Experiment between Interlingua and Query Translation.
Res. Comput. Sci., 2014
Using annotations on Mechanical Turk to perform supervised polarity classification of Spanish customer comments.
Inf. Sci., 2014
Computación y Sistemas, 2014
ACM Comput. Surv., 2014
Comput. Informatics, 2014
English-to-Hindi system description for WMT 2014: Deep Source-Context Features for Moses.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014
Detailed Description of the Development of a MOOC in the Topic of Statistical Machine Translation.
Proceedings of the Human-Inspired Computing and Its Applications, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 3rd Workshop on Hybrid Approaches to Machine Translation, 2014
CHISPA on the GO: A mobile Chinese-Spanish translation service for travellers in trouble.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
An IR-Based Strategy for Supporting Chinese-Portuguese Translation Services in Off-line Mode.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014
2013
Automatic normalization of short texts by combining statistical and rule-based techniques.
Lang. Resour. Evaluation, 2013
Appl. Artif. Intell., 2013
The TALP-UPC Phrase-Based Translation Systems for WMT13: System Combination with Morphology Generation, Domain Adaptation and Corpus Filtering.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013
Morphological, Syntactical and Semantic Knowledge in Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
Evaluating Indirect Strategies for Chinese - Spanish Statistical Machine Translation: Extended Abstract.
Proceedings of the IJCAI 2013, 2013
Proceedings of the Second Workshop on Hybrid Approaches to Translation, 2013
2012
Knowl. Eng. Rev., 2012
Study and correlation analysis of linguistic, perceptual, and automatic machine translation evaluations.
J. Assoc. Inf. Sci. Technol., 2012
J. Artif. Intell. Res., 2012
Study and Comparison of Rule-Based and Statistical Catalan-Spanish Machine Translation Systems.
Comput. Informatics, 2012
Proceedings of Joint V Seminar on Ontology Research in Brazil and VII International Workshop on Metamodels, 2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012
Results from the ML4HMT-12 Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation.
Proceedings of the Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT@COLING 2012, 2012
2011
Evaluación de estrategias para la traducción automática estadística de chino a castellano con el inglés como lengua pivote.
Proces. del Leng. Natural, 2011
Overcoming statistical machine translation limitations: error analysis and proposed solutions for the Catalan-Spanish language pair.
Lang. Resour. Evaluation, 2011
Recursive alignment block classification technique for word reordering in statistical machine translation.
Lang. Resour. Evaluation, 2011
J. Intell. Inf. Syst., 2011
The BM-I2R Haitian-Créole-to-English translation system description for the WMT 2011 evaluation campaign.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011
Proceedings of Fifth Workshop on Syntax, 2011
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011
2010
Using collocation segmentation to extract translation units in a phrase-based statistical machine translation system.
Proces. del Leng. Natural, 2010
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010
Proceedings of the Advances in Natural Language Processing, 2010
Opinion Mining of Spanish Customer Comments with Non-Expert Annotations on Mechanical Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010
Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the International Conference on Language Resources and Evaluation, 2010
UPC-BMIC-VDU system description for the IWSLT 2010: testing several collocation segmentations in a phrase-based SMT system.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010
Where are you From? - Tell Me HOW you Write and I Will Tell you WHO you are.
Proceedings of the ICAART 2010 - Proceedings of the International Conference on Agents and Artificial Intelligence, Volume 1, 2010
Proceedings of the International Conference on Asian Language Processing, 2010
Linguistic-based Evaluation Criteria to identify Statistical Machine Translation Errors.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010
Integration of statistical collocation segmentations in a phrase-based statistical machine translation system.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010
Plagiarism Detection Using Information Retrieval and Similarity Measures Based on Image Processing Techniques - Lab Report for PAN at CLEF 2010.
Proceedings of the CLEF 2010 LABs and Workshops, 2010
2009
Proces. del Leng. Natural, 2009
State-of-the-Art Word Reordering Approaches in Statistical Machine Translation: A Survey.
IEICE Trans. Inf. Syst., 2009
Appl. Artif. Intell., 2009
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009
Barcelona media SMT system description for the IWSLT 2009: introducing source context information.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009
Improving a Catalan-Spanish Statistical Translation System using Morphosyntactic Knowledge.
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009
2008
TECNOPARLA - Speech technologies for Catalan and its application to Speech-to-speech Translation.
Proces. del Leng. Natural, 2008
Generación de múltiples hipótesis ponderadas de reordenamiento para un sistema de traducción automática estadística.
Proces. del Leng. Natural, 2008
Proceedings of the Third Workshop on Statistical Machine Translation, 2008
Using Reordering in Statistical Machine Translation based on Alignment Block Classification.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008
Computing multiple weighted reordering hypotheses for a phrase-based statistical machine translation system.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008
2007
Analysis of Statistical and Morphological Classes to Generate Weigthed Reordering Hypotheses on a Statistical Machine Translation System.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007
Ngram-Based Statistical Machine Translation Enhanced with Multiple Weighted Reordering Hypotheses.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007
Analysis and System Combination of Phrase- and N-Gram-Based Statistical Machine Translation Systems.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007
2006
Proces. del Leng. Natural, 2006
Proceedings of the Proceedings on the Workshop on Statistical Machine Translation, 2006
Proceedings of the Proceedings on the Workshop on Statistical Machine Translation, 2006
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006
2005
Proces. del Leng. Natural, 2005
Proceedings of Machine Translation Summit X: Papers, 2005
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005
Tuning a phrase-based statistical translation system for the IWSLT 2005 Chinese to English and Arabic to English tasks.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005
Improving Phrase-Based Statistical Translation by Modifying Phrase Extraction and Including Several Features.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005