Marta R. Costa-jussà

Orcid: 0000-0002-5703-520X

  • Universitat Politècnica de Catalunya, Barcelona, Spain

According to our database, Marta R. Costa-jussà authored at least 224 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.









Large Concept Models: Language Modeling in a Sentence Representation Space.
CoRR, 2024

Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation.
CoRR, 2024

2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset.
CoRR, 2024

LCFO: Long Context and Long Form Output Dataset and Benchmarking.
CoRR, 2024

On the Role of Speech Data in Reducing Toxicity Detection Bias.
CoRR, 2024

Linguini: A benchmark for language-agnostic linguistic reasoning.
CoRR, 2024

Towards Massive Multilingual Holistic Bias.
CoRR, 2024

A Primer on the Inner Workings of Transformer-based Language Models.
CoRR, 2024

SpiRit-LM: Interleaved Spoken and Written Language Model.
CoRR, 2024

Towards Red Teaming in Multimodal and Multilingual Translation.
CoRR, 2024

On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unveiling the Role of Pretraining in Direct Speech Translation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ReSeTOX: Re-learning attention weights for toxicity mitigation in machine translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

SpeechAlign: A Framework for Speech Translation Alignment Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Pushing the Limits of Zero-shot End-to-End Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Towards lifelong human assisted speaker diarization.
Comput. Speech Lang., 2023

Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

Gender-specific Machine Translation with Large Language Models.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023

Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Efficient Speech Translation with Dynamic Latent Perceivers.
Proceedings of the IEEE International Conference on Acoustics, 2023

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Toxicity in Multilingual Machine Translation at Scale.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Explaining How Transformers Use Context to Build Predictions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Multilingual Machine Translation: Deep Analysis of Language-Specific Encoder-Decoders.
J. Artif. Intell. Res., 2022

Toxicity in Multilingual Machine Translation at Scale.
CoRR, 2022

No Language Left Behind: Scaling Human-Centered Machine Translation.
CoRR, 2022

A multi-task semi-supervised framework for Text2Graph & Graph2Text.
CoRR, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022

OccGen: Selection of Real-world Multilingual Parallel Data Balanced in Gender within Occupations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

Evaluating Gender Bias in Speech Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Measuring the Mixing of Contextual Information in the Transformer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Neural Machine Translation for Kashmiri to English and Hindi using Pre-trained Embeddings.
Proceedings of the OITS International Conference on Information Technology, 2022

On the Locality of Attention in Direct Speech Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Linguistic knowledge-based vocabularies for Neural Machine Translation.
Nat. Lang. Eng., 2021

Extensive study on the underlying gender bias in contextualized word embeddings.
Neural Comput. Appl., 2021

AI reflections in 2020.
Nat. Mach. Intell., 2021

Towards universal translation.
Nat. Mach. Intell., 2021

Semantic and syntactic information for neural machine translation.
Mach. Transl., 2021

From bilingual to multilingual neural-based machine translation by incremental training.
J. Assoc. Inf. Sci. Technol., 2021

Efficient Transformer for Direct Speech Translation.
CoRR, 2021

UPC's Speech Translation System for IWSLT 2021.
CoRR, 2021

How to Write a Bias Statement: Recommendations for Submissions to the Workshop on Gender Bias in NLP.
CoRR, 2021

Sparsely Factored Neural Machine Translation.
CoRR, 2021

High Frequent In-domain Words Segmentation and Forward Translation for the WMT21 Biomedical Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach.
Proceedings of the Sixth Conference on Machine Translation, 2021

Enriching the Transformer with Linguistic Factors for Low-Resource Machine Translation.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Impact of COVID-19 in Natural Language Processing Publications: a Disaggregated Study in Gender, Contribution and Experience.
Proceedings of the First Workshop on Language Technology for Equality, 2021

End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Multi-Task Learning for Improving Gender Accuracy in Neural Machine Translation.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Enabling Zero-Shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

WinoST: Evaluating Gender Bias in Speech Translation.
Dataset, October, 2020

Catalan United Nations v1.0 test set.
Dataset, June, 2020

AMALEU: Una Representación Universal del Lenguaje basada en Aprendizaje Automático.
Proces. del Leng. Natural, 2020

Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters.
CoRR, 2020

Training Multilingual Machine Translation by Alternately Freezing Language-Specific Encoders-Decoders.
CoRR, 2020

MT-Adapted Datasheets for Datasets: Template and Repository.
CoRR, 2020

Enriching the Transformer with Linguistic and Semantic Factors for Low-Resource Machine Translation.
CoRR, 2020

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction.
Comput. Linguistics, 2020

The IPN-CIC team system submission for the WMT 2020 similar language task.
Proceedings of the Fifth Conference on Machine Translation, 2020

The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT.
Proceedings of the Fifth Conference on Machine Translation, 2020

Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages.
Proceedings of the Fifth Conference on Machine Translation, 2020

Findings of the First Shared Task on Lifelong Learning Machine Translation.
Proceedings of the Fifth Conference on Machine Translation, 2020

GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Abusive language in Spanish children and young teenager's conversations: data preparation and short text classification with contextual word embeddings.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Automatic Spanish Translation of SQuAD Dataset for Multi-lingual Question Answering.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Refinement of Unsupervised Cross-Lingual Word Embeddings.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, 2020

Continual Lifelong Learning in Natural Language Processing: A Survey.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Combining Subword Representations into Word-level Representations in the Transformer Architecture.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

Syntax-driven Iterative Expansion Language Models for Controllable Text Generation.
Proceedings of the Fourth Workshop on Structured Prediction for NLP@EMNLP 2020, 2020

Chinese-Catalan: A Neural Machine Translation Approach Based on Pivoting and Attention Mechanisms.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2019

An analysis of gender bias studies in natural language processing.
Nat. Mach. Intell., 2019

Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering.
CoRR, 2019

Towards Interlingua Neural Machine Translation.
CoRR, 2019

Joint Source-Target Self Attention with Locality Constraints.
CoRR, 2019

Evaluating the Underlying Gender Bias in Contextualized Word Embeddings.
CoRR, 2019

Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques.
CoRR, 2019

The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT.
Proceedings of the Fourth Conference on Machine Translation, 2019

Terminology-Aware Segmentation and Domain Feature for the WMT19 Biomedical Translation Task.
Proceedings of the Fourth Conference on Machine Translation, 2019

The TALP-UPC System for the WMT Similar Language Task: Statistical vs Neural Machine Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the 2019 Conference on Machine Translation (WMT19).
Proceedings of the Fourth Conference on Machine Translation, 2019

Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Impact of Gender Debiased Word Embeddings in Language Modeling.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2019

From Bilingual to Multilingual Neural Machine Translation by Incremental Training.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

From Feature To Paradigm: Deep Learning In Machine Translation.
J. Artif. Intell. Res., 2018

Experimental Research on Encoder-Decoder Architectures with Attention for Chatbots.
Computación y Sistemas, 2018

(Self-Attentive) Autoencoder-based Universal Language Representation for Machine Translation.
CoRR, 2018

English-Catalan Neural Machine Translation in the Biomedical Domain through the cascade approach.
CoRR, 2018

Neural Machine Translation with the Transformer and Multi-Source Romance Languages for the Biomedical WMT 2018 task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

The TALP-UPC Machine Translation Systems for WMT18 News Shared Translation Task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

A Neural Approach to Language Variety Translation.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Chatbol, a Chatbot for the Spanish "La Liga".
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

From Feature to Paradigm: Deep Learning in Machine Translation (Extended Abstract).
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A differentiable BLEU loss. Analysis and first results.
Proceedings of the 6th International Conference on Learning Representations, 2018

End-to-End Speech Translation with the Transformer.
Proceedings of the Fourth International Conference, 2018

Panel discussion on Speech technologies: Industry and Academy.
Proceedings of the Fourth International Conference, 2018

Coverage for Character Based Neural Machine Translation.
Proces. del Leng. Natural, 2017

Generación morfológica con algoritmos de aprendizaje profundo integrada en un sistema de traducción automática estadística.
Proces. del Leng. Natural, 2017

DeepVoice: Tecnologías de Aprendizaje Profundo aplicadas al Procesado de Voz y Audio.
Proces. del Leng. Natural, 2017

Chinese-Spanish neural machine translation enhanced with character and word bitmap fonts.
Mach. Transl., 2017

Introduction to the special issue on deep learning approaches for machine translation.
Comput. Speech Lang., 2017

Tradares: A Tool for the Automatic Evaluation of Human Translation Quality within a MOOC Environment.
Appl. Artif. Intell., 2017

The TALP-UPC Neural Machine Translation System for German/Finnish-English Using the Inverse Direction Model in Rescoring.
Proceedings of the Second Conference on Machine Translation, 2017

Why Catalan-Spanish Neural Machine Translation? Analysis, comparison and combination with standard Rule and Phrase-based technologies.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Character-level Intra Attention Network for Natural Language Inference.
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017

Bridging deep and kernel methods.
Proceedings of the 25th European Symposium on Artificial Neural Networks, 2017

Byte-based Neural Machine Translation.
Proceedings of the First Workshop on Subword and Character Level Models in NLP, 2017

Description of the Chinese-to-Spanish Rule-Based Machine Translation System Developed Using a Hybrid Combination of Human Annotation and Statistical Techniques.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2016

A deep source-context feature for lexical selection in statistical machine translation.
Pattern Recognit. Lett., 2016

Integración de Paradigmas de Traducción Automática.
Proces. del Leng. Natural, 2016

Selection of correction candidates for the normalization of Spanish user-generated content.
Nat. Lang. Eng., 2016

Introduction to the Special Issue on Cross-Language Algorithms and Applications.
J. Artif. Intell. Res., 2016

Morphology Generation for Statistical Machine Translation using Deep Learning Techniques.
CoRR, 2016

WMT 2016 Multimodal Translation System Description based on Bidirectional Recurrent Neural Networks with Double-Embeddings.
Proceedings of the First Conference on Machine Translation, 2016

The TALP-UPC Spanish-English WMT Biomedical Task: Bilingual Embeddings and Char-based Neural Language Model Rescoring in a Phrase-based System.
Proceedings of the First Conference on Machine Translation, 2016

Integration of machine translation paradigms.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

Combining Phrase and Neural-Based Machine Translation: What Worked and Did Not.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016

Moses-based official baseline for NEWS 2016.
Proceedings of the Sixth Named Entity Workshop, 2016

Character-based Neural Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Polibits, 2015

Towards human linguistic machine translation evaluation.
Digit. Scholarsh. Humanit., 2015

Domain adaptation strategies in statistical machine translation: a brief overview.
Knowl. Eng. Rev., 2015

How much hybridization does machine translation Need?
J. Assoc. Inf. Sci. Technol., 2015

Segmentation Strategies to Face Morphology Challenges in Brazilian-Portuguese/English Statistical Machine Translation and Its Integration in Cross-Language Information Retrieval.
Computación y Sistemas, 2015

Comput. Speech Lang., 2015

Latest trends in hybrid machine translation and its applications.
Comput. Speech Lang., 2015

Ongoing Study for Enhancing Chinese-Spanish Translation with Morphology Strategies.
Proceedings of the Fourth Workshop on Hybrid Approaches to Translation, 2015

Is there Hope for Interlingua methods? A CLIR Comparison Experiment between Interlingua and Query Translation.
Res. Comput. Sci., 2014

Using annotations on Mechanical Turk to perform supervised polarity classification of Spanish customer comments.
Inf. Sci., 2014

On-line and Off-line Chinese-Portuguese Translation Service for Mobile Applications.
Computación y Sistemas, 2014

Statistical machine translation enhancements through linguistic levels: A survey.
ACM Comput. Surv., 2014

A Large Spanish-Catalan Parallel Corpus Release for Machine Translation.
Comput. Informatics, 2014

English-to-Hindi system description for WMT 2014: Deep Source-Context Features for Moses.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Detailed Description of the Development of a MOOC in the Topic of Statistical Machine Translation.
Proceedings of the Human-Inspired Computing and Its Applications, 2014

A client mobile application for Chinese-Spanish statistical machine translation.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Chinese-to-Spanish rule-based machine translation system.
Proceedings of the 3rd Workshop on Hybrid Approaches to Machine Translation, 2014

CHISPA on the GO: A mobile Chinese-Spanish translation service for travellers in trouble.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

An IR-Based Strategy for Supporting Chinese-Portuguese Translation Services in Off-line Mode.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Automatic normalization of short texts by combining statistical and rule-based techniques.
Lang. Resour. Evaluation, 2013

Cross-Language Document Retrieval by using nonlinear Semantic Mapping.
Appl. Artif. Intell., 2013

The TALP-UPC Phrase-Based Translation Systems for WMT13: System Combination with Morphology Generation, Domain Adaptation and Corpus Filtering.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Morphological, Syntactical and Semantic Knowledge in Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Evaluating Indirect Strategies for Chinese - Spanish Statistical Machine Translation: Extended Abstract.
Proceedings of the IJCAI 2013, 2013

Workshop on Hybrid Approaches to Translation: Overview and Developments.
Proceedings of the Second Workshop on Hybrid Approaches to Translation, 2013

An overview of the phrase-based statistical machine translation techniques.
Knowl. Eng. Rev., 2012

Study and correlation analysis of linguistic, perceptual, and automatic machine translation evaluations.
J. Assoc. Inf. Sci. Technol., 2012

Evaluating Indirect Strategies for Chinese-Spanish Statistical Machine Translation.
J. Artif. Intell. Res., 2012

Study and Comparison of Rule-Based and Statistical Catalan-Spanish Machine Translation Systems.
Comput. Informatics, 2012

Initial Approaches on Cross-Lingual Information Retrieval Using SMT on User-Queries.
Proceedings of Joint V Seminar on Ontology Research in Brazil and VII International Workshop on Metamodels, 2012

Holaaa!! writin like u talk is kewl but kinda hard 4 NLP.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

BUCEADOR, a multi-language search engine for digital libraries.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Results from the ML4HMT-12 Shared Task on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid Machine Translation.
Proceedings of the Second Workshop on Applying Machine Learning Techniques to Optimise the Division of Labour in Hybrid MT@COLING 2012, 2012

Evaluación de estrategias para la traducción automática estadística de chino a castellano con el inglés como lengua pivote.
Proces. del Leng. Natural, 2011

Overcoming statistical machine translation limitations: error analysis and proposed solutions for the Catalan-Spanish language pair.
Lang. Resour. Evaluation, 2011

Recursive alignment block classification technique for word reordering in statistical machine translation.
Lang. Resour. Evaluation, 2011

A vector-space dynamic feature for phrase-based statistical machine translation.
J. Intell. Inf. Syst., 2011

The BM-I2R Haitian-Créole-to-English translation system description for the WMT 2011 evaluation campaign.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

A Semantic Feature for Statistical Machine Translation.
Proceedings of Fifth Workshop on Syntax, 2011

Enhancing scarce-resource language translation through pivot combinations.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Using collocation segmentation to extract translation units in a phrase-based statistical machine translation system.
Proces. del Leng. Natural, 2010

Using Collocation Segmentation to Augment the Phrase Table.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

A Non-linear Semantic Mapping Technique for Cross-Language Sentence Matching.
Proceedings of the Advances in Natural Language Processing, 2010

Opinion Mining of Spanish Customer Comments with Non-Expert Annotations on Mechanical Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

UPC-BMIC-VDU system description for the IWSLT 2010: testing several collocation segmentations in a phrase-based SMT system.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Where are you From? - Tell Me HOW you Write and I Will Tell you WHO you are.
Proceedings of ICAART 2010 - International Conference on Agents and Artificial Intelligence, Volume 1, 2010

Sentence Similarity-Based Source Context Modelling in PBSMT.
Proceedings of the International Conference on Asian Language Processing, 2010

Linguistic-based Evaluation Criteria to identify Statistical Machine Translation Errors.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

Integration of statistical collocation segmentations in a phrase-based statistical machine translation system.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

Plagiarism Detection Using Information Retrieval and Similarity Measures Based on Image Processing Techniques - Lab Report for PAN at CLEF 2010.
Proceedings of the CLEF 2010 LABs and Workshops, 2010

Extracción crosslingüe de documentos usando mapas semánticos no-lineales.
Proces. del Leng. Natural, 2009

State-of-the-Art Word Reordering Approaches in Statistical Machine Translation: A Survey.
IEICE Trans. Inf. Syst., 2009

An Ngram-based reordering model.
Comput. Speech Lang., 2009

Phrase and Ngram-Based Statistical Machine Translation System Combination.
Appl. Artif. Intell., 2009

The TALP-UPC Phrase-Based Translation System for EACL-WMT 2009.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Barcelona Media SMT system description for the IWSLT 2009.
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009

Barcelona media SMT system description for the IWSLT 2009: introducing source context information.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Improving a Catalan-Spanish Statistical Translation System using Morphosyntactic Knowledge.
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

TECNOPARLA - Speech technologies for Catalan and its application to Speech-to-speech Translation.
Proces. del Leng. Natural, 2008

Generación de múltiples hipótesis ponderadas de reordenamiento para un sistema de traducción automática estadística.
Proces. del Leng. Natural, 2008

The TALP-UPC Ngram-Based Statistical Machine Translation System for ACL-WMT 2008.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Using Reordering in Statistical Machine Translation based on Alignment Block Classification.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

The TALP&I2r SMT systems for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Computing multiple weighted reordering hypotheses for a phrase-based statistical machine translation system.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

Analysis of Statistical and Morphological Classes to Generate Weighted Reordering Hypotheses on a Statistical Machine Translation System.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

Ngram-Based Statistical Machine Translation Enhanced with Multiple Weighted Reordering Hypotheses.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

Analysis and System Combination of Phrase- and N-Gram-Based Statistical Machine Translation Systems.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

The TALP n-gram-based SMT system for IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

Smooth Bilingual N-Gram Translation.
Proceedings of the EMNLP-CoNLL 2007, 2007

Sistema Estadístico de Reordenamiento de Palabras en Traducción Automática.
Proces. del Leng. Natural, 2006

N-gram-based Machine Translation.
Comput. Linguistics, 2006

N-gram-based SMT System Enhanced with Reordering Patterns.
Proceedings of the Workshop on Statistical Machine Translation, 2006

TALP Phrase-based statistical translation system for European language pairs.
Proceedings of the Workshop on Statistical Machine Translation, 2006

Machine Translation System Development Based on Human Likeness.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Continuous space language models for the IWSLT 2006 task.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

The TALP n-gram-based SMT system for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

TALP phrase-based system and TALP system combination for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

Statistical Machine Reordering.
Proceedings of the EMNLP 2006, 2006

Técnicas mejoradas para la traducción basada en frases.
Proces. del Leng. Natural, 2005

Bilingual N-gram Statistical Machine Translation.
Proceedings of Machine Translation Summit X: Papers, 2005

N-gram-based versus phrase-based statistical machine translation.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Tuning a phrase-based statistical translation system for the IWSLT 2005 Chinese to English and Arabic to English tasks.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Improving Phrase-Based Statistical Translation by Modifying Phrase Extraction and Including Several Features.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005
