André F. T. Martins

Orcid: 0000-0001-8282-625X

Affiliations:
  • University of Lisbon, Instituto de Telecomunicaçoes, Portugal
  • Carnegie Mellon University, Pittsburgh, USA (former)


According to our database1, André F. T. Martins authored at least 173 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
xcomet : Transparent Machine Translation Evaluation through Fine-grained Error Detection.
Trans. Assoc. Comput. Linguistics, 2024

Assessing the Role of Context in Chat Translation Evaluation: Is Context Helpful and Under What Conditions?
Trans. Assoc. Comput. Linguistics, 2024

EuroLLM: Multilingual Language Models for Europe.
CoRR, 2024

Reranking Laws for Language Generation: A Communication-Theoretic Perspective.
CoRR, 2024

DOCE: Finding the Sweet Spot for Execution-Based Code Generation.
CoRR, 2024

How Effective are State Space Models for Machine Translation?
CoRR, 2024

A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models.
CoRR, 2024

xTower: A Multilingual LLM for Explaining and Correcting Translation Errors.
CoRR, 2024

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks.
CoRR, 2024

QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation.
CoRR, 2024

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples.
CoRR, 2024

Conformal Prediction for Natural Language Processing: A Survey.
CoRR, 2024

Is Context Helpful for Chat Translation Evaluation?
CoRR, 2024

Did Translation Models Get More Robust Without Anyone Even Noticing?
CoRR, 2024

SaulLM-7B: A pioneering Large Language Model for Law.
CoRR, 2024

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks.
CoRR, 2024

CroissantLLM: A Truly Bilingual French-English Language Model.
CoRR, 2024

MaLA-500: Massive Language Adaptation of Large Language Models.
CoRR, 2024

Sparse and Structured Hopfield Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Non-Exchangeable Conformal Risk Control.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Can Automatic Metrics Assess High-Quality Translations?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aligning Neural Machine Translation Models: Human Feedback in Training and Inference.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

The Center for Responsible AI Project.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

Non-Exchangeable Conformal Language Generation with Nearest Neighbors.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
Efficient Methods for Natural Language Processing: A Survey.
Trans. Assoc. Comput. Linguistics, 2023

Hallucinations in Large Multilingual Translation Models.
Trans. Assoc. Comput. Linguistics, 2023

Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation.
Trans. Assoc. Comput. Linguistics, 2023

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation.
CoRR, 2023

Conformalizing Machine Translation Evaluation.
CoRR, 2023

mPLM-Sim: Unveiling Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
CoRR, 2023

Discrete Latent Structure in Neural Networks.
CoRR, 2023

Scaling up CometKiwi: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Findings of the WMT 2023 Shared Task on Quality Estimation.
Proceedings of the Eighth Conference on Machine Translation, 2023

An Empirical Study of Translation Hypothesis Ensembling with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Empirical Assessment of kNN-MT for Real-World Translation Scenarios.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Python Code Generation by Asking Clarification Questions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Sparse Continuous Distributions and Fenchel-Young Losses.
J. Mach. Learn. Res., 2022

Asking Clarification Questions for Code Generation in General-Purpose Programming Language.
CoRR, 2022

Improving abstractive summarization with energy-based re-ranking.
CoRR, 2022

Efficient Methods for Natural Language Processing: A Survey.
CoRR, 2022

Efficient Machine Translation Domain Adaptation.
CoRR, 2022

Better Uncertainty Quantification for Machine Translation Evaluation.
CoRR, 2022

Findings of the WMT 2022 Shared Task on Quality Estimation.
Proceedings of the Seventh Conference on Machine Translation, 2022

CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

COMET-22: Unbabel-IST 2022 Submission for the Metrics Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Results of WMT22 Metrics Shared Task: Stop Using BLEU - Neural Metrics Are Better and More Robust.
Proceedings of the Seventh Conference on Machine Translation, 2022

Findings of the WMT 2022 Shared Task on Chat Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Robust MT Evaluation with Sentence-level Multilingual Augmentation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Unbabel-IST at the WMT Chat Translation Shared Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Learning to Scaffold: Optimizing Model Explanations for Teaching.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Quality-Aware Decoding for Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Modeling Structure with Undirected Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

Sparse Communication via Mixed Distributions.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Disentangling Uncertainty in Machine Translation Evaluation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Chunk-based Nearest Neighbor Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

QUARTZ: Quality-Aware Machine Translation.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Searching for COMETINHO: The Little Metric That Could.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

DeepSPIN: Deep Structured Prediction for Natural Language Processing.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Differentiable Causal Discovery Under Latent Interventions.
Proceedings of the 1st Conference on Causal Learning and Reasoning, 2022

∞-former: Infinite Memory Transformer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Predicting Attention Sparsity in Transformers.
Proceedings of the Sixth Workshop on Structured Prediction for NLP, 2022

2021
When Does Translation Require Context? A Data-driven, Multilingual Exploration.
CoRR, 2021

Reconciling the Discrete-Continuous Divide: Towards a Mathematical Theory of Sparse Communication.
CoRR, 2021

IST-Unbabel 2021 Submission for the Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Findings of the WMT 2021 Shared Task on Quality Estimation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Are References Really Needed? Unbabel-IST 2021 Submission for the Metrics Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Smoothing and Shrinking the Sparse Seq2Seq Search Space.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Sparse And Structured Visual Attention.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Multimodal Continuous Visual Attention Mechanisms.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

IST-Unbabel 2021 Submission for the Explainable Quality Estimation Shared Task.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

SPECTRA: Sparse Structured Text Rationalization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Uncertainty-Aware Machine Translation Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Do Context-Aware Translation Models Pay the Right Attention?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Measuring and Increasing Context Usage in Context-Aware Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Learning with Fenchel-Young losses.
J. Mach. Learn. Res., 2020

MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset.
CoRR, 2020

Towards Prediction Explainability through Sparse Communication.
CoRR, 2020

Findings of the WMT 2020 Shared Task on Quality Estimation.
Proceedings of the Fifth Conference on Machine Translation, 2020

IST-Unbabel Participation in the WMT20 Quality Estimation Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Findings of the WMT 2020 Shared Task on Chat Translation.
Proceedings of the Fifth Conference on Machine Translation, 2020

One-Size-Fits-All Multilingual Models.
Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, 2020

Sparse and Continuous Attention Mechanisms.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

LP-SparseMAP: Differentiable Relaxed Optimization for Sparse Structured Prediction.
Proceedings of the 37th International Conference on Machine Learning, 2020

Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Sparse Text Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Project MAIA: Multilingual AI Agent Assistant.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

DeepSPIN: Deep Structured Prediction for Natural Language Processing.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

Document-level Neural MT: A Systematic Comparison.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

Learning Non-Monotonic Automatic Post-Editing of Translations from Human Orderings.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

The Explanation Game: Towards Prediction Explainability through Sparse Communication.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Revisiting Higher-Order Dependency Parsers.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Notes on Latent Structure Models and SPIGOT.
CoRR, 2019

Unbabel's Submission to the WMT2019 APE Shared Task: BERT-Based Encoder-Decoder for Automatic Post-Editing.
Proceedings of the Fourth Conference on Machine Translation, 2019

Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the WMT 2019 Shared Tasks on Quality Estimation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Jointly Extracting and Compressing Documents with Summary State Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Selective Attention for Context-aware Neural Machine Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Translator2Vec: Understanding and Representing Human Post-Editors.
Proceedings of Machine Translation Summit XVII Volume 1: Research Track, 2019

Adaptively Sparse Transformers.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Learning Classifiers with Fenchel-Young Losses: Generalized Entropies, Margins, and Algorithms.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Sparse Sequence-to-Sequence Models.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Scheduled Sampling for Transformers.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Latent Structure Models for Natural Language Processing.
Proceedings of the 57th Conference of the Association for Computational Linguistics: Tutorial Abstracts, 2019

Joint Learning of Named Entity Recognition and Entity Linking.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

OpenKiwi: An Open Source Framework for Quality Estimation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Findings of the WMT 2018 Shared Task on Quality Estimation.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Contextual Neural Model for Translating Bilingual Multi-Speaker Conversations.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

SparseMAP: Differentiable Sparse Structured Inference.
Proceedings of the 35th International Conference on Machine Learning, 2018

Interpretable Structure Induction via Sparse Attention.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Towards Dynamic Computation Graphs via Sparse Latent Structure.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Sparse and Constrained Attention for Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Marian: Fast Neural Machine Translation in C++.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

2017
Pushing the Limits of Translation Quality Estimation.
Trans. Assoc. Comput. Linguistics, 2017

Unbabel's Participation in the WMT17 Translation Quality Estimation Shared Task.
Proceedings of the Second Conference on Machine Translation, 2017

Learning What's Easy: Fully Differentiable Neural Easy-First Taggers.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Unbabel's Participation in the WMT16 Word-Level Translation Quality Estimation Shared Task.
Proceedings of the First Conference on Machine Translation, 2016

SUMMA at TAC Knowledge Base Population Task 2016.
Proceedings of the 2016 Text Analysis Conference, 2016

From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Semi-Supervised Learning of Sequence Models with Method of Moments.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Jointly Learning to Embed and Predict with Multiple Languages.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
AD<sup>3</sup>: alternating directions dual decomposition for MAP inference in graphical models.
J. Mach. Learn. Res., 2015

Lisbon: Evaluating TurboSemanticParser on Multiple Languages and Out-of-Domain Data.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Transferring Coreference Resolvers with Posterior Regularization.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Parsing as Reduction.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Aligning Opinions: Cross-Lingual Opinion Mining with Dependencies.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Frame-Semantic Parsing.
Comput. Linguistics, 2014

Priberam: A Turbo Semantic Parser with Second Order Features.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Priberam Compressive Summarization Corpus: A New Multi-Document Summarization Corpus for European Portuguese.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Joint Model for Quotation Attribution and Coreference Resolution.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

2013
On the Combination of Information-Theoretic Kernels with Generative Embeddings.
Proceedings of the Similarity-Based Pattern Analysis and Recognition, 2013

Combining information theoretic kernels with generative embeddings for classification.
Neurocomputing, 2013

Linguistic structure prediction with the sparseptron.
XRDS, 2013

Turning on the Turbo: Fast Third-Order Non-Projective Turbo Parsers.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Fast and Robust Compressive Summarization with Dual Decomposition and Multi-Task Learning.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Alternating Directions Dual Decomposition
CoRR, 2012

An Exact Dual Decomposition Algorithm for Shallow Semantic Parsing with Constraints.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Structured Sparsity in Natural Language Processing: Models, Algorithms and Applications.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

2011
Online Learning of Structured Predictors with Multiple Kernels.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Renal Cancer Cell Classification Using Generative Embeddings and Information Theoretic Kernels.
Proceedings of the Pattern Recognition in Bioinformatics, 2011

An Augmented Lagrangian Approach to Constrained MAP Inference.
Proceedings of the 28th International Conference on Machine Learning, 2011

Structured Sparsity in Structured Prediction.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Dual Decomposition with Many Overlapping Components.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
Information Theoretical Kernels for Generative Embeddings Based on Hidden Markov Models.
Proceedings of the Structural, 2010

2D Shape Recognition Using Information Theoretic Kernels.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Combining free energy score spaces with information theoretic kernels: Application to scene classification.
Proceedings of the International Conference on Image Processing, 2010

Turbo Parsers: Dependency Parsing by Approximate Variational Inference.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Nonextensive Information Theoretic Kernels on Measures.
J. Mach. Learn. Res., 2009

Polyhedral outer approximations with application to natural language parsing.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Concise Integer Linear Programming Formulations for Dependency Parsing.
Proceedings of the ACL 2009, 2009

2008
Nonextensive Generalizations of the Jensen-Shannon Divergence
CoRR, 2008

Tsallis kernels on measures.
Proceedings of the 2008 IEEE Information Theory Workshop, 2008

Nonextensive entropic kernels.
Proceedings of the Machine Learning, 2008

Stacking Dependency Parsers.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Priberam's Question Answering system in QA@CLEF 2008.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008) , 2008

2007
Priberam's Question Answering System in QA@CLEF 2007.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007

2006
An integrated evaluation method for e-learning: a case study.
Interact. Technol. Smart Educ., 2006

Web-Based Support for Resource-Effective E-Learning.
Proceedings of the WEBIST 2006, 2006

Priberam's Question Answering System in a Cross-Language Environment.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006

2005
Orientation in Manhattan: Equiprojective Classes and Sequential Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

Priberam's Question Answering System for Portuguese.
Proceedings of the Accessing Multilingual Information Repositories, 2005

2004
Design and Implementation of a Semantic Search Engine for Portuguese.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2003
Navigating in Manhattan: 3D orientation from video without correspondences.
Proceedings of the 2003 International Conference on Image Processing, 2003


  Loading...