David Vilar

Affiliations:
  • RWTH Aachen University


According to our database1, David Vilar authored at least 65 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data.
CoRR, 2024

Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms.
CoRR, 2024

Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Quality Control at Your Fingertips: Quality-Aware Translation Models.
CoRR, 2023

There's No Data like Better Data: Using QE Metrics for MT Data Filtering.
Proceedings of the Eighth Conference on Machine Translation, 2023

Prompting PaLM for Translation: Assessing Strategies and Performance.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
A Natural Diet: Towards Improving Naturalness of Machine Translation Output.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Scaling Up Influence Functions.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Statistical Extension of Byte-Pair Encoding.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Controlling Machine Translation for Multiple Attributes with Additive Interventions.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Sockeye 2: A Toolkit for Neural Machine Translation.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

2019
Automatic error classification with multiple error labels.
Proceedings of Machine Translation Summit XVII Volume 1: Research Track, 2019

2018
Learning Hidden Unit Contribution for Adapting Neural Machine Translation Models.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

The Sockeye Neural Machine Translation Toolkit at AMTA 2018.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

2017
Sockeye: A Toolkit for Neural Machine Translation.
CoRR, 2017

2014
Involving language professionals in the evaluation of machine translation.
Lang. Resour. Evaluation, 2014

The taraXÜ corpus of human-annotated machine translations.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Simple and Effective Approach for Consistent Training of Hierarchical Phrase-based Translation Models.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

2013
A Performance Study of Cube Pruning for Large-Scale Hierarchical Machine Translation.
Proceedings of the Seventh Workshop on Syntax, 2013

What can we learn about the selection mechanism for post-editing?
Proceedings of the 2nd Workshop on Post-editing Technology and Practice, 2013

Learning from Human Judgments of Machine Translation Output.
Proceedings of Machine Translation Summit XIV: Posters, 2013

A CCG-based Quality Estimation Metric for Statistical Machine Translation Learning from Human Judgments of Machine Translation Output.
Proceedings of Machine Translation Summit XIV: Posters, 2013

2012
Jane: an advanced freely available hierarchical machine translation toolkit.
Mach. Transl., 2012

Cardinality pruning and language model heuristics for hierarchical phrase-based translation.
Mach. Transl., 2012

DFKI's SMT System for WMT 2012.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Involving Language Professionals in the Evaluation of Machine Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Towards the Integration of MT into a LSP Translation Workflow.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

2011
Investigations on hierarchical phrase-based machine translation.
PhD thesis, 2011

A Guide to Jane, an Open Source Hierarchical Translation Toolkit.
Prague Bull. Math. Linguistics, 2011

DFKI Hybrid Machine Translation System for WMT 2011 - On the Integration of SMT and RBMT.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Evaluation without references: IBM1 scores as evaluation metrics.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Evaluate with Confidence Estimation: Machine ranking of translation outputs using grammatical features.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

DFKI's SC and MT submissions to IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Lightly-Supervised Training for Hierarchical Phrase-Based Machine Translation.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

Advancements in Arabic-to-English Hierarchical Machine Translation.
Proceedings of the 15th Annual conference of the European Association for Machine Translation, 2011

2010
Jane: Open Source Hierarchical Translation, Extended with Reordering and Lexicon Models.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

If i only had a parser: poor man's syntax for hierarchical machine translation.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

The RWTH aachen machine translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

A combination of hierarchical systems with forced alignments from phrase-based systems.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

A Cocktail of Deep Syntactic Features for Hierarchical Machine Translation.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

2009
The RWTH Machine Translation System for WMT 2009.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

On LM Heuristics for the Cube Growing Algorithm.
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

2008
Using a Bilingual Context in Word-Based Statistical Machine Translation.
Proceedings of the Pattern Recognition in Information Systems, 2008

The RWTH machine translation system for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Analysing soft syntax features and heuristics for hierarchical phrase based machine translation.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

2007
Can We Translate Letters?
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

Human Evaluation of Machine Translation Through Binary System Comparisons.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

Bridging the Gap between Naive Bayes and Maximum Entropy Text Classification.
Proceedings of the Pattern Recognition in Information Systems, 2007

Analysis and System Combination of Phrase- and N-Gram-Based Statistical Machine Translation Systems.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

The RWTH machine translation system for IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

2006
Error Analysis of Statistical Machine Translation Output.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

AER: do we need to "improve" our alignments?
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

2005
Statistical Machine Translation of European Parliamentary Speeches.
Proceedings of Machine Translation Summit X: Papers, 2005

Comparison of generation strategies for interactive machine translation.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

Augmenting a Small Parallel Text with Morpho-Syntactic Language.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Preprocessing and Normalization for Automatic Evaluation of Machine Translation.
Proceedings of the Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization@ACL 2005, 2005

Novel Reordering Approaches in Phrase-Based Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

2004
Multi-label Text Classification Using Multinomial Models.
Proceedings of the Advances in Natural Language Processing, 4th International Conference, 2004

Effect of Feature Smoothing Methods in Text Classification Tasks.
Proceedings of the Pattern Recognition in Information Systems, 2004

2003
Connectionist classification and specific stochastic models in the understanding process of a dialogue system.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Uniclass and Multiclass Connectionist Classification of Dialogue Acts.
Proceedings of the Progress in Pattern Recognition, 2003

Dialogue Act Classification in a Spoken Dialogue System.
Proceedings of the Current Topics in Artificial Intelligence, 2003


  Loading...