Marcin Junczys-Dowmunt

According to our database, Marcin Junczys-Dowmunt authored at least 60 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







PyMarian: Fast Neural Machine Translation and Evaluation in Python.
CoRR, 2024

On Instruction-Finetuning Neural Machine Translation Models.
Proceedings of the Ninth Conference on Machine Translation, 2024

Evaluation and Large-scale Training for Contextual Machine Translation.
Proceedings of the Ninth Conference on Machine Translation, 2024

On-the-Fly Fusion of Large Language Models and Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SOTASTREAM: A Streaming Approach to Machine Translation Training.
CoRR, 2023

Escaping the sentence-level paradigm in machine translation.
CoRR, 2023

Cometoid: Distilling Strong Reference-based Machine Translation Metrics into Even Stronger Quality Estimation Metrics.
Proceedings of the Eighth Conference on Machine Translation, 2023

Perplexity-Driven Case Encoding Needs Augmentation for CAPITALIZATION Robustness.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Revisiting Locality Sensitive Hashing for Vocabulary Selection in Fast Neural Machine Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

On User Interfaces for Large-Scale Document-Level Human Evaluation of Machine Translation Outputs.
CoRR, 2021

To Ship or Not to Ship: An Extensive Evaluation of Automatic Metrics for Machine Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

The Curious Case of Hallucinations in Neural Machine Translation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Levenshtein Training for Word-level Quality Estimation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

From Research to Production and Back: Ludicrously Fast Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

Minimally-Augmented Grammatical Error Correction.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Achieving Human Parity on Automatic Chinese to English News Translation.
CoRR, 2018

Dual Conditional Cross-Entropy Filtering of Noisy Parallel Corpora.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

MS-UEdin Submission to the WMT2018 APE Shared Task: Dual-Source Transformer for Automatic Post-Editing.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Microsoft's Submission to the WMT2018 News Translation Task: How I Learned to Stop Worrying and Love the Data.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Near Human-Level Performance in Grammatical Error Correction with Hybrid Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Are we experiencing the Golden Age of Automatic Post-Editing?
Proceedings of the AMTA 2018 Workshop on Translation Quality Estimation and Automatic Post-Editing, 2018

Marian: Cost-effective High-Quality Neural Machine Translation in C++.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Marian: Fast Neural Machine Translation in C++.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

Pushing the Limits of Translation Quality Estimation.
Trans. Assoc. Comput. Linguistics, 2017

Syntax-aware Neural Machine Translation Using CCG.
CoRR, 2017

Predicting Target Language CCG Supertags Improves Neural Machine Translation.
Proceedings of the Second Conference on Machine Translation, 2017

The AMU-UEdin Submission to the WMT 2017 Shared Task on Automatic Post-Editing.
Proceedings of the Second Conference on Machine Translation, 2017

An Exploration of Neural Sequence-to-Sequence Architectures for Automatic Post-Editing.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Nematus: a Toolkit for Neural Machine Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Log-linear Combinations of Monolingual and Bilingual Neural Machine Translation Models for Automatic Post-Editing.
Proceedings of the First Conference on Machine Translation, 2016

The AMU-UEDIN Submission to the WMT16 News Translation Task: Attention-based NMT Models as Feature Functions in Phrase-based SMT.
Proceedings of the First Conference on Machine Translation, 2016

The United Nations Parallel Corpus v1.0.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The University of Edinburgh's systems submission to the MT task at IWSLT.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Is Neural Machine Translation Ready for Deployment? A Case Study on 30 Translation Directions.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Fast, Scalable Phrase-Based SMT Decoding.
Proceedings of the 12th Conferences of the Association for Machine Translation in the Americas: MT Researchers' Track, 2016

Target-Side Context for Discriminative Models in Statistical Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Reinvestigating the Classification Approach to the Article and Preposition Error Correction.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2015

Large scale speech-to-text translation with out-of-domain corpora using better context-based models and domain adaptation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Human Evaluation of Grammatical Error Correction Systems.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

SMT at the International Maritime Organization: experiences with combining in-house corpora with out-of-domain corpora.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

The WikEd Error Corpus: A Corpus of Corrective Wikipedia Edits and Its Application to Grammatical Error Correction.
Proceedings of the Advances in Natural Language Processing, 2014

SMT of German patents at WIPO: decompounding and verb structure pre-reordering.
Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

The AMU System in the CoNLL-2014 Shared Task: Grammatical Error Correction by Data-Intensive and Feature-Rich Statistical Machine Translation.
Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task, 2014

PSI-Toolkit: A Natural Language Processing Pipeline.
Proceedings of the Computational Linguistics - Applications, 2013

Phrasal Rank-Encoding: Exploiting Phrase Redundancy and Translational Relations for Phrase Table Compression.
Prague Bull. Math. Linguistics, 2012

A Genetic Programming Experiment in Natural Language Grammar Engineering.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

A Space-Efficient Phrase Table Implementation Using Minimal Perfect Hash Functions.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

A Phrase Table without Phrases: Rank Encoding for Better Phrase Table Compression.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

SyMGiza++: Symmetrized Word Alignment Models for Statistical Machine Translation.
Proceedings of the Security and Intelligent Information Systems, 2011

SyMGiza++: A Tool for Parallel Computation of Symmetrized Word Alignment Models.
Proceedings of the International Multiconference on Computer Science and Information Technology, 2010

A Maximum Entropy Approach to Syntactic Translation Rule Filtering.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2010

It's all about the trees: towards a hybrid syntax-based MT system.
Proceedings of the International Multiconference on Computer Science and Information Technology, 2009

Influence of accurate compound noun splitting on bilingual vocabulary extraction.
Proceedings of the Text Resources and Lexical Knowledge. Selected Papers from the 9th Conference on Natural Language Processing, 2008
