Kenneth Heafield

Orcid: 0000-0002-6344-9927

According to our database1, Kenneth Heafield authored at least 76 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The Llama 3 Herd of Models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
et al.
CoRR, 2024

Iterative Translation Refinement with Large Language Models.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Code-Switched Language Identification is Harder Than You Think.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Document-Level Machine Translation with Large-Scale Public Parallel Corpora.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Efficient Methods for Natural Language Processing: A Survey.
Trans. Assoc. Comput. Linguistics, 2023

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca.
CoRR, 2023

Cheating to Identify Hard Problems for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

An Open Dataset and Model for Language Identification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Efficient Methods for Natural Language Processing: A Survey.
CoRR, 2022

No Language Left Behind: Scaling Human-Centered Machine Translation.
CoRR, 2022

Exploring Diversity in Back Translation for Low-Resource Machine Translation.
CoRR, 2022

Findings of the WMT 2022 Shared Task on Efficient Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Edinburgh's Submission to the WMT 2022 Efficiency Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation Task.
Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation, 2022

Cheat Codes to Quantify Missing Source Information in Neural Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

The EuroPat Corpus: A Parallel Corpus of European Patent Data.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Constrained Regeneration for Cross-Lingual Query-Focused Extractive Summarization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Direct simultaneous speech to speech translation.
CoRR, 2021

Findings of the WMT 2021 Shared Task on Efficient Translation.
Proceedings of the Sixth Conference on Machine Translation, 2021

The University of Edinburgh's English-German and English-Hausa Submissions to the WMT21 News Translation Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

Pruning Neural Machine Translation for Speed Using Group Lasso.
Proceedings of the Sixth Conference on Machine Translation, 2021

Efficient Machine Translation with Model Pruning and Quantization.
Proceedings of the Sixth Conference on Machine Translation, 2021


TranslateLocally: Blazing-fast translation running on the local CPU.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

Gender bias amplification during Speed-Quality optimization in Neural Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation.
CoRR, 2020

Speed-optimized, Compact Student Models that Distill Knowledge from a Larger Teacher Model: the UEDIN-CUNI Submission to the WMT 2020 News Translation Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Losing Heads in the Lottery: Pruning Transformer Attention in Neural Machine Translation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

Findings of the Fourth Workshop on Neural Generation and Translation.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

Compressing Neural Machine Translation Models with 4-bit Precision.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

Parallel Sentence Mining by Constrained Decoding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020


In Neural Machine Translation, What Does Transfer Learning Transfer?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Neural Machine Translation with 4-Bit Precision and Beyond.
CoRR, 2019

Incorporating Source Syntax into Transformer-Based Neural Machine Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Surprise Languages: Rapid-Response Cross-Language IR.
Proceedings of the 9th International Workshop on Evaluating Information Access co-located with the 14th NTCIR Conference on the Evaluation of Information Access Technologies (NTCIR 2019), 2019

From Research to Production and Back: Ludicrously Fast Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Zero-Resource Neural Machine Translation with Monolingual Pivot Data.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Making Asynchronous Stochastic Gradient Descent Work for Transformers.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

2018
Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures.
CoRR, 2018

Findings of the WMT 2018 Shared Task on Parallel Corpus Filtering.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

The University of Edinburgh's Submissions to the WMT18 News Translation Task.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Multi-Source Syntactic Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Marian: Cost-effective High-Quality Neural Machine Translation in C++.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Fast Neural Machine Translation Implementation.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Neural Machine Translation Techniques for Named Entity Transliteration.
Proceedings of the Seventh Named Entities Workshop, 2018

Marian: Fast Neural Machine Translation in C++.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

2017
The University of Edinburgh's Neural MT Systems for WMT17.
Proceedings of the Second Conference on Machine Translation, 2017

Copied Monolingual Data Improves Low-Resource Neural Machine Translation.
Proceedings of the Second Conference on Machine Translation, 2017

Sparse Communication for Distributed Gradient Descent.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Normalized Log-Linear Interpolation of Backoff Language Models is Efficient.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Language Identification and Modeling in Specialized Hardware.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Stanford University's Submissions to the WMT 2014 Translation Task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Edinburgh's Phrase-based Machine Translation Systems for WMT-14.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

N-gram Counts and Language Models from the Common Crawl.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Faster Phrase-Based Decoding by Refining Feature State.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Edinburgh's Machine Translation Systems for European Language Pairs.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Grouping Language Model Boundary Words to Speed K-Best Extraction from Hypergraphs.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Scalable Modified Kneser-Ney Language Model Estimation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Language Model Rest Costs and Space-Efficient Storage.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

2011
CMU System Combination in WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

KenLM: Faster and Smaller Language Model Queries.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Left language model state for syntactic machine translation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

2010
Combining Machine Translation Output with Open SourceThe Carnegie Mellon Multi-Engine Machine Translation Scheme.
Prague Bull. Math. Linguistics, 2010

The Machine Translation Toolpack for LoonyBin: Automated Management of Experimental Machine Translation HyperWorkflows.
Prague Bull. Math. Linguistics, 2010

CMU Multi-Engine Machine Translation for WMT 2010.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Voting on N-grams for Machine Translation System Combination.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

2009
Machine Translation System Combination with Flexible Word Ordering.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

2008
Mining business topics in source code using latent dirichlet allocation.
Proceedings of the Proceeding of the 1st Annual India Software Engineering Conference, 2008


  Loading...