Kenton Murray

Orcid: 0000-0002-5628-1003

Affiliations:
  • Johns Hopkins University, Baltimore, MD, USA


According to our database1, Kenton Murray authored at least 45 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets.
CoRR, 2024

X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale.
CoRR, 2024

Preliminary WMT24 Ranking of General MT Systems and LLMs.
CoRR, 2024

Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models.
CoRR, 2024

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Where are you from? Geolocating Speech and Applications to Language Identification.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Exploring Geometric Representational Disparities between Multilingual and Bilingual Translation Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Can Synthetic Speech Improve End-to-End Conversational Speech Translation?
Proceedings of the 16th Conference of the Association for Machine Translation in the Americas, 2024

2023
Linking Symptom Inventories using Semantic Textual Similarity.
CoRR, 2023

MegaWika: Millions of reports and their sources across 50 diverse languages.
CoRR, 2023

Exploring Representational Disparities Between Multilingual and Bilingual Translation Models.
CoRR, 2023

Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet.
Proceedings of the Eighth Conference on Machine Translation, 2023


Condensing Multilingual Knowledge with Lightweight Language-Specific Modules.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Why Does Zero-Shot Cross-Lingual Generation Fail? An Explanation and a Solution.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Language Agnostic Code-Mixing Data Augmentation by Predicting Linguistic Patterns.
CoRR, 2022

Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022


The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models.
Proceedings of the Advances in Information Retrieval, 2022

Strategies for Adapting Multilingual Pre-training for Domain-Specific Machine Translation.
Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track), 2022

2021
Joint Universal Syntactic and Semantic Parsing.
Trans. Assoc. Comput. Linguistics, 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

Gradual Fine-Tuning for Low-Resource Domain Adaptation.
CoRR, 2021

Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Collecting Verified COVID-19 Question Answer Pairs.
Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020

The JHU Submission to the 2020 Duolingo Shared Task on Simultaneous Translation and Paraphrase for Language Education.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020

2019
Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Efficiency through Auto-Sizing: Notre Dame NLP's Submission to the WNGT 2019 Efficiency Task.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

2018
Incident-Driven Machine Translation and Name Tagging for Low-resource Languages.
Mach. Transl., 2018

An Empirical Exploration of Curriculum Learning for Neural Machine Translation.
CoRR, 2018

Correcting Length Bias in Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

2016
Probabilistic Neural Programs.
CoRR, 2016

Semantic Scan: Detecting Subtle, Spatially Localized Events in Text Streams.
CoRR, 2016

2015
Auto-Sizing Neural Networks: With Applications to n-gram Language Models.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
CoBaFi: collaborative bayesian filtering.
Proceedings of the 23rd International World Wide Web Conference, 2014

2013
The CMU Machine Translation Systems at WMT 2013: Syntax, Synthetic Translation Options, and Pseudo-References.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

QCRI at IWSLT 2013: experiments in Arabic-English and English-Arabic spoken language translation.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013


  Loading...