Jonathan May

Orcid: 0000-0002-5284-477X

According to our database1, Jonathan May authored at least 121 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
FoodPuzzle: Developing Large Language Model Agents as Flavor Scientists.
CoRR, 2024

BotEval: Facilitating Interactive Human Evaluation.
CoRR, 2024

Style Transfer with Multi-iteration Preference Optimization.
CoRR, 2024

GNOME: Generating Negotiations through Open-Domain Mapping of Exchanges.
CoRR, 2024

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length.
CoRR, 2024

Authorship Style Transfer with Policy Optimization.
CoRR, 2024

Multilingual Meta-Distillation Alignment for Semantic Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

LegalDiscourse: Interpreting When Laws Apply and To Whom.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Leitner-Guided Memory Replay for Cross-lingual Continual Learning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Can Language Model Moderators Improve the Health of Online Discourse?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

CPL-NoViD: Context-Aware Prompt-Based Learning for Norm Violation Detection in Online Communities.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

Are Large Language Models Capable of Generating Human-Level Narratives?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Explaining Mixtures of Sources in News Articles.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Speechworthy Instruction-tuned Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Tracking the Newsworthiness of Public Documents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning.
CoRR, 2023

Blend and Match: Distilling Semantic Search Models with Different Inductive Biases and Model Architectures.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Anger Breeds Controversy: Analyzing Controversy and Emotions on Reddit.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2023

Feedback Loops and Complex Dynamics of Harmful Speech in Online Discussions.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2023

Mega: Moving Average Equipped Gated Attention.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Identifying Informational Sources in News Articles.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Analyzing Norm Violations in Live-Stream Chat.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Challenges in Context-Aware Neural Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Continual Dialogue State Tracking via Example-Guided Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Bridging the Gap between Native Text and Translated Text through Adversarial Learning: A Case Study on Cross-Lingual Event Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Cross-lingual Continual Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Checks and Strategies for Enabling Code-Switched Machine Translation.
CoRR, 2022

Towards WinoQueer: Developing a Benchmark for Anti-Queer Bias in Large Language Models.
CoRR, 2022

NewsEdits: A News Article Revision Dataset and a Document-Level Reasoning Challenge.
CoRR, 2022

Cross-lingual Lifelong Learning.
CoRR, 2022

NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Augmenting Training Data for Massive Semantic Matching Models in Low-Traffic E-commerce Stores.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

Opponent Modeling in Negotiation Dialogues by Related Data Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Investigating the Benefits of Free-Form Rationales.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Know Thy Strengths: Comprehensive Dialogue State Tracking Diagnostics.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Machine Translation Robustness to Natural Asemantic Variation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Segmenting Numerical Substitution Ciphers.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
CheckDST: Measuring Real-World Generalization of Dialogue State Tracking Performance.
CoRR, 2021

Viola: A Topic Agnostic Generate-and-Rank Dialogue System.
CoRR, 2021

\textit{StateCensusLaws.org}: A Web Application for Consuming and Annotating Legal Discourse Learning.
CoRR, 2021

"Don't quote me on that": Finding Mixtures of Sources in News Articles.
CoRR, 2021

Modeling "Newsworthiness" for Lead-Generation Across Corpora.
CoRR, 2021

\textit{NewsEdits}: A Dataset of Revision Histories for News Articles (Technical Report: Data Processing).
CoRR, 2021

On the Strengths of Cross-Attention in Pretrained Transformers for Machine Translation.
CoRR, 2021

Multitask Learning for Class-Imbalanced Discourse Classification.
CoRR, 2021

Luna: Linear Unified Nested Attention.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Macro-Average: Rare Types Are Important Too.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Explaining Face Presentation Attack Detection Using Natural Language.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Salience-Aware Event Chain Modeling for Narrative Understanding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Multitask Semi-Supervised Learning for Class-Imbalanced Discourse Classification.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

PERFUME: Programmatic Extraction and Refinement for Usability of Mathematical Expression.
Proceedings of the Checkmate@CCS 2021, 2021

WARP: Word-level Adversarial ReProgramming.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Many-to-English Machine Translation Tools, Data, and Pretrained Models.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Can Sequence-to-Sequence Models Crack Substitution Ciphers?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Summary-Oriented Question Generation for Informational Queries.
Proceedings of the 1st Workshop on Document-grounded Dialogue and Conversational Question Answering, 2021

2020
Question Generation for Supporting Informational Query Intents.
CoRR, 2020

Zero-Shot Learning of Text Adventure Games with Sentence-Level Semantics.
CoRR, 2020

BERT in Negotiations: Early Prediction of Buyer-Seller Negotiation Outcomes.
CoRR, 2020

Neural Machine Translation with Imbalanced Classes.
CoRR, 2020

Cross-lingual Structure Transfer for Zero-resource Event Extraction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Learning to Generalize for Sequential Decision Making.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Connecting the Dots: Event Graph Schema Induction with Path Language Modeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Finding the Optimal Vocabulary Size for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Experience Grounds Language.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Grounding Conversations with Improvised Dialogues.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
A Universal Parent Model for Low-Resource Neural Machine Translation Transfer.
CoRR, 2019

Learn How to Cook a New Recipe in a New House: Using Map Familiarization, Curriculum Learning, and Common Sense to Learn Families of Text-Based Adventure Games.
CoRR, 2019

Cross-lingual Multi-Level Adversarial Transfer to Enhance Low-Resource Name Tagging.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Grounded Unsupervised Universal Part-of-Speech Tagger for Low-Resource Languages.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Cross-lingual Structure Transfer for Relation and Event Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Contextualized Cross-Lingual Event Trigger Extraction with Minimal Resources.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Comprehensible Context-driven Text Game Playing.
Proceedings of the IEEE Conference on Games, 2019

Translating Translationese: A Two-Step Approach to Unsupervised Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Cross-lingual Joint Entity and Word Embedding to Improve Entity Linking and Parallel Sentence Mining.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018
Incident-Driven Machine Translation and Name Tagging for Low-resource Languages.
Mach. Transl., 2018

Augmenting Statistical Machine Translation with Subword Translation of Out-of-Vocabulary Words.
CoRR, 2018

ELISA-EDL: A Cross-lingual Entity Extraction, Linking and Localization System.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Recurrent Neural Networks as Weighted Language Recognizers.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Translating a Language You Don't Know In the Chinese Room.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

Out-of-the-box Universal Romanization Tool uroman.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, System Demonstrations, 2018

2017
Recurrent Neural Networks as Weighted Language Recognizers.
CoRR, 2017

Liberal Entity Extraction: Rapid Construction of Fine-Grained Entity Typing Systems.
Big Data, 2017

SemEval-2017 Task 9: Abstract Meaning Representation Parsing and Generation.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Team ELISA System for DARPA LORELEI Speech Evaluation 2016.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Cross-lingual Name Tagging and Linking for 282 Languages.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Building a Fine-Grained Entity Typing System Overnight for a New X (X = Language, Domain, Genre).
CoRR, 2016

SemEval-2016 Task 8: Meaning Representation Parsing.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Simple, Fast Noise-Contrastive Estimation for Large RNN Vocabularies.
Proceedings of the NAACL HLT 2016, 2016

Extracting Structured Scholarly Information from the Machine Translation Literature.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Transfer Learning for Low-Resource Neural Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
Using Syntax-Based Machine Translation to Parse English into Abstract Meaning Representation.
CoRR, 2015

Parsing English into Abstract Meaning Representation Using Syntax-Based Machine Translation.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
An Arabizi-English social media statistical machine translation system.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Identifying Useful Human Correction Feedback from an On-Line Machine Translation Service.
Proceedings of the IJCAI 2013, 2013

Models of Translation Competitions.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Tuning as Ranking.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
Determinization of Weighted Tree Automata Using Factorizations.
J. Autom. Lang. Comb., 2010

Re-structuring, Re-labeling, and Re-aligning for Syntax-Based Machine Translation.
Comput. Linguistics, 2010

Efficient Inference through Cascades of Weighted Tree Transducers.
Proceedings of the ACL 2010, 2010

2009
Backward and forward bisimulation minimization of tree automata.
Theor. Comput. Sci., 2009

2008
Training Tree Transducers.
Comput. Linguistics, 2008

2007
Backward and Forward Bisimulation Minimisation of Tree Automata.
Proceedings of the Implementation and Application of Automata, 2007

Syntactic Re-Alignment Models for Machine Translation.
Proceedings of the EMNLP-CoNLL 2007, 2007

Bisimulation Minimisation for Weighted Tree Automata.
Proceedings of the Developments in Language Theory, 11th International Conference, 2007

2006
Tiburon: A Weighted Tree Automata Toolkit.
Proceedings of the Implementation and Application of Automata, 2006

A Better N-Best List: Practical Determinization of Weighted Finite Tree Automata.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

2003
Surprise! What's in a Cebuano or Hindi Name?
ACM Trans. Asian Lang. Inf. Process., 2003

Answer Selection and Confidence Estimation.
Proceedings of the New Directions in Question Answering, 2003

2002
TREC 2002 QA at BBN: Answer Selection and Confidence Estimation.
Proceedings of The Eleventh Text REtrieval Conference, 2002


  Loading...