Mike Lewis

CoRR, 2023

Scaling Expert Language Models with Unsupervised Domain Discovery.

[BibT_eX]

[DOI]

CoRR, 2023

LIMA: Less Is More for Alignment.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Coder Reviewer Reranking for Code Generation.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Retrieval-Augmented Multimodal Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Progressive Prompts: Continual Learning for Language Models.

[BibT_eX]

[DOI]

Anastasia Razdaibiedina

Proceedings of the Eleventh International Conference on Learning Representations, 2023

InCoder: A Generative Model for Code Infilling and Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

AutoReply: Detecting Nonsense in Dialogue with Discriminative Replies.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Measuring and Narrowing the Compositionality Gap in Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Residual Prompt Tuning: improving prompt tuning with residual reparameterization.

[BibT_eX]

[DOI]

Anastasia Razdaibiedina

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Nonparametric Masked Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Contrastive Decoding: Open-ended Text Generation as Optimization.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

In-context Examples Selection for Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines.

[BibT_eX]

[DOI]

CoRR, 2022

AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies.

[BibT_eX]

[DOI]

CoRR, 2022

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale.

[BibT_eX]

[DOI]

CoRR, 2022

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Few-shot Mining of Naturally Occurring Inputs and Outputs.

[BibT_eX]

[DOI]

CoRR, 2022

CM3: A Causal Masked Multimodal Model of the Internet.

[BibT_eX]

[DOI]

CoRR, 2022

GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MetaICL: Learning to Learn In Context.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

DEMix Layers: Disentangling Domains for Modular Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Tricks for Training Sparse Translation Models.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation.

[BibT_eX]

[DOI]

Ofir Press

Noah A. Smith

Proceedings of the Tenth International Conference on Learning Representations, 2022

8-bit Optimizers via Block-wise Quantization.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

HTLM: Hyper-Text Pre-Training and Prompting of Language Models.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Passage Retrieval with Zero-Shot Question Generation.

[BibT_eX]

[DOI]

Devendra Singh Sachan

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Noisy Channel Language Model Prompting for Few-Shot Text Classification.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Question Answering Infused Pre-training of General-Purpose Contextualized Representations.

[BibT_eX]

[DOI]

Robin Jia

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Sparse Distillation: Speeding Up Text Classification by Using Bigger Models.

[BibT_eX]

[DOI]

CoRR, 2021

Multitasking Inhibits Semantic Drift.

[BibT_eX]

[DOI]

Athul Paul Jacob

Michael Sejr Schlichtkrull

Jacob Andreas

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BASE Layers: Simplifying Training of Large, Sparse Models.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Nearest Neighbor Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Joint Verification and Reranking for Open Fact Checking Over Tables.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Shortformer: Better Language Modeling using Shorter Inputs.

[BibT_eX]

[DOI]

Ofir Press

Noah A. Smith

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Multilingual Denoising Pre-training for Neural Machine Translation.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2020

Conversational Semantic Parsing.

[BibT_eX]

[DOI]

CoRR, 2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pre-training via Paraphrasing.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generalization through Memorization: Nearest Neighbor Language Models.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

Grounded Adaptation for Zero-shot Executable Semantic Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Conversational Semantic Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Asking and Answering Questions to Evaluate the Factual Consistency of Summaries.

[BibT_eX]

[DOI]

Alex Wang

Kyunghyun Cho

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

CoRR, 2019

RoBERTa: A Robustly Optimized BERT Pretraining Approach.

[BibT_eX]

[DOI]

CoRR, 2019

MelNet: A Generative Model for Audio in the Frequency Domain.

[BibT_eX]

[DOI]

Sean Vasquez

CoRR, 2019

Improving Semantic Parsing for Task Oriented Dialog.

[BibT_eX]

[DOI]

CoRR, 2019

Hierarchical Decision Making by Generating and Following Natural Language Instructions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Generative Question Answering: Learning to Answer the Whole Question.

[BibT_eX]

[DOI]

Angela Fan

Proceedings of the 7th International Conference on Learning Representations, 2019

Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Community Regularization of Visually-Grounded Dialog.

[BibT_eX]

[DOI]

Akshat Agarwal

Swaminathan Gurumurthy

Vasu Sharma

Katia P. Sycara

Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Strategies for Structuring Story Generation.

[BibT_eX]

[DOI]

Angela Fan

Yann N. Dauphin

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Evaluating Visual Reasoning through Grounded Language Understanding.

[BibT_eX]

[DOI]

AI Mag., 2018

Common ground control system (CGCS) to support autonomous object observation, collection, and response in multi-domain environments.

[BibT_eX]

[DOI]

Paul C. Hershey

Mike Sica

Proceedings of the 2018 Annual IEEE International Systems Conference, 2018

Hierarchical Text Generation and Planning for Strategic Dialogue.

[BibT_eX]

[DOI]

Denis Yarats

Proceedings of the 35th International Conference on Machine Learning, 2018

Semantic Parsing for Task Oriented Dialog using Hierarchical Representations.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Compositional Denotational Semantics for Question Answering.

[BibT_eX]

[DOI]

Nitish Gupta

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Dataset for Telling the Stories of Social Media Videos.

[BibT_eX]

[DOI]

Spandana Gella

Marcus Rohrbach

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Hierarchical Neural Story Generation.

[BibT_eX]

[DOI]

Angela Fan

Yann N. Dauphin

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Deal or No Deal? End-to-End Learning of Negotiation Dialogues.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

End-to-end Neural Coreference Resolution.

[BibT_eX]

[DOI]

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Corpus of Natural Language for Visual Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Deep Semantic Role Labeling: What Works and What's Next.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

LSTM CCG Parsing.

[BibT_eX]

[DOI]

Kenton Lee

Proceedings of the NAACL HLT 2016, 2016

Global Neural CCG Parsing with Optimality Guarantees.

[BibT_eX]

[DOI]

Kenton Lee

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Human-in-the-Loop Parsing.

[BibT_eX]

[DOI]

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015

Joint A* CCG Parsing and Semantic Role Labelling.

[BibT_eX]

[DOI]

Luheng He

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language.

[BibT_eX]

[DOI]

Luheng He

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014

Improved CCG Parsing with Semi-supervised Supertagging.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2014

Extracting common sense knowledge from text for robot planning.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

A* CCG Parsing with a Supertag-factored Model.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013

Combined Distributional and Logical Semantics.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2013

Migrating To The Cloud: Lessons And Limitations Of 'Traditional' IS Success Models.

[BibT_eX]

[DOI]

Imran Khan Azeemi

Theo Tryfonas

Proceedings of the Conference on Systems Engineering Research, 2013

Grounded spatial symbols for task planning based on experience.

[BibT_eX]

[DOI]

Proceedings of the 13th IEEE-RAS International Conference on Humanoid Robots, 2013

Unsupervised Induction of Cross-Lingual Semantic Relations.

[BibT_eX]

[DOI]