Mike Lewis

According to our database1, Mike Lewis authored at least 94 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Law of the Weakest Link: Cross Capabilities of Large Language Models.
CoRR, 2024

MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts.
CoRR, 2024

Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training.
CoRR, 2024

Effective Long-Context Scaling of Foundation Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

REPLUG: Retrieval-Augmented Black-Box Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Trusting Your Evidence: Hallucinate Less with Context-aware Decoding.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Efficient Streaming Language Models with Attention Sinks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

In-Context Pretraining: Language Modeling Beyond Document Boundaries.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

RA-DIT: Retrieval-Augmented Dual Instruction Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Self-Alignment with Instruction Backtranslation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
LegoNN: Building Modular Encoder-Decoder Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Questions Are All You Need to Train a Dense Passage Retriever.
Trans. Assoc. Comput. Linguistics, 2023

Contrastive Decoding Improves Reasoning in Large Language Models.
CoRR, 2023

Scaling Expert Language Models with Unsupervised Domain Discovery.
CoRR, 2023

LIMA: Less Is More for Alignment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Coder Reviewer Reranking for Code Generation.
Proceedings of the International Conference on Machine Learning, 2023

Retrieval-Augmented Multimodal Language Modeling.
Proceedings of the International Conference on Machine Learning, 2023

Progressive Prompts: Continual Learning for Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

InCoder: A Generative Model for Code Infilling and Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AutoReply: Detecting Nonsense in Dialogue with Discriminative Replies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Measuring and Narrowing the Compositionality Gap in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Residual Prompt Tuning: improving prompt tuning with residual reparameterization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Nonparametric Masked Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Contrastive Decoding: Open-ended Text Generation as Optimization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

In-context Examples Selection for Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines.
CoRR, 2022

AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies.
CoRR, 2022

LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale.
CoRR, 2022

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models.
CoRR, 2022

Few-shot Mining of Naturally Occurring Inputs and Outputs.
CoRR, 2022

CM3: A Causal Masked Multimodal Model of the Internet.
CoRR, 2022

GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

MetaICL: Learning to Learn In Context.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

DEMix Layers: Disentangling Domains for Modular Language Modeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Tricks for Training Sparse Translation Models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

8-bit Optimizers via Block-wise Quantization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

HTLM: Hyper-Text Pre-Training and Prompting of Language Models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Passage Retrieval with Zero-Shot Question Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Noisy Channel Language Model Prompting for Few-Shot Text Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Question Answering Infused Pre-training of General-Purpose Contextualized Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Sparse Distillation: Speeding Up Text Classification by Using Bigger Models.
CoRR, 2021

Multitasking Inhibits Semantic Drift.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BASE Layers: Simplifying Training of Large, Sparse Models.
Proceedings of the 38th International Conference on Machine Learning, 2021

Nearest Neighbor Machine Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Joint Verification and Reranking for Open Fact Checking Over Tables.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Shortformer: Better Language Modeling using Shorter Inputs.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Multilingual Denoising Pre-training for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2020

Conversational Semantic Parsing.
CoRR, 2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pre-training via Paraphrasing.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Generalization through Memorization: Nearest Neighbor Language Models.
Proceedings of the 8th International Conference on Learning Representations, 2020

Grounded Adaptation for Zero-shot Executable Semantic Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Conversational Semantic Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Asking and Answering Questions to Evaluate the Factual Consistency of Summaries.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models.
CoRR, 2019

RoBERTa: A Robustly Optimized BERT Pretraining Approach.
CoRR, 2019

MelNet: A Generative Model for Audio in the Frequency Domain.
CoRR, 2019

Improving Semantic Parsing for Task Oriented Dialog.
CoRR, 2019

Hierarchical Decision Making by Generating and Following Natural Language Instructions.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Generative Question Answering: Learning to Answer the Whole Question.
Proceedings of the 7th International Conference on Learning Representations, 2019

Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Community Regularization of Visually-Grounded Dialog.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Strategies for Structuring Story Generation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Evaluating Visual Reasoning through Grounded Language Understanding.
AI Mag., 2018

Common ground control system (CGCS) to support autonomous object observation, collection, and response in multi-domain environments.
Proceedings of the 2018 Annual IEEE International Systems Conference, 2018

Hierarchical Text Generation and Planning for Strategic Dialogue.
Proceedings of the 35th International Conference on Machine Learning, 2018

Semantic Parsing for Task Oriented Dialog using Hierarchical Representations.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Compositional Denotational Semantics for Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A Dataset for Telling the Stories of Social Media Videos.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Hierarchical Neural Story Generation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Deal or No Deal? End-to-End Learning of Negotiation Dialogues.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

End-to-end Neural Coreference Resolution.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Corpus of Natural Language for Visual Reasoning.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Deep Semantic Role Labeling: What Works and What's Next.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
LSTM CCG Parsing.
Proceedings of the NAACL HLT 2016, 2016

Global Neural CCG Parsing with Optimality Guarantees.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Human-in-the-Loop Parsing.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
Joint A* CCG Parsing and Semantic Role Labelling.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
Improved CCG Parsing with Semi-supervised Supertagging.
Trans. Assoc. Comput. Linguistics, 2014

Extracting common sense knowledge from text for robot planning.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

A* CCG Parsing with a Supertag-factored Model.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Combined Distributional and Logical Semantics.
Trans. Assoc. Comput. Linguistics, 2013

Migrating To The Cloud: Lessons And Limitations Of 'Traditional' IS Success Models.
Proceedings of the Conference on Systems Engineering Research, 2013

Grounded spatial symbols for task planning based on experience.
Proceedings of the 13th IEEE-RAS International Conference on Humanoid Robots, 2013

Unsupervised Induction of Cross-Lingual Semantic Relations.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2007
HMDB: the Human Metabolome Database.
Nucleic Acids Res., 2007


  Loading...