Danqi Chen

Orcid: 0000-0001-5308-2634

Affiliations:
  • Princeton University, NJ, USA
  • Stanford University, USA (PhD 2018)


According to our database1, Danqi Chen authored at least 97 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization.
CoRR, 2024

HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly.
CoRR, 2024

How to Train Long-Context Language Models (Effectively).
CoRR, 2024

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval.
CoRR, 2024

Representing Rule-based Chatbots with Transformers.
CoRR, 2024

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs.
CoRR, 2024

Finding Transformer Circuits with Edge Pruning.
CoRR, 2024

SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors.
CoRR, 2024

Fantastic Copyrighted Beasts and How (Not) to Generate Them.
CoRR, 2024

AI Risk Management Should Incorporate Both Safety and Security.
CoRR, 2024

Certifiably Robust RAG against Retrieval Corruption.
CoRR, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward.
CoRR, 2024

Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training.
CoRR, 2024

Foundational Challenges in Assuring Alignment and Safety of Large Language Models.
CoRR, 2024

Reliable, Adaptable, and Attributable Language Models with Retrieval.
CoRR, 2024

Improving Language Understanding from Screenshots.
CoRR, 2024

Language Models as Science Tutors.
CoRR, 2024

LESS: Selecting Influential Data for Targeted Instruction Tuning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

QuRating: Selecting High-Quality Data for Training Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Interpretability Illusions in the Generalization of Simplified Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


Evaluating Large Language Models at Evaluating Instruction Following.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Detecting Pretraining Data from Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LitSearch: A Retrieval Benchmark for Scientific Literature Search.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Long-Context Language Modeling with Parallel Context Encoding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

CSTS: Conditional Semantic Textual Similarity.
CoRR, 2023

SIGIR 2023 Workshop on Retrieval Enhanced Machine Learning (REML @ SIGIR 2023).
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Fine-Tuning Language Models with Just Forward Passes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Transformer Programs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Kernel-Based View of Language Model Fine-Tuning.
Proceedings of the International Conference on Machine Learning, 2023

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Poisoning Retrieval Corpora by Injecting Adversarial Passages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Privacy Implications of Retrieval-Based Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enabling Large Language Models to Generate Text with Citations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

C-STS: Conditional Semantic Textual Similarity.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Adapting Language Models to Compress Contexts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Should You Mask 15% in Masked Language Modeling?
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Retrieval-based Language Models and Applications.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

Training Trajectories of Language Models Across Scales.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Optimizing Test-Time Query Representations for Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Controllable Text Generation with Language Constraints.
CoRR, 2022

Refining Query Representations for Dense Retrieval at Test Time.
CoRR, 2022

Recovering Private Text in Federated Learning of Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Can Rationalization Improve Robustness?
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Training Language Models with Memory Augmentation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Generating Natural Language Proofs with Verifier-Guided Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Don't Prompt, Search! Mining-based Zero-Shot Learning with Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

MABEL: Attenuating Gender Bias using Textual Entailment Data.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Finding Dataset Shortcuts with Grammar Induction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Structured Pruning Learns Compact and Accurate Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Ditch the Gold Standard: Re-evaluating Conversational Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Factual Probing Is [MASK]: Learning vs. Learning to Recall.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Frustratingly Easy Approach for Entity and Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Non-Parametric Few-Shot Learning for Word Sense Disambiguation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Simple Entity-Centric Questions Challenge Dense Retrievers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

SimCSE: Simple Contrastive Learning of Sentence Embeddings.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Single-dataset Experts for Multi-dataset Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Phrase Retrieval Learns Passage Retrieval, Too.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Learning Dense Representations of Phrases at Scale.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Making Pre-trained Language Models Better Few-shot Learners.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
SpanBERT: Improving Pre-training by Representing and Predicting Spans.
Trans. Assoc. Comput. Linguistics, 2020

A Frustratingly Easy Approach for Joint Entity and Relation Extraction.
CoRR, 2020

Dense Passage Retrieval for Open-Domain Question Answering.
CoRR, 2020

All-round and Accurate Online Education Model amid COVID19.
Proceedings of the IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, 2020


Dense Passage Retrieval for Open-Domain Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TextHide: Tackling Data Privacy for Language Understanding Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Open-Domain Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

2019
CoQA: A Conversational Question Answering Challenge.
Trans. Assoc. Comput. Linguistics, 2019

Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering.
CoRR, 2019

RoBERTa: A Robustly Optimized BERT Pretraining Approach.
CoRR, 2019

A Discrete Hard EM Approach for Weakly Supervised Question Answering.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

MRQA 2019 Shared Task: Evaluating Generalization in Reading Comprehension.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Neural reading comprehension and beyond.
PhD thesis, 2018

2017
Position-aware Attention and Supervised Data Improve Slot Filling.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Reading Wikipedia to Answer Open-Domain Questions.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Stanford at TAC KBP 2016: Sealing Pipeline Leaks and Understanding Chinese.
Proceedings of the 2016 Text Analysis Conference, 2016

SecHDFS: A Secure Data Allocation Scheme for Heterogenous Hadoop Systems.
Proceedings of the IEEE International Conference on Networking, 2016

A secure data allocation solution for heterogeneous Hadoop systems: SecHDFS.
Proceedings of the 35th IEEE International Performance Computing and Communications Conference, 2016

A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Bootstrapped Self Training for Knowledge Base Population.
Proceedings of the 2015 Text Analysis Conference, 2015

Representing Text for Joint Embedding of Text and Knowledge Bases.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Observed versus latent features for knowledge base and text inference.
Proceedings of the 3rd Workshop on Continuous Vector Space Models and their Compositionality, 2015

2014
A Fast and Accurate Dependency Parser using Neural Networks.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Learning New Facts From Knowledge Bases With Neural Tensor Networks and Semantic Word Vectors
Proceedings of the 1st International Conference on Learning Representations, 2013

Reasoning With Neural Tensor Networks for Knowledge Base Completion.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

2012
Beyond ten blue links: enabling user click modeling in federated web search.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

2011
Characterizing Inverse Time Dependency in Multi-class Learning.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

PhenOMIM: An OMIM-based secondary database purported for phenotypic comparison.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011

2009
Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony.
Proceedings of the Fifth International Conference on Image and Graphics, 2009


  Loading...