Eric Wallace

According to our database¹, Eric Wallace authored at least 53 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Bibliography

2025

Trading Inference-Time Compute for Adversarial Robustness.

Wojciech Zaremba

,

Evgenia Nitishinskaya

,

,

,

,

,

,

,

,

Johannes Heidecke

,

CoRR, January, 2025

2024

OpenAI o1 System Card.

,

,

,

Adam Richardson

,

Ahmed El-Kishky

,

,

,

Aleksander Madry

,

,

,

,

,

Alex Tachard Passos

,

Alexander Neitz

,

Alexander Prokofiev

,

,

,

,

,

,

,

Andrew Duberstein

,

Andrew Kondrich

,

Andrey Mishchenko

,

,

,

,

,

Behrooz Ghorbani

,

,

Benjamin Sokolowsky

,

,

,

,

,

,

Brandon Houghton

,

Brandon McKinzie

,

,

Camillo Lugaresi

,

,

,

,

Charles de Bourcy

,

,

,

,

,

,

Christopher Hesse

,

Claudia Fischer

,

,

,

,

,

,

,

,

,

,

Dimitris Tsipras

,

,

,

,

,

,

Elizabeth Proehl

,

,

,

,

,

,

,

Felipe Petroski Such

,

,

Florencia Leoni

,

Foivos Tsimpourlas

,

,

Fred von Lohmann

,

,

,

Giambattista Parascandolo

,

,

,

,

Guillaume Leclerc

,

,

,

,

,

Hessam Bagherinezhad

,

,

Hunter Lightman

,

Hyung Won Chung

,

,

,

,

Ignasi Clavera Gilaberte

,

CoRR, 2024

Deliberative Alignment: Reasoning Enables Safer Language Models.

,

,

,

,

,

,

,

,

,

,

Hyung Won Chung

,

,

Johannes Heidecke

,

,

CoRR, 2024

Predicting Emergent Capabilities by Finetuning.

CoRR, 2024

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions.

,

,

,

,

Johannes Heidecke

,

CoRR, 2024

Unfamiliar Finetuning Examples Control How Language Models Hallucinate.

,

,

Claire J. Tomlin

,

,

CoRR, 2024

Privacy Side Channels in Machine Learning Systems.

Edoardo Debenedetti

,

,

,

Christopher A. Choquette-Choo

,

Matthew Jagielski

,

,

Nicholas Carlini

,

Florian Tramèr

Proceedings of the 33rd USENIX Security Symposium, 2024

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation.

,

,

,

,

,

Jacob Steinhardt

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Stealing part of a production language model.

Nicholas Carlini

,

,

Krishnamurthy Dj Dvijotham

,

,

Jonathan Hayase

,

A. Feder Cooper

,

,

Matthew Jagielski

,

,

,

,

,

Florian Tramèr

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore.

,

Suchin Gururangan

,

,

,

Hannaneh Hajishirzi

,

,

Luke Zettlemoyer

Proceedings of the Twelfth International Conference on Learning Representations, 2024

The False Promise of Imitating Proprietary Language Models.

Arnav Gudibande

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

What Evidence Do Language Models Find Convincing?

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Scalable Extraction of Training Data from (Production) Language Models.

,

Nicholas Carlini

,

Jonathan Hayase

,

Matthew Jagielski

,

A. Feder Cooper

,

Daphne Ippolito

,

Christopher A. Choquette-Choo

,

,

Florian Tramèr

,

CoRR, 2023

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore.

,

Suchin Gururangan

,

,

Hannaneh Hajishirzi

,

,

Luke Zettlemoyer

CoRR, 2023

The False Promise of Imitating Proprietary LLMs.

Arnav Gudibande

,

,

,

,

,

,

,

CoRR, 2023

Extracting Training Data from Diffusion Models.

Nicholas Carlini

,

,

,

Matthew Jagielski

,

,

Florian Tramèr

,

,

Daphne Ippolito

,

Proceedings of the 32nd USENIX Security Symposium, 2023

Poisoning Language Models During Instruction Tuning.

Proceedings of the International Conference on Machine Learning, 2023

Large Language Models Struggle to Learn Long-Tail Knowledge.

Proceedings of the International Conference on Machine Learning, 2023

Measuring Forgetting of Memorized Training Examples.

Matthew Jagielski

,

,

Florian Tramèr

,

Daphne Ippolito

,

,

Nicholas Carlini

,

,

,

Abhradeep Guha Thakurta

,

Nicolas Papernot

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

InCoder: A Generative Model for Code Infilling and Synthesis.

,

Armen Aghajanyan

,

,

,

,

,

,

,

Luke Zettlemoyer

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Deduplicating Training Data Mitigates Privacy Risks in Language Models.

Proceedings of the International Conference on Machine Learning, 2022

Analyzing Dynamic Adversarial Training Data in the Limit.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Automated Crossword Solving.

,

Nicholas Tomlin

,

,

,

,

Matthew L. Ginsberg

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models.

Robert L. Logan IV

,

Ivana Balazevic

,

,

,

,

Sebastian Riedel

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Calibrate Before Use: Improving Few-Shot Performance of Language Models.

CoRR, 2021

Extracting Training Data from Large Language Models.

Nicholas Carlini

,

Florian Tramèr

,

,

Matthew Jagielski

,

Ariel Herbert-Voss

,

,

,

,

,

Úlfar Erlingsson

,

,

Proceedings of the 30th USENIX Security Symposium, 2021

Detoxifying Language Models Risks Marginalizing Minority Voices.

,

,

,

Suchin Gururangan

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Concealed Data Poisoning Attacks on NLP Models.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Calibrate Before Use: Improving Few-shot Performance of Language Models.

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Customizing Triggers with Concealed Data Poisoning.

CoRR, 2020

Trustworthy AI Inference Systems: An Industry Research View.

CoRR, 2020

Evaluating NLP Models via Contrast Sets.

CoRR, 2020

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers.

,

,

,

,

,

,

Joseph E. Gonzalez

CoRR, 2020

Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers.

Proceedings of the 37th International Conference on Machine Learning, 2020

Gradient-based Analysis of NLP Models is Manipulable.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Imitation Attacks and Defenses for Black-box Machine Translation Systems.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Interpreting Predictions of NLP Models.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, 2020

AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts.

,

Yasaman Razeghi

,

Robert L. Logan IV

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Evaluating Models' Local Decision Boundaries via Contrast Sets.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Pretrained Transformers Improve Out-of-Distribution Robustness.

,

,

,

,

Rishabh Krishnan

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples.

,

Pedro Rodriguez

,

,

,

Jordan L. Boyd-Graber

Trans. Assoc. Comput. Linguistics, 2019

Universal Adversarial Triggers for NLP.

CoRR, 2019

Understanding Impacts of High-Order Loss Approximations and Features in Deep Learning Interpretation.

Proceedings of the 36th International Conference on Machine Learning, 2019

Do NLP Models Know Numbers? Probing Numeracy in Embeddings.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models.

,

,

,

Sanjay Subramanian

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Universal Adversarial Triggers for Attacking and Analyzing NLP.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Compositional Questions Do Not Necessitate Multi-hop Reasoning.

,

,

,

,

Hannaneh Hajishirzi

,

Luke Zettlemoyer

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Misleading Failures of Partial-input Baselines.

,

,

Jordan L. Boyd-Graber

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions.

,

Pedro Rodriguez

,

,

Jordan L. Boyd-Graber

CoRR, 2018

Right Answer for the Wrong Reason: Discovery and Mitigation.

,

,

,

Pedro Rodriguez

,

Alvin Grissom II

,

Jordan L. Boyd-Graber

CoRR, 2018

Interpreting Neural Networks with Nearest Neighbors.

,

,

Jordan L. Boyd-Graber

Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Pathologies of Neural Models Make Interpretation Difficult.

,

,

Alvin Grissom II

,

,

Pedro Rodriguez

,

Jordan L. Boyd-Graber

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions.

,

Jordan L. Boyd-Graber

Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018
