Niloofar Mireshghallah

ORCID: 0000-0003-4090-9756

According to our database, Niloofar Mireshghallah authored at least 52 papers between 2019 and 2024.


Bibliography

2024
Differentially Private Learning Needs Better Model Initialization and Self-Distillation.
CoRR, 2024

AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text.
CoRR, 2024

HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions.
CoRR, 2024

Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild.
CoRR, 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
CoRR, 2024

Breaking News: Case Studies of Generative AI's Use in Journalism.
CoRR, 2024

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs.
CoRR, 2024

Do Membership Inference Attacks Work on Large Language Models?
CoRR, 2024

A Roadmap to Pluralistic Alignment.
CoRR, 2024

LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on Cloud.
Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Position: A Roadmap to Pluralistic Alignment.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Smaller Language Models are Better Zero-shot Machine-Generated Text Detectors.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
Auditing and Mitigating Safety Risks in Large Language Models.
PhD thesis, 2023

Report of the 1st Workshop on Generative AI and Law.
CoRR, 2023

Misusing Tools in Large Language Models With Visual Adversarial Examples.
CoRR, 2023

LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud.
CoRR, 2023

Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization.
CoRR, 2023

Smaller Language Models are Better Black-box Machine-Generated Text Detectors.
CoRR, 2023

Simple Temporal Adaptation to Changing Label Sets: Hashtag Prediction via Dense KNN.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

Privacy-Preserving Domain Adaptation of Semantic Parsers.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Membership Inference Attacks against Language Models via Neighbourhood Comparison.
Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Non-Parametric Temporal Adaptation for Social Media Topic Classification.
CoRR, 2022

Memorization in NLP Fine-tuning Methods.
CoRR, 2022

Mix and Match: Learning-free Controllable Text Generation using Energy Language Models.
CoRR, 2022

Differentially Private Model Compression.
Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

What Does it Mean for a Language Model to Preserve Privacy?
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22), 2022

An Empirical Analysis of Memorization in Fine-tuned Autoregressive Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Quantifying Privacy Risks of Masked Language Models Using Membership Inference Attacks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Mix and Match: Learning-free Controllable Text Generation using Energy Language Models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Efficient Hyperparameter Optimization for Differentially Private Deep Learning.
CoRR, 2021

Benchmarking Differential Privacy and Federated Learning for BERT Models.
CoRR, 2021

When Differential Privacy Meets Interpretability: A Case Study.
CoRR, 2021

DP-SGD vs PATE: Which Has Less Disparate Impact on Model Accuracy?
CoRR, 2021

Privacy Regularization: Joint Privacy-Utility Optimization in Language Models.
CoRR, 2021

Not All Features Are Equal: Discovering Essential Features for Preserving Prediction Privacy.
Proceedings of The Web Conference 2021 (WWW '21), 2021

Privacy Regularization: Joint Privacy-Utility Optimization in Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

U-Noise: Learnable Noise Masks for Interpretable Image Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Style Pooling: Automatic Text Style Obfuscation for Improved Classification Fairness.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
ReLeQ: A Reinforcement Learning Approach for Automatic Deep Quantization of Neural Networks.
IEEE Micro, 2020

Privacy in Deep Learning: A Survey.
CoRR, 2020

A Principled Approach to Learning Stochastic Representations for Privacy in Deep Neural Inference.
CoRR, 2020

Gradient-Based Deep Quantization of Neural Networks through Sinusoidal Adaptive Regularization.
CoRR, 2020

Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks.
Proceedings of the 37th International Conference on Machine Learning, 2020

Neither Private Nor Fair: Impact of Data Imbalance on Utility and Fairness in Differential Privacy.
Proceedings of the 2020 Workshop on Privacy-Preserving Machine Learning in Practice (PPMLP '20), 2020

Shredder: Learning Noise Distributions to Protect Inference Privacy.
Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '20), 2020

2019
Energy-Efficient Permanent Fault Tolerance in Hard Real-Time Systems.
IEEE Trans. Computers, 2019

Shredder: Learning Noise to Protect Privacy with Partial DNN Inference on the Edge.
CoRR, 2019

