2024

Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations.

[DOI]

CoRR, 2024

Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations.

[DOI]

,

,

,

,

,

,

,

Zacharie Delpierre Coudert

,

Kartikeya Upasani

,

Mahesh Pasupuleti

CoRR, 2024

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks.

[DOI]

,

,

,

,

,

,

CoRR, 2024

Persistent Pre-Training Poisoning of LLMs.

[DOI]

,

,

,

,

Eric Michael Smith

,

Nicholas Carlini

,

Florian Tramèr

,

Daphne Ippolito

CoRR, 2024

Backtracking Improves Generation Safety.

[DOI]

,

,

,

Kartikeya Upasani

,

Daniel M. Bikel

,

,

Eric Michael Smith

CoRR, 2024

BadMerging: Backdoor Attacks Against Model Merging.

[DOI]

,

,

,

,

,

CoRR, 2024

The Llama 3 Herd of Models.

[DOI]

Abhimanyu Dubey

,

,

,

Abhishek Kadian

,

,

,

,

,

,

,

,

Anthony Hartshorn

,

,

,

Archie Sravankumar

,

,

Arthur Hinsvark

,

,

,

Aurélien Rodriguez

,

Austen Gregerson

,

,

Baptiste Rozière

,

,

,

,

Charlotte Caucheteux

,

,

,

,

Chris McConnell

,

Christian Keller

,

Christophe Touret

,

,

,

Cristian Canton Ferrer

,

Cyrus Nikolaidis

,

Damien Allonsius

,

,

,

,

,

Dhruv Choudhary

,

,

Diego Garcia-Olano

,

,

,

,

,

,

,

Eric Michael Smith

,

Filip Radenovic

,

,

Gabriel Synnaeve

,

,

Georgia Lewis Anderson

,

,

Grégoire Mialon

,

,

Guillem Cucurell

,

,

Hannah Korevaar

,

,

,

,

Imanol Arrieta Ibarra

,

Isabel M. Kloumann

,

,

,

,

,

,

,

,

,

,

Jelmer van der Linde

,

Jennifer Billock

,

,

,

,

,

,

,

,

,

,

,

,

,

Joshua Johnstun

,

,

,

Kalyan Vasuden Alwala

,

Kartikeya Upasani

,

,

,

Kenneth Heafield

,

,

et al.

CoRR, 2024

Where Have You Been? A Study of Privacy Risk for Point-of-Interest Recommendation.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BadMerging: Backdoor Attacks Against Model Merging.

[DOI]

,

,

,

,

,

Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

2023

Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations.

[DOI]

,

Kartikeya Upasani

,

,

,

,

,

Michael Tontchev

,

,

,

Davide Testuggine

,

CoRR, 2023

Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies.

[DOI]

Md. Rizwan Parvez

,

,

Wasi Uddin Ahmad

,

,

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English.

[DOI]

,

Wasi Uddin Ahmad

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022

AUC-oriented Graph Neural Network for Fraud Detection.

[DOI]

,

,

,

,

,

,

,

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Reliable Representations Make A Stronger Defender: Unsupervised Structure Refinement for Robust GNN.

[DOI]

,

,

,

,

,

,

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Conditional Supervised Contrastive Learning for Fair Text Classification.

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ADAPT: Adversarial Domain Adaptation with Purifier Training for Cross-Domain Credit Risk Forecasting.

[DOI]

,

,

,

,

,

Proceedings of the Database Systems for Advanced Applications, 2022

Bi-Level Selection via Meta Gradient for Graph-Based Fraud Detection.

[DOI]

,

,

,

,

,

,

Proceedings of the Database Systems for Advanced Applications, 2022

Towards Return Parity in Markov Decision Processes.

[DOI]

,

,

,

,

,

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021

Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection.

[DOI]

,

,

,

,

,

,

Proceedings of the WWW '21: The Web Conference 2021, 2021

Credit Risk and Limits Forecasting in E-Commerce Consumer Lending Service via Multi-view-aware Mixture-of-experts Nets.

[DOI]

,

,

,

,

,

,

Proceedings of the WSDM '21, 2021

Understanding and Mitigating Accuracy Disparity in Regression.

[DOI]

,

,

Geoffrey J. Gordon

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Intent Classification and Slot Filling for Privacy Policies.

[DOI]

Wasi Uddin Ahmad

,

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Hybrid Batch Attacks: Finding Black-box Adversarial Examples with Limited Queries.

[DOI]

,

,

,

Proceedings of the 29th USENIX Security Symposium, 2020

Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation.

[DOI]

,

,

,

Geoffrey J. Gordon

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning to Undersampling for Class Imbalanced Credit Risk Forecasting.

[DOI]

,

,

,

,

,

,

Proceedings of the 20th IEEE International Conference on Data Mining, 2020

PolicyQA: A Reading Comprehension Dataset for Privacy Policies.

[DOI]

Wasi Uddin Ahmad

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019

Adversarial Task-Specific Privacy Preservation under Attribute Attack.

[DOI]

,

,

,

Geoffrey J. Gordon

CoRR, 2019

Side Channel Attacks in Computation Offloading Systems with GPU Virtualization.

[DOI]

,

,

,

Faysal Hossain Shezan

,

Proceedings of the 2019 IEEE Security and Privacy Workshops, 2019

Inquiry Spam Detection via Jointly Exploiting Temporal-Categorical Behavior and Linguistics.

[DOI]

,

,

,

Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

2018

Privacy Partitioning: Protecting User Data During the Deep Learning Inference Phase.

[DOI]

,

,

,

,

,

,

CoRR, 2018

Privacy Partition: A Privacy-Preserving Framework for Deep Neural Networks in Edge Networks.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 2018 IEEE/ACM Symposium on Edge Computing, 2018