Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Llama Guard 3 Vision: Safeguarding Human-AI Image Understanding Conversations.
CoRR, 2024
Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks.
CoRR, 2024
Persistent Pre-Training Poisoning of LLMs.
CoRR, 2024
Backtracking Improves Generation Safety.
CoRR, 2024
BadMerging: Backdoor Attacks Against Model Merging.
CoRR, 2024
Where Have You Been? A Study of Privacy Risk for Point-of-Interest Recommendation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
BadMerging: Backdoor Attacks Against Model Merging.
Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
AUC-oriented Graph Neural Network for Fraud Detection.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
Reliable Representations Make A Stronger Defender: Unsupervised Structure Refinement for Robust GNN.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Conditional Supervised Contrastive Learning for Fair Text Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
ADAPT: Adversarial Domain Adaptation with Purifier Training for Cross-Domain Credit Risk Forecasting.
Proceedings of the Database Systems for Advanced Applications, 2022
Bi-Level Selection via Meta Gradient for Graph-Based Fraud Detection.
Proceedings of the Database Systems for Advanced Applications, 2022
Towards Return Parity in Markov Decision Processes.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022
Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection.
Proceedings of the WWW '21: The Web Conference 2021, 2021
Credit Risk and Limits Forecasting in E-Commerce Consumer Lending Service via Multi-view-aware Mixture-of-experts Nets.
Proceedings of the WSDM '21, 2021
Understanding and Mitigating Accuracy Disparity in Regression.
Proceedings of the 38th International Conference on Machine Learning, 2021
Intent Classification and Slot Filling for Privacy Policies.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Hybrid Batch Attacks: Finding Black-box Adversarial Examples with Limited Queries.
Proceedings of the 29th USENIX Security Symposium, 2020
Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Learning to Undersampling for Class Imbalanced Credit Risk Forecasting.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020
PolicyQA: A Reading Comprehension Dataset for Privacy Policies.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Adversarial Task-Specific Privacy Preservation under Attribute Attack.
CoRR, 2019
Side Channel Attacks in Computation Offloading Systems with GPU Virtualization.
Proceedings of the 2019 IEEE Security and Privacy Workshops, 2019
Inquiry Spam Detection via Jointly Exploiting Temporal-Categorical Behavior and Linguistics.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Privacy Partitioning: Protecting User Data During the Deep Learning Inference Phase.
CoRR, 2018
Privacy Partition: A Privacy-Preserving Framework for Deep Neural Networks in Edge Networks.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2018 IEEE/ACM Symposium on Edge Computing, 2018