SemEval-2025 Task 4: Unlearning sensitive content from Large Language Models.
CoRR, April, 2025
LUME: LLM Unlearning with Multitask Evaluations.
CoRR, February, 2025
Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries.
CoRR, February, 2025
MQuAKE-Remastered: Multi-Hop Knowledge Editing Can Only Be Advanced with Reliable Evaluations.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory.
CoRR, 2024
White Men Lead, Black Women Help: Uncovering Gender, Racial, and Intersectional Bias in Language Agency.
CoRR, 2024
Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation.
CoRR, 2024
The Male CEO and the Female Assistant: Probing Gender Biases in Text-To-Image Models Through Paired Stereotype Test.
CoRR, 2024
MACAROON: Training Vision-Language Models To Be Your Engaged Partners.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation.
CoRR, 2023
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
TheoremQA: A Theorem-driven Question Answering Dataset.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Does BERT Exacerbate Gender or L1 Biases in Automated English Speaking Assessment?
Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications, 2023
PIP: Parse-Instructed Prefix for Syntactically Controlled Paraphrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Improving the Adversarial Robustness of NLP Models by Information Bottleneck.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022