Enhanced Detection of Conversational Mental Manipulation Through Advanced Prompting Techniques.
CoRR, 2024
Addressing Healthcare-related Racial and LGBTQ+ Biases in Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proto-lm: A Prototypical Network-Based Framework for Built-in Interpretability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023