Ninareh Mehrabi

According to our database, Ninareh Mehrabi authored at least 24 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
MICo: Preventative Detoxification of Large Language Models through Inhibition Control.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

The steerability of large language models toward data-driven personas.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FLIRT: Feedback Loop In-context Red Teaming.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Prompt Perturbation Consistency Learning for Robust Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Are you talking to ['xem'] or ['x', 'em']? On Tokenization and Addressing Misgendering in LLMs with Pronoun Tokenization Parity.
CoRR, 2023

JAB: Joint Adversarial Prompting and Belief Augmentation.
CoRR, 2023

Resolving Ambiguities in Text-to-Image Generative Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Where Does Bias in Common Sense Knowledge Models Come From?
IEEE Internet Comput., 2022

A Survey on Bias and Fairness in Machine Learning.
ACM Comput. Surv., 2022

Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models.
CoRR, 2022

Towards Multi-Objective Statistically Fair Federated Learning.
CoRR, 2022

Robust Conversational Agents against Imperceptible Toxicity Triggers.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

2021
Attributing Fair Decisions with Attention Interventions.
CoRR, 2021

Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Exacerbating Algorithmic Bias through Fairness Attacks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
The Leaky Pipeline in Physics Publishing.
CoRR, 2020

Statistical Equity: A Fairness Classification Objective.
CoRR, 2020

Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition.
Proceedings of the HT '20: 31st ACM Conference on Hypertext and Social Media, 2020

2019
Debiasing community detection: the importance of lowly connected nodes.
Proceedings of the ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining, 2019

2018
DynamicGEM: A Library for Dynamic Graph Embedding Methods.
CoRR, 2018

