Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models.

[BibT_eX]

[DOI]

Fei Wang

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Attribute Controlled Fine-tuning for Large Language Models: A Case Study on Detoxification.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

FLIRT: Feedback Loop In-context Red Teaming.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Prompt Perturbation Consistency Learning for Robust Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Are you talking to ['xem'] or ['x', 'em']? On Tokenization and Addressing Misgendering in LLMs with Pronoun Tokenization Parity.

[BibT_eX]

[DOI]

CoRR, 2023

JAB: Joint Adversarial Prompting and Belief Augmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Resolving Ambiguities in Text-to-Image Generative Models.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Where Does Bias in Common Sense Knowledge Models Come From?

[BibT_eX]

[DOI]

IEEE Internet Comput., 2022

A Survey on Bias and Fairness in Machine Learning.

[BibT_eX]

[DOI]

ACM Comput. Surv., 2022

Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models.

[BibT_eX]

[DOI]

CoRR, 2022

Towards Multi-Objective Statistically Fair Federated Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Robust Conversational Agents against Imperceptible Toxicity Triggers.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

2021

Attributing Fair Decisions with Attention Interventions.

[BibT_eX]

[DOI]

CoRR, 2021

Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Exacerbating Algorithmic Bias through Fairness Attacks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

The Leaky Pipeline in Physics Publishing.

[BibT_eX]

[DOI]

CoRR, 2020

Statistical Equity: A Fairness Classification Objective.

[BibT_eX]

[DOI]

Ninareh Mehrabi

Yuzhong Huang

Fred Morstatter

CoRR, 2020

Man is to Person as Woman is to Location: Measuring Gender Bias in Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the HT '20: 31st ACM Conference on Hypertext and Social Media, 2020

2019

Debiasing community detection: the importance of lowly connected nodes.

[BibT_eX]

[DOI]

Proceedings of the ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining, 2019

2018

DynamicGEM: A Library for Dynamic Graph Embedding Methods.

[BibT_eX]

[DOI]

CoRR, 2018

Ninareh Mehrabi

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...