Kavel Rao

According to our database¹, Kavel Rao authored at least 6 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2025

ColorGrid: A Multi-Agent Non-Stationary Environment for Goal Inference and Assistance.

CoRR, January, 2025

2024

To Err is AI: A Case Study Informing LLM Flaw Reporting Practices.

CoRR, 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.

Niloofar Mireshghallah

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
