Kavel Rao

According to our database1, Kavel Rao authored at least 5 papers between 2023 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

2023
2024
0
1
2
3
4
5
1
3
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
To Err is AI : A Case Study Informing LLM Flaw Reporting Practices.
CoRR, 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
What Makes it Ok to Set a Fire? Iterative Self-distillation of Contexts and Rationales for Disambiguating Defeasible Social and Moral Situations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023


  Loading...