2025

Absolute Zero: Reinforced Self-play Reasoning with Zero Data.

CoRR, May 2025

DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints.

Proceedings of AAAI-25, sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints.

CoRR, 2024

ExpeL: LLM Agents Are Experiential Learners.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024