2025
Absolute Zero: Reinforced Self-play Reasoning with Zero Data.
CoRR, May, 2025

DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints.
CoRR, 2024

ExpeL: LLM Agents Are Experiential Learners.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024