2025
Robust Reward Alignment via Hypothesis Space Batch Cutting.
CoRR, February, 2025

2024
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos.
CoRR, 2024

Safe MPC Alignment with Human Directional Feedback.
CoRR, 2024

2021
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021