Robust Reward Alignment via Hypothesis Space Batch Cutting.
CoRR, February, 2025
Language-Model-Assisted Bi-Level Programming for Reward Learning from Internet Videos.
CoRR, 2024
Safe MPC Alignment with Human Directional Feedback.
CoRR, 2024
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021