Binary Classifier Optimization for Large Language Model Alignment.
CoRR, 2024
TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2024, 2024
Hexa: Self-Improving for Knowledge-Grounded Dialogue System.
CoRR, 2023
Effortless Integration of Memory Management into Open-Domain Conversation Systems.
CoRR, 2023
Efficient Latent Variable Modeling for Knowledge-Grounded Dialogue Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Boundary-aware Self-supervised Learning for Video Scene Segmentation.
CoRR, 2022
BaSSL: Boundary-aware Self-Supervised Learning for Video Scene Segmentation.
Proceedings of the Computer Vision - ACCV 2022, 2022
Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts.
CoRR, 2021