2025
Mutual-Taught for Co-adapting Policy and Reward Models.
CoRR, June, 2025

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion.
CoRR, April, 2025

Explainable Synthetic Image Detection through Diffusion Timestep Ensembling.
CoRR, March, 2025

Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling.
CoRR, March, 2025

Weighted-Reward Preference Optimization for Implicit Model Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
ProFuser: Progressive Fusion of Large Language Models.
CoRR, 2024

Towards Biologically Plausible Computing: A Comprehensive Comparison.
CoRR, 2024

Searching for Best Practices in Retrieval-Augmented Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
A Novel Two-Stage Generation Framework for Promoting the Persona-Consistency and Diversity of Responses in Neural Dialog Systems.
IEEE Trans. Neural Networks Learn. Syst., March, 2023

A Unified Generation Approach for Robust Dialogue State Tracking.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

PsyCoT: Psychological Questionnaire as Powerful Chain-of-Thought for Personality Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023