Mutual-Taught for Co-adapting Policy and Reward Models.
CoRR, June, 2025
FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion.
CoRR, April, 2025
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling.
CoRR, March, 2025
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling.
CoRR, March, 2025
Weighted-Reward Preference Optimization for Implicit Model Fusion.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
ProFuser: Progressive Fusion of Large Language Models.
CoRR, 2024
Towards Biologically Plausible Computing: A Comprehensive Comparison.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Searching for Best Practices in Retrieval-Augmented Generation.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
A Novel Two-Stage Generation Framework for Promoting the Persona-Consistency and Diversity of Responses in Neural Dialog Systems.
IEEE Trans. Neural Networks Learn. Syst., March, 2023
A Unified Generation Approach for Robust Dialogue State Tracking.
Proceedings of the Natural Language Processing and Chinese Computing, 2023
PsyCoT: Psychological Questionnaire as Powerful Chain-of-Thought for Personality Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023