Theoretical Benefit and Limitation of Diffusion Language Model.
CoRR, February, 2025
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs.
CoRR, 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF.
CoRR, 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Do Efficient Transformers Really Save Computation?
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.
CoRR, 2023
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests.
Proceedings of the International Conference on Machine Learning, 2023