2025
Theoretical Benefit and Limitation of Diffusion Language Model.
CoRR, February 2025

How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs.
CoRR, 2024

DPO Meets PPO: Reinforced Token Optimization for RLHF.
CoRR, 2024

Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Do Efficient Transformers Really Save Computation?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.
CoRR, 2023

Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests.
Proceedings of the International Conference on Machine Learning, 2023