2025
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy.
CoRR, March, 2025

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning.
CoRR, February, 2025

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Are Your LLMs Capable of Stable Reasoning?
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Are Your LLMs Capable of Stable Reasoning?
CoRR, 2024

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
CoRR, 2024

InternLM2 Technical Report.
CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
CoRR, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
T-Eval: Evaluating the Tool Utilization Capability Step by Step.
CoRR, 2023

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.
CoRR, 2023

2021
Supervised Adaptive Threshold Network for Instance Segmentation.
CoRR, 2021

Boundary-based Real-time Text Detection on Container Code.
Proceedings of the 2021 International Symposium on Computer Science and Intelligent Control, 2021

2020
A Hybrid Model for Container-code Detection.
Proceedings of the 13th International Congress on Image and Signal Processing, 2020