2025
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy.
CoRR, March, 2025

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning.
CoRR, February, 2025

2024
Are Your LLMs Capable of Stable Reasoning?
CoRR, 2024

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher.
CoRR, 2024

CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
CoRR, 2024

InternLM2 Technical Report.
CoRR, 2024

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
CoRR, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
T-Eval: Evaluating the Tool Utilization Capability Step by Step.
CoRR, 2023

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.
CoRR, 2023

2021
supervised adptive threshold network for instance segmentation.
CoRR, 2021

Boundary-based Real-time Text Detection on Container Code.
Proceedings of the 2021 International Symposium on Computer Science and Intelligent Control, 2021

2020
A Hybrid Model for Container-code Detection.
Proceedings of the 13th International Congress on Image and Signal Processing, 2020