RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy.
CoRR, March, 2025
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Are Your LLMs Capable of Stable Reasoning?
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Are Your LLMs Capable of Stable Reasoning?
CoRR, 2024
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
T-Eval: Evaluating the Tool Utilization Capability Step by Step.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.
CoRR, 2023
supervised adptive threshold network for instance segmentation.
CoRR, 2021
Boundary-based Real-time Text Detection on Container Code.
Proceedings of the 2021 International Symposium on Computer Science and Intelligent Control, 2021
A Hybrid Model for Container-code Detection.
Proceedings of the 13th International Congress on Image and Signal Processing, 2020