RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy.
CoRR, March, 2025
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, February, 2025
Are Your LLMs Capable of Stable Reasoning?
CoRR, 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher.
CoRR, 2024
CIBench: Evaluating Your LLMs with a Code Interpreter Plugin.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
T-Eval: Evaluating the Tool Utilization Capability Step by Step.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.
CoRR, 2023
supervised adptive threshold network for instance segmentation.
CoRR, 2021
Boundary-based Real-time Text Detection on Container Code.
Proceedings of the 2021 International Symposium on Computer Science and Intelligent Control, 2021
A Hybrid Model for Container-code Detection.
Proceedings of the 13th International Congress on Image and Signal Processing, 2020