2025
Generative AI Act II: Test Time Scaling Drives Cognition Engineering.
CoRR, April, 2025

DIVE: Diversified Iterative Self-Improvement.
CoRR, January, 2025

2024
O1 Replication Journey - Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
CoRR, 2024

O1 Replication Journey: A Strategic Progress Report - Part 1.
CoRR, 2024

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

InFoBench: Evaluating Instruction Following Ability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
T5Score: Discriminative Fine-tuning of Generative Evaluation Metrics.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Searching for Effective Multilingual Fine-Tuning Methods: A Case Study in Summarization.
CoRR, 2022

2021
Automating Claim Construction in Patent Applications: The CMUmine Dataset.
Proceedings of the Natural Legal Language Processing Workshop 2021, 2021

2013
An arc-shaped front nose for the mole in space exploration.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013