×
2025
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?
[DOI]
Jianzhu Yao
,
Kevin Wang
,
Ryan Hsieh
,
Haisu Zhou
,
Tianqing Zou
,
Zerui Cheng
,
Zhangyang Wang
,
Pramod Viswanath
CoRR, March, 2025
2024
MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension.
[DOI]
Zekun Li
,
Xianjun Yang
,
Kyuri Choi
,
Wanrong Zhu
,
Ryan Hsieh
,
HyeonJung Kim
,
Jin Hyuk Lim
,
Sungyoung Ji
,
Byungju Lee
,
Xifeng Yan
,
Linda Ruth Petzold
,
Stephen D. Wilson
,
Woosang Lim
,
William Yang Wang
CoRR, 2024