2024
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024