2024
Knowledge Graph Enhanced Language Agents for Recommendation.
CoRR, 2024

ScholarChemQA: Unveiling the Power of Language Models in Chemical Research Question Answering.
CoRR, 2024

Data Interpreter: An LLM Agent For Data Science.
CoRR, 2024

Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark.
CoRR, 2024

Defending Jailbreak Prompts via In-Context Adversarial Game.
CoRR, 2024

Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Large Language Model Based Multi-agents: A Survey of Progress and Challenges.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

A Property-Guided Diffusion Model For Generating Molecular Graphs.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023
Modeling non-uniform uncertainty in Reaction Prediction via Boosting and Dropout.
CoRR, 2023

What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks.
CoRR, 2023

Few-shot News Recommendation via Cross-lingual Transfer.
Proceedings of the ACM Web Conference 2023, 2023

What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
Improving Few-shot News Recommendation via Cross-lingual Transfer.
CoRR, 2022