2024

WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models.

[DOI]

CoRR, 2024

2023

Vision-based automatic order check method for online medicine dispensing cabinet under incomplete data.

[DOI]

Eng. Appl. Artif. Intell., 2023

Exploiting Emotion-Semantic Correlations for Empathetic Response Generation.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023