WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models.
CoRR, 2024
Vision-based automatic order check method for online medicine dispensing cabinet under incomplete data.
Eng. Appl. Artif. Intell., 2023
Exploiting Emotion-Semantic Correlations for Empathetic Response Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023