TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models.
CoRR, April, 2025
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering.
CoRR, March, 2025
Reference-Guided Verdict: LLMs-as-Judges in Automatic Evaluation of Free-Form Text.
CoRR, 2024
Quantifying the Capabilities of LLMs across Scale and Precision.
CoRR, 2024
Ethics of AI: A Systematic Literature Review of Principles and Challenges.
Proceedings of the EASE 2022: The International Conference on Evaluation and Assessment in Software Engineering 2022, Gothenburg, Sweden, June 13-15, 2022
The Influence of Cost Drivers on Effort Estimation in Distributed Software Development.
Proceedings of the EASE 2022: The International Conference on Evaluation and Assessment in Software Engineering 2022, Gothenburg, Sweden, June 13-15, 2022
What users really think about the usability of smartphone applications: diversity based empirical investigation.
Multim. Tools Appl., 2021
Ethics of AI: A Systematic Literature Review of Principles and Challenges.
CoRR, 2021
System and Software Processes in Practice: Insights from Chinese Industry.
Proceedings of the EASE 2021: Evaluation and Assessment in Software Engineering, 2021
Cross-Project Software Fault Prediction Using Data Leveraging Technique to Improve Software Quality.
Proceedings of the EASE '20: Evaluation and Assessment in Software Engineering, 2020
Towards Process Improvement in DevOps: A Systematic Literature Review.
Proceedings of the EASE '20: Evaluation and Assessment in Software Engineering, 2020