Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries.
Proceedings of the Proceedings 28th International Conference on Extending Database Technology, 2025
ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems.
Proc. VLDB Endow., December, 2023
Spider4SPARQL: A Complex Benchmark for Evaluating Knowledge Graph Question Answering Systems.
Proceedings of the IEEE International Conference on Big Data, 2023