2024
GIDCL: A Graph-Enhanced Interpretable Data Cleaning Framework with Large Language Models.
Proc. ACM Manag. Data, December, 2024

Enriching Relations with Additional Attributes for ER.
Proc. VLDB Endow., July, 2024

Efficient Mixture of Experts based on Large Language Models for Low-Resource Data Preprocessing.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Retrieval-Augmented Framework for Tabular Interpretation with Large Language Model.
Proceedings of the Database Systems for Advanced Applications, 2024

Unsupervised Domain Adaptation for Entity Blocking Leveraging Large Language Models.
Proceedings of the IEEE International Conference on Big Data, 2024

2023
Splitting Tuples of Mismatched Entities.
Proc. ACM Manag. Data, December, 2023

Expanding the prediction capacity in long sequence time-series forecasting.
Artif. Intell., May, 2023

2017
Stacked Kernel Network.
CoRR, 2017