2024
Neural Locality Sensitive Hashing for Entity Blocking.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

BPID: A Benchmark for Personal Identity Deduplication.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Textual Dataset Distillation via Language Model Embedding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2015
Modeling Social Attention for Stock Analysis: An Influence Propagation Perspective.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015