2025
CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2024
GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security.
CoRR, 2024

Neural Locality Sensitive Hashing for Entity Blocking.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

BPID: A Benchmark for Personal Identity Deduplication.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024