2025
UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference.
CoRR, April, 2025

MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference.
CoRR, January, 2025

2024
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.
IACR Cryptol. ePrint Arch., 2024

EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization.
CoRR, 2024

Kuaiji: the First Chinese Accounting Large Language Model.
CoRR, 2024

ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment.
CoRR, 2024

FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving.
CoRR, 2023

CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference.
CoRR, 2023

CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Converge to the Truth: Factual Error Correction via Iterative Constrained Editing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MPCViT: Searching for MPC-friendly Vision Transformer with Heterogeneous Attention.
CoRR, 2022

Connecting the Hosts: Street-Level IP Geolocation with Graph Neural Networks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022