UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference.
CoRR, April, 2025
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference.
CoRR, January, 2025
PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization.
IACR Cryptol. ePrint Arch., 2024
EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization.
CoRR, 2024
Kuaiji: the First Chinese Accounting Large Language Model.
CoRR, 2024
ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment.
CoRR, 2024
FlexHE: A flexible Kernel Generation Framework for Homomorphic Encryption-Based Private Inference.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024
FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving.
CoRR, 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference.
CoRR, 2023
CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Converge to the Truth: Factual Error Correction via Iterative Constrained Editing.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
MPCViT: Searching for MPC-friendly Vision Transformer with Heterogeneous Attention.
CoRR, 2022
Connecting the Hosts: Street-Level IP Geolocation with Graph Neural Networks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022