From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs.
CoRR, April, 2025
AI-Empowered RIS-Assisted Networks: CV-Enabled RIS Selection and DNN-Enabled Transmission.
IEEE Trans. Veh. Technol., November, 2024
Accelerating Neural Network Inference by Overflow Aware Quantization.
CoRR, 2020