LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System.
CoRR, 2024
LauWS: Local Adaptive Unstructured Weight Sparsity of Load Balance for DNN in Near-Data Processing.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024
SK Hynix AI-Specific Computing Memory Solution: From AiM Device to Heterogeneous AiMX-xPU System for Comprehensive LLM Inference.
Proceedings of the 36th IEEE Hot Chips Symposium, 2024
Memory-Centric Computing with SK Hynix's Domain-Specific Memory.
Proceedings of the 35th IEEE Hot Chips Symposium, 2023