RoPIM: A Processing-in-Memory Architecture for Accelerating Rotary Positional Embedding in Transformer Models.
IEEE Comput. Archit. Lett., 2025
The Breakthrough Memory Solutions for Improved Performance on LLM Inference.
IEEE Micro, 2024
An LPDDR-based CXL-PNM Platform for TCO-efficient Inference of Transformer-based Large Language Models.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024
SMT: Software-Defined Memory Tiering for Heterogeneous Computing Systems With CXL Memory Expander.
IEEE Micro, 2023
Samsung PIM/PNM for Transformer Based AI: Energy Efficiency on PIM/PNM Cluster.
Proceedings of the 35th IEEE Hot Chips Symposium, 2023
Near-Memory Processing in Action: Accelerating Personalized Recommendation With AxDIMM.
IEEE Micro, 2022
Aquabolt-XL HBM2-PIM, LPDDR5-PIM With In-Memory Processing, and AXDIMM With Acceleration Buffer.
IEEE Micro, 2022
Improving In-Memory Database Operations with Acceleration DIMM (AxDIMM).
Proceedings of the International Conference on Management of Data, 2022
Understanding and Reducing Weight-Load Overhead of Systolic Deep Learning Accelerators.
Proceedings of the 18th International SoC Design Conference, 2021
Aquabolt-XL: Samsung HBM2-PIM with in-memory processing for ML accelerators and beyond.
Proceedings of the IEEE Hot Chips 33 Symposium, 2021