2025

RoPIM: A Processing-in-Memory Architecture for Accelerating Rotary Positional Embedding in Transformer Models.

[DOI]

Yunhyeong Jeon

Minwoo Jang

IEEE Comput. Archit. Lett., 2025

2024

The Breakthrough Memory Solutions for Improved Performance on LLM Inference.

[DOI]

IEEE Micro, 2024

An LPDDR-based CXL-PNM Platform for TCO-efficient Inference of Transformer-based Large Language Models.

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

2023

SMT: Software-Defined Memory Tiering for Heterogeneous Computing Systems With CXL Memory Expander.

[DOI]

IEEE Micro, 2023

Samsung PIM/PNM for Transformer Based AI: Energy Efficiency on PIM/PNM Cluster.

[DOI]

Proceedings of the 35th IEEE Hot Chips Symposium, 2023

2022

Near-Memory Processing in Action: Accelerating Personalized Recommendation With AxDIMM.

[DOI]

IEEE Micro, 2022

Aquabolt-XL HBM2-PIM, LPDDR5-PIM With In-Memory Processing, and AXDIMM With Acceleration Buffer.

[DOI]

IEEE Micro, 2022

Improving In-Memory Database Operations with Acceleration DIMM (AxDIMM).

[DOI]

Vishnu Charan Thummala

Ravi Shankar JV

Sachin Suresh Upadhya

Mohammed Ibrahim Khan

Jin Hyun Kim

Proceedings of the International Conference on Management of Data, 2022

2021

Understanding and Reducing Weight-Load Overhead of Systolic Deep Learning Accelerators.

[DOI]

Proceedings of the 18th International SoC Design Conference, 2021

Aquabolt-XL: Samsung HBM2-PIM with in-memory processing for ML accelerators and beyond.

[DOI]

Proceedings of the IEEE Hot Chips 33 Symposium, 2021