2023

AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs.

[DOI]

Yaosheng Fu

,

Evgeny Bolotin

,

Aamer Jaleel

,

Gal Dalal

,

Shie Mannor

,

Jacob Subag

,

Noam Korem

,

Michael Behar

,

David W. Nellans

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023