×
2023
AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs.
[DOI]
Yaosheng Fu
,
Evgeny Bolotin
,
Aamer Jaleel
,
Gal Dalal
,
Shie Mannor
,
Jacob Subag
,
Noam Korem
,
Michael Behar
,
David W. Nellans
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023