2023
AutoScratch: ML-Optimized Cache Management for Inference-Oriented GPUs.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023