2024
Latency-Based Inter-Operator Scheduling for CNN Inference Acceleration on GPU.
IEEE Trans. Serv. Comput., 2024