SOSP 2023
Achieving Microsecond-Scale Tail Latency Efficiently with Approximate Optimal Scheduling
Rishabh R. Iyer, Musa Unal, Marios Kogias, George Candea
Cited by 17
Abstract
Datacenter applications expect microsecond-scale service times and tightly bounded tail latency, with future workloads expected to be even more demanding. To address this challenge, state-of-the-art runtimes employ theoretically optimal scheduling policies, namely a single request queue and strict preemption.