SOSP2023

Achieving Microsecond-Scale Tail Latency Efficiently with Approximate Optimal Scheduling

Rishabh R. Iyer, Musa Unal, Marios Kogias, George Candea

被引用 17 次

摘要

Datacenter applications expect microsecond-scale service times and tightly bound tail latency, with future workloads expected to be even more demanding. To address this challenge, state-of-the-art runtimes employ theoretically optimal scheduling policies, namely a single request queue and strict preemption.