Reducing Tail Latency via Safe and Simple Duplication

Abdullah Bin Faisal; Ali Musa Iftikhar; Fahad R. Dogar; Hafiz Mohsin Bashir; Ihsan Ayyub Qazi; Muhammad Asim Jamshed; Peter Vondras

arxiv: 1905.13352 · v1 · pith:XBKWJRFVnew · submitted 2019-05-30 · 💻 cs.NI · cs.DC

Reducing Tail Latency via Safe and Simple Duplication

Hafiz Mohsin Bashir , Abdullah Bin Faisal , Muhammad Asim Jamshed , Peter Vondras , Ali Musa Iftikhar , Ihsan Ayyub Qazi , Fahad R. Dogar This is my paper

classification 💻 cs.NI cs.DC

keywords duplicationcloudsafesystemabstractionacrosslatencylayers

0 comments

read the original abstract

Duplication can be a powerful strategy for overcoming stragglers in cloud services, but is often used conservatively because of the risk of overloading the system. We present duplicate-aware scheduling or DAS, which makes duplication safe and easy to use, by leveraging the two well-known primitives of prioritization and purging. To support DAS across diverse layers of a cloud system (e.g., network, storage, etc), we propose the D-Stage abstraction, which decouples the duplication policy from the mechanism, and facilitates working with legacy layers of a system. Using this abstraction, we evaluate the benefits of DAS for two data parallel applications (HDFS, an in-memory workload generator) and a network function (snort-based IDS cluster). Our experiments on the public cloud and Emulab show that DAS is safe to use, and the tail latency improvement holds across a wide range of workloads

This paper has not been read by Pith yet.

Reducing Tail Latency via Safe and Simple Duplication

discussion (0)