Conference Papers Year : 2018

Tailwind: Fast and Atomic RDMA-based Replication


Abstract

Replication is essential for fault tolerance, but it is also a source of high overhead. Some recent in-memory systems use remote direct memory access (RDMA) to create redundant copies of data in remote memory. RDMA is attractive because it offers low latency and high throughput and imposes no CPU overhead at the target. However, even with RDMA, no existing replication protocol avoids redundant data copying and target-side work. Backup servers may receive updates via RDMA, but they must actively incorporate the updates into replicas, both to update backup data structures and to ensure that only fully received messages are applied. In this paper, we present Tailwind, a zero-copy recovery-log replication protocol for scale-out in-memory databases. Tailwind is the first replication protocol that eliminates all CPU-driven data copying and fully bypasses target server CPUs for data replication. In Tailwind, all data movement is done via device DMA; one-sided RDMA writes place data directly in remote I/O buffers in its final form for stable storage. Tailwind substantially improves replication throughput and response latency compared with conventional RPC-based replication. In symmetric systems where servers both serve requests and act as replicas, Tailwind also improves normal-case throughput by freeing server CPU resources for request processing. We implemented and evaluated Tailwind on RAMCloud, a low-latency in-memory storage system. Experiments show that Tailwind improves RAMCloud's normal-case request processing throughput by 1.7x. It also decreases median and 99th percentile write latencies by 2x and 3x, respectively.
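The requirement that only fully received messages are applied can be illustrated with a small simulation. This is a minimal sketch, not the protocol from the paper: the record layout (a length field plus a CRC32 of the payload), the function names, and the use of a local bytearray as a stand-in for a one-sided RDMA write are all assumptions made for illustration, since the abstract does not specify the mechanism. The idea shown is that the writer places framed records directly into the backup's buffer with no backup-side code running, and a reader validates them only later, at recovery time.

```python
import struct
import zlib

# Hypothetical record header: payload length, then CRC32 of the payload.
HEADER = struct.Struct("<II")


def encode_record(payload: bytes) -> bytes:
    """Frame a payload so a later reader can check that it arrived in full."""
    return HEADER.pack(len(payload), zlib.crc32(payload)) + payload


def remote_write(buffer: bytearray, offset: int, data: bytes) -> int:
    """Stand-in for a one-sided write: bytes land in the backup's buffer
    and no backup-side logic runs. Returns the offset after the write."""
    buffer[offset:offset + len(data)] = data
    return offset + len(data)


def recover(buffer) -> list:
    """Scan the buffer at recovery time and return the valid record prefix,
    stopping at the first empty, truncated, or corrupt record."""
    records = []
    offset = 0
    while offset + HEADER.size <= len(buffer):
        length, checksum = HEADER.unpack_from(buffer, offset)
        start = offset + HEADER.size
        end = start + length
        if length == 0 or end > len(buffer):
            break  # unused space or a header pointing past the buffer
        payload = bytes(buffer[start:end])
        if zlib.crc32(payload) != checksum:
            break  # partially written record: do not apply it
        records.append(payload)
        offset = end
    return records


if __name__ == "__main__":
    backup = bytearray(64)
    off = remote_write(backup, 0, encode_record(b"put k1=v1"))
    torn = encode_record(b"put k2=v2")
    remote_write(backup, off, torn[:-3])  # simulate a write cut short
    print(recover(backup))                # only the first record survives
    remote_write(backup, off, torn)       # the write completes
    print(recover(backup))                # both records are now visible
```

In this sketch all validation work is deferred to recovery, which is what lets the normal-case write path avoid the backup's CPU; a real system would also need to handle buffer reuse and stale data, which the sketch ignores.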
Main file
atc-221-final.pdf (594.15 KB)
Origin : Files produced by the author(s)

Dates and versions

hal-01676502 , version 1 (05-01-2018)
hal-01676502 , version 2 (26-05-2018)

Identifiers

  • HAL Id : hal-01676502 , version 2

Cite

Yacine Taleb, Ryan Stutsman, Gabriel Antoniu, Toni Cortes. Tailwind: Fast and Atomic RDMA-based Replication. ATC '18 - USENIX Annual Technical Conference, Jul 2018, Boston, United States. pp. 850-863. ⟨hal-01676502v2⟩