Transactional Failure Recovery for a Distributed Key-Value Store - Archive ouverte HAL Access content directly
Conference Papers Year : 2013

Transactional Failure Recovery for a Distributed Key-Value Store

(1) , (1) , (2) , (2) , (2)
1
2

Abstract

With the advent of cloud computing, many applications have embraced the ensuing paradigm shift towards modern distributed key-value data stores, like HBase, in order to benefit from the elastic scalability on offer. However, many applications still hesitate to make the leap from the traditional relational database model simply because they cannot compromise on the standard transactional guarantees of atomicity, isolation, and durability. To get the best of both worlds, one option is to integrate an independent transaction management component with a distributed key-value store. In this paper, we discuss the implications of this approach for durability. In particular, if the transaction manager provides durability (e.g., through logging), then we can relax durability constraints in the key-value store. However, if a component fails (e.g., a client or a key-value server), then we need a coordinated recovery procedure to ensure that commits are persisted correctly. In our research, we integrate an independent transaction manager with HBase. Our main contribution is a failure recovery middleware for the integrated system, which tracks the progress of each commit as it is flushed down by the client and persisted within HBase, so that we can recover reliably from failures. During recovery, commits that were interrupted by the failure are replayed from the transaction management log. Importantly, the recovery process does not interrupt transaction processing on the available servers. Using a benchmark, we evaluate the impact of component failure, and subsequent recovery, on application performance.
Fichier principal
Vignette du fichier
978-3-642-45065-5_14_Chapter.pdf (510.67 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01480780 , version 1 (01-03-2017)

Licence

Attribution - CC BY 4.0

Identifiers

Cite

Muhammad Yousuf Ahmad, Bettina Kemme, Ivan Brondino, Marta Patiño-Martínez, Ricardo Jiménez-Peris. Transactional Failure Recovery for a Distributed Key-Value Store. 14th International Middleware Conference (Middleware), Dec 2013, Beijing, China. pp.267-286, ⟨10.1007/978-3-642-45065-5_14⟩. ⟨hal-01480780⟩
191 View
86 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More