Skip to Main content Skip to Navigation
Conference papers

Transactional Failure Recovery for a Distributed Key-Value Store

Abstract : With the advent of cloud computing, many applications have embraced the ensuing paradigm shift towards modern distributed key-value data stores, like HBase, in order to benefit from the elastic scalability on offer. However, many applications still hesitate to make the leap from the traditional relational database model simply because they cannot compromise on the standard transactional guarantees of atomicity, isolation, and durability. To get the best of both worlds, one option is to integrate an independent transaction management component with a distributed key-value store. In this paper, we discuss the implications of this approach for durability. In particular, if the transaction manager provides durability (e.g., through logging), then we can relax durability constraints in the key-value store. However, if a component fails (e.g., a client or a key-value server), then we need a coordinated recovery procedure to ensure that commits are persisted correctly. In our research, we integrate an independent transaction manager with HBase. Our main contribution is a failure recovery middleware for the integrated system, which tracks the progress of each commit as it is flushed down by the client and persisted within HBase, so that we can recover reliably from failures. During recovery, commits that were interrupted by the failure are replayed from the transaction management log. Importantly, the recovery process does not interrupt transaction processing on the available servers. Using a benchmark, we evaluate the impact of component failure, and subsequent recovery, on application performance.
Complete list of metadatas

Cited literature [21 references]  Display  Hide  Download

https://hal.inria.fr/hal-01480780
Contributor : Hal Ifip <>
Submitted on : Wednesday, March 1, 2017 - 5:32:54 PM
Last modification on : Thursday, June 18, 2020 - 12:32:05 PM
Document(s) archivé(s) le : Tuesday, May 30, 2017 - 6:10:40 PM

File

978-3-642-45065-5_14_Chapter.p...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Muhammad Ahmad, Bettina Kemme, Ivan Brondino, Marta Patiño-Martínez, Ricardo Jiménez-Peris. Transactional Failure Recovery for a Distributed Key-Value Store. 14th International Middleware Conference (Middleware), Dec 2013, Beijing, China. pp.267-286, ⟨10.1007/978-3-642-45065-5_14⟩. ⟨hal-01480780⟩

Share

Metrics

Record views

323

Files downloads

287