Transactional Failure Recovery for a Distributed Key-Value Store - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Transactional Failure Recovery for a Distributed Key-Value Store

Résumé

With the advent of cloud computing, many applications have embraced the ensuing paradigm shift towards modern distributed key-value data stores, like HBase, in order to benefit from the elastic scalability on offer. However, many applications still hesitate to make the leap from the traditional relational database model simply because they cannot compromise on the standard transactional guarantees of atomicity, isolation, and durability. To get the best of both worlds, one option is to integrate an independent transaction management component with a distributed key-value store. In this paper, we discuss the implications of this approach for durability. In particular, if the transaction manager provides durability (e.g., through logging), then we can relax durability constraints in the key-value store. However, if a component fails (e.g., a client or a key-value server), then we need a coordinated recovery procedure to ensure that commits are persisted correctly. In our research, we integrate an independent transaction manager with HBase. Our main contribution is a failure recovery middleware for the integrated system, which tracks the progress of each commit as it is flushed down by the client and persisted within HBase, so that we can recover reliably from failures. During recovery, commits that were interrupted by the failure are replayed from the transaction management log. Importantly, the recovery process does not interrupt transaction processing on the available servers. Using a benchmark, we evaluate the impact of component failure, and subsequent recovery, on application performance.
Fichier principal
Vignette du fichier
978-3-642-45065-5_14_Chapter.pdf (510.67 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01480780 , version 1 (01-03-2017)

Licence

Paternité

Identifiants

Citer

Muhammad Yousuf Ahmad, Bettina Kemme, Ivan Brondino, Marta Patiño-Martínez, Ricardo Jiménez-Peris. Transactional Failure Recovery for a Distributed Key-Value Store. 14th International Middleware Conference (Middleware), Dec 2013, Beijing, China. pp.267-286, ⟨10.1007/978-3-642-45065-5_14⟩. ⟨hal-01480780⟩
204 Consultations
96 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More