Dealing with Skewed Data in Structured Overlays using Variable Hash Functions

Maeva Antoine 1 Fabrice Huet 1
1 SCALE - Safe Composition of Autonomous applications with Large-SCALE Execution environment
Laboratoire I3S - COMRED - COMmunications, Réseaux, systèmes Embarqués et Distribués
Abstract : Storing highly skewed data in a distributed system has become a very frequent issue, in particular with the emergence of semantic web and Big Data. This often leads to biased data dissemination among nodes. Addressing load imbalance is necessary, especially to minimize response time and avoid workload being handled by only one or few nodes. Our contribution aims at dynamically managing load imbalance by allowing multiple hash functions on different peers, while maintaining consistency of the overlay. Our experiments, on highly skewed data sets from the semantic web, show we can distribute data on at least 300 times more peers than when not using any load balancing strategy.
Complete list of metadatas

Cited literature [13 references]  Display  Hide  Download

https://hal.inria.fr/hal-01101678
Contributor : Maeva Antoine <>
Submitted on : Friday, January 9, 2015 - 1:25:37 PM
Last modification on : Saturday, December 8, 2018 - 1:20:50 AM
Long-term archiving on : Saturday, September 12, 2015 - 1:10:23 AM

File

can.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Maeva Antoine, Fabrice Huet. Dealing with Skewed Data in Structured Overlays using Variable Hash Functions. The 15th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), The University of Hong Kong, Dec 2014, Hong Kong, Hong Kong SAR China. pp.42-48, ⟨10.1109/PDCAT.2014.15⟩. ⟨hal-01101678⟩

Share

Metrics

Record views

303

Files downloads

263