A Distributed Algorithm for Resource Clustering in Large Scale Platforms

Olivier Beaumont 1, 2 Nicolas Bonichon 1, 2 Philippe Duchon 1, 2 Lionel Eyraud-Dubois 1, 2 Hubert Larchevêque 1, 2
2 CEPAGE - Algorithmics for computationally intensive applications over wide scale distributed platforms
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : We consider the resource clustering problem in large scale distributed platforms, such as BOINC, WCG or Folding@home. In this context, applications mostly consist in a huge set of independent tasks, with the additional constraint that each task should be executed on a single computing resource. We aim at removing this last constraint, by allowing a task to be executed on a (small) set of resources. Indeed, for problems involving large data sets, very few resources may be able to store the data associated to a task, and therefore may be able to participate in the computations. Our goal is to propose a distributed algorithm for a large set of resources that enables to build clusters, each of which will be responsible for processing a task and storing associated data. From an algorithmic point of view, this corresponds to a bin covering problem with an additional distance constraint. Each resource is associated to a weight (its capacity) and a position in a metric space (its location, based on network coordinates such as those obtained with Vivaldi), and the aim is to build a maximal number of clusters, such that the aggregated power of each cluster (the sum of the weights of its resources) is large enough and such that the distance between two resources belonging to the same cluster is kept small (in order to minimize intra-cluster communication latencies). In this paper, we describe a generic 2-phases algorithm, based on resource augmentation and whose approximation ratio is 1/3. We also propose a distributed version of this algorithm when the metric space is $\mathbb{Q}^D$ (for a small value of $D$) and the $L_{\infty}$ norm is used to define distances. This algorithm takes $O((4^D) \log^2n)$ rounds and $O((4^D)n\log n)$ messages both in expectation and with high probability,where $n$ is the total number of hosts.
Type de document :
Rapport
[Research Report] RR-6883, INRIA. 2008
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00369417
Contributeur : Olivier Beaumont <>
Soumis le : jeudi 19 mars 2009 - 16:31:03
Dernière modification le : jeudi 11 janvier 2018 - 06:22:11
Document(s) archivé(s) le : mardi 8 juin 2010 - 23:40:07

Fichier

RR-6883.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00369417, version 1

Collections

Citation

Olivier Beaumont, Nicolas Bonichon, Philippe Duchon, Lionel Eyraud-Dubois, Hubert Larchevêque. A Distributed Algorithm for Resource Clustering in Large Scale Platforms. [Research Report] RR-6883, INRIA. 2008. 〈inria-00369417〉

Partager

Métriques

Consultations de la notice

294

Téléchargements de fichiers

166