Mining for Availability Models in Large-Scale Distributed Systems:A Case Study of SETI@home

Bahman Javadi; Derrick Kondo; Jean-Marc Vincent; David P. Anderson

Rapport (Rapport De Recherche) Année : 2009

Mining for Availability Models in Large-Scale Distributed Systems:A Case Study of SETI@home

(1) , (1) , (1) , (2)

1
2

Bahman Javadi

Fonction : Auteur correspondant
PersonId : 859591

Connectez-vous pour contacter l'auteur

Middleware efficiently scalable

Derrick Kondo

Fonction : Auteur correspondant
PersonId : 849131

Connectez-vous pour contacter l'auteur

Middleware efficiently scalable

Jean-Marc Vincent

Fonction : Auteur
PersonId : 750922
IdHAL : jean-marc-vincent
ORCID : 0000-0003-3576-2024

Middleware efficiently scalable

David P. Anderson

Fonction : Auteur

Space Sciences Laboratory [Berkeley]

Résumé

In the age of cloud, Grid, P2P, and volunteer distributed computing, large-scale systems with tens of thousands of unreliable hosts are increasingly common. Invariably, these systems are composed of heterogeneous hosts whose individual availability often exhibit different statistical properties (for example stationary versus non-stationary behaviour) and fit different models (for example Exponential, Weibull, or Pareto probability distributions). In this paper, we describe an effective method for discovering subsets of hosts whose availability have similar statistical properties and can be modelled with similar probability distributions. We apply this method with about 230,000 host availability traces obtained from a real large-scale Internet-distributed system, namely SETI@home. We find that about 34% of hosts exhibit availability that is a truly random process, and that these hosts can often be modelled accurately with a few distinct distributions from different families. We believe that this characterization is fundamental in the design of stochastic scheduling algorithms across large-scale systems where host availability is uncertain.

Mots clés

Internet-distributed computing Modelling of availability Characterization of distributed systems

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

mascots09.pdf (3.84 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Bahman Javadi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00375624

Soumis le : mercredi 15 avril 2009-16:22:01

Dernière modification le : jeudi 4 avril 2024-21:15:32

Archivage à long terme le : vendredi 12 octobre 2012-16:40:50

Dates et versions

inria-00375624 , version 1 (15-04-2009)

Identifiants

HAL Id : inria-00375624 , version 1

Citer

Bahman Javadi, Derrick Kondo, Jean-Marc Vincent, David P. Anderson. Mining for Availability Models in Large-Scale Distributed Systems:A Case Study of SETI@home. [Research Report] 2009. ⟨inria-00375624⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG INRIA2 LARA ANR LIG_SIDCH

189 Consultations

197 Téléchargements

Mining for Availability Models in Large-Scale Distributed Systems:A Case Study of SETI@home

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager