Mining for Statistical Models of Availability in Large-Scale Distributed Systems: An Empirical Study of SETI@home

Bahman Javadi (1), Derrick Kondo (1), Jean-Marc Vincent (1), David P. Anderson (2)
(1) MESCAL - Middleware efficiently scalable
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract: In the age of cloud, Grid, P2P, and volunteer distributed computing, large-scale systems with tens of thousands of unreliable hosts are increasingly common. Invariably, these systems are composed of heterogeneous hosts whose individual availability often exhibits different statistical properties (for example, stationary versus non-stationary behaviour) and fits different models (for example, Exponential, Weibull, or Pareto probability distributions). In this paper, we describe an effective method for discovering subsets of hosts whose availability has similar statistical properties and can be modelled with similar probability distributions. We apply this method to about 230,000 host availability traces obtained from a real large-scale Internet-distributed system, namely SETI@home. We find that about 34% of hosts exhibit availability that is a truly random process, and that these hosts can often be modelled accurately with a few distinct distributions from different families. We believe that this characterization is fundamental to the design of stochastic scheduling algorithms for large-scale systems where host availability is uncertain.
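To make the idea of fitting different distribution families to availability data concrete, the following is a minimal sketch in Python. It is not the authors' method: the abstract does not specify the fitting or selection procedure, so maximum likelihood estimation and the Akaike information criterion (AIC) are used here as stand-ins. All function names are hypothetical, the input is assumed to be a list of strictly positive, non-identical availability interval lengths for one host or one group of hosts, and the demo data is synthetic rather than taken from SETI@home.

```python
import math
import random


def fit_exponential(xs):
    """MLE for the Exponential family: rate = 1 / sample mean."""
    n, total = len(xs), sum(xs)
    rate = n / total
    loglik = n * math.log(rate) - rate * total
    return {"rate": rate}, loglik


def fit_pareto(xs):
    """MLE for Pareto (type I): x_min is the sample minimum, alpha is closed-form."""
    n, x_min = len(xs), min(xs)
    alpha = n / sum(math.log(x / x_min) for x in xs)
    loglik = (n * math.log(alpha) + n * alpha * math.log(x_min)
              - (alpha + 1) * sum(math.log(x) for x in xs))
    return {"x_min": x_min, "alpha": alpha}, loglik


def fit_weibull(xs, lo=0.01, hi=100.0, iters=60):
    """MLE for Weibull: bisection on the shape, then closed-form scale.

    Assumption: the shape lies in [lo, hi].
    """
    n, x_max = len(xs), max(xs)
    ys = [x / x_max for x in xs]  # rescale to (0, 1] so y ** k cannot overflow
    logs = [math.log(y) for y in ys]
    mean_log = sum(logs) / n

    def score(k):
        # Derivative condition of the profile likelihood; increasing in k.
        w = [y ** k for y in ys]
        return sum(wi * li for wi, li in zip(w, logs)) / sum(w) - 1.0 / k - mean_log

    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if score(mid) > 0:
            hi = mid
        else:
            lo = mid
    k = 0.5 * (lo + hi)
    scale = x_max * (sum(y ** k for y in ys) / n) ** (1.0 / k)
    loglik = (n * math.log(k) - n * k * math.log(scale)
              + (k - 1) * sum(math.log(x) for x in xs)
              - sum((x / scale) ** k for x in xs))
    return {"shape": k, "scale": scale}, loglik


def best_model(xs):
    """Return the family with the lowest AIC, plus the AIC of every family."""
    fits = {
        "exponential": (fit_exponential(xs), 1),  # (fit result, parameter count)
        "weibull": (fit_weibull(xs), 2),
        "pareto": (fit_pareto(xs), 2),
    }
    aic = {name: 2 * p - 2 * fit[1] for name, (fit, p) in fits.items()}
    return min(aic, key=aic.get), aic


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic heavy-tailed interval lengths, for illustration only.
    intervals = [rng.paretovariate(1.5) for _ in range(2000)]
    name, aic = best_model(intervals)
    print(name, {k: round(v, 1) for k, v in aic.items()})
```

AIC is used instead of raw log-likelihood because the Weibull family contains the Exponential as a special case (shape 1), so its likelihood can never be lower; the penalty on parameter count keeps the simpler model competitive. A formal goodness-of-fit test could replace AIC in the same structure.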
Document type:
Conference paper
17th IEEE/ACM International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), 2009, London, United Kingdom. IEEE, pp.1-10, 2009, 〈10.1109/MASCOT.2009.5367061〉

https://hal.inria.fr/hal-00788912
Contributor: Arnaud Legrand
Submitted on: Friday, 15 February 2013 - 13:46:20
Last modified on: Monday, 5 October 2015 - 16:58:36

Citation

Bahman Javadi, Derrick Kondo, Jean-Marc Vincent, David P. Anderson. Mining for Statistical Models of Availability in Large-Scale Distributed Systems: An Empirical Study of SETI@home. 17th IEEE/ACM International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), 2009, London, United Kingdom. IEEE, pp.1-10, 2009, 〈10.1109/MASCOT.2009.5367061〉. 〈hal-00788912〉
