The utility of exploiting idle workstations for parallel computation, Proceedings of the 1997 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS '97), pp.225-234, 1997.
On-line algorithms, ACM Computing Surveys, vol.31, issue.3es, p.314, 1999.
DOI : 10.1145/333580.333583
Understanding Availability, Proceedings of the 2nd International Workshop on Peer-to-Peer Systems (IPTPS '03), pp.256-267, 2003.
DOI : 10.1007/978-3-540-45172-3_24
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.13.1523
Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed, International Journal of High Performance Computing Applications, vol.20, issue.4, pp.481-494, 2006.
DOI : 10.1177/1094342006070078
URL : https://hal.archives-ouvertes.fr/hal-00684943
A batch scheduler with high level components, IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2005), pp.776-783, 2005.
DOI : 10.1109/CCGRID.2005.1558641
URL : https://hal.archives-ouvertes.fr/hal-00005106
An Introduction to Reliability and Maintainability Engineering, 1997.
The NorduGrid production grid infrastructure, status and plans, Proceedings of the Fourth International Workshop on Grid Computing (GRID 2003), pp.158-165, 2003.
DOI : 10.1109/GRID.2003.1261711
A census of Tandem system availability between 1985 and 1990, IEEE Transactions on Reliability, vol.39, issue.4, pp.409-418, 1990.
DOI : 10.1109/24.58719
Measurement, modeling, and analysis of a peer-to-peer file-sharing workload, 19th ACM Symposium on Operating Systems Principles (SOSP), pp.314-329, 2003.
Scheduling for parallel supercomputing: A historical perspective of achievable utilization, Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP'99), pp.1-16, 1999.
Characterizing and evaluating desktop grids: an empirical study, 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004.
DOI : 10.1109/IPDPS.2004.1302936
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1.4764
On the Kolmogorov-Smirnov Test for the Exponential Distribution with Mean Unknown, Journal of the American Statistical Association, vol.64, issue.325, pp.387-389, 1969.
DOI : 10.1080/01621459.1969.10500983
A longitudinal survey of Internet host reliability, 14th Symposium on Reliable Distributed Systems (SRDS), pp.2-9, 1995.
The Ganglia distributed monitoring system: design, implementation, and experience, Parallel Computing, vol.30, issue.7, 2004.
DOI : 10.1016/j.parco.2004.04.001
Modeling Machine Availability in Enterprise and Wide-Area Distributed Computing Environments, 11th International Euro-Par Conference (Euro-Par 2005), pp.432-441, 2005.
DOI : 10.1007/11549468_50
A large-scale study of failures in high-performance computing systems, International Conference on Dependable Systems and Networks (DSN 2006), pp.249-258, 2006.
On-line scheduling, Online Algorithms, pp.196-231, 1996.
DOI : 10.1007/BFb0029570
Dependability measurement and modeling of a multicomputer system, IEEE Transactions on Computers, vol.42, issue.1, pp.62-75, 1993.
DOI : 10.1109/12.192214
Performance Implications of Failures in Large-Scale Cluster Scheduling, 10th International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), Lecture Notes in Computer Science, vol.3277, pp.233-252, 2004.
DOI : 10.1007/11407522_13