Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed, International Journal of High Performance Computing Applications, vol.20, issue.4, pp.481-494, 2006. ,
DOI : 10.1177/1094342006070078
URL : https://hal.archives-ouvertes.fr/hal-00684943
On the dynamic resource availability in grids, 2007 8th IEEE/ACM International Conference on Grid Computing, pp.26-33, 2007. ,
DOI : 10.1109/GRID.2007.4354112
The Open Grid Services Architecture, Version 1.5, Tech. Rep. GFD-R.80, Open Grid Forum, 2006. ,
Vigne: Towards a Self-healing Grid Operating System, Proceedings of Euro-Par 2006, pp.437-447, 2006. ,
DOI : 10.1007/11823285_45
A Modern Taxonomy of High Availability, 1996. ,
Terascale clusters and the TeraGrid, Proceedings of 6th International Conference/Exhibition on High Performance Computing in Asia Pacific Region, pp.407-413, 2002. ,
SETI@home: an experiment in public-resource computing, Communications of the ACM, vol.45, issue.11, pp.56-61, 2002. ,
DOI : 10.1145/581571.581573
The Nordugrid production grid infrastructure, status and plans, Proceedings. First Latin American Web Congress, p.158, 2003. ,
DOI : 10.1109/GRID.2003.1261711
XtreemOS: a Vision for a Grid Operating System, tech. rep., XtreemOS, 2008. ,
XtreemOS: A Grid Operating System Making your Computer Ready for Participating in Virtual Organizations, 10th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC'07), pp.393-402, 2007. ,
DOI : 10.1109/ISORC.2007.62
URL : https://hal.archives-ouvertes.fr/hal-01271216
Globus toolkit version 4: Software for service-oriented systems, IFIP International Conference on Network and Parallel Computing, pp.2-13, 2005. ,
Vigne: Executing Easily and Efficiently a Wide Range of Distributed Applications in Grids, Proceedings of Euro-Par, pp.394-403, 2007. ,
DOI : 10.1007/978-3-540-74466-5_43
URL : https://hal.archives-ouvertes.fr/hal-00689008
Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems, Proceedings of International Middleware Conference, pp.329-350, 2001. ,
DOI : 10.1007/3-540-45518-3_18
Handling churn in a DHT, Proceedings of the USENIX Annual Technical Conference, pp.127-140, 2004. ,
Job Submission Description Language (JSDL) Specification , Version 1.0 ,
A Step Towards a New Generation of Group Communication Systems, Proceedings of International Middleware Conference, pp.414-432, 2003. ,
DOI : 10.1007/3-540-44892-6_21
Fault tolerance of the application manager in Vigne, tech. rep, 2008. ,
Understanding replication in databases and distributed systems, Proceedings 20th IEEE International Conference on Distributed Computing Systems, pp.464-474, 2000. ,
DOI : 10.1109/ICDCS.2000.840959
Implementing fault-tolerant services using the state machine approach: a tutorial, ACM Computing Surveys, vol.22, issue.4, pp.299-319, 1990. ,
DOI : 10.1145/98163.98167
Dynamic group communication Distributed Computing, pp.359-374, 2006. ,
The Primary-Backup Approach, pp.199-216, 1993. ,
Software-based replication for fault tolerance, Computer, vol.30, issue.4, pp.68-74, 1997. ,
DOI : 10.1109/2.585156
Delta Four: A Generic Architecture for Dependable Distributed Computing, 1991. ,
Semi-passive replication, Proceedings Seventeenth IEEE Symposium on Reliable Distributed Systems (Cat. No.98CB36281), pp.43-50, 1998. ,
DOI : 10.1109/RELDIS.1998.740473
Active replication in Delta-4, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing, pp.28-37, 1992. ,
DOI : 10.1109/FTCS.1992.243618
A survey and comparison of peer-to-peer overlay network schemes, IEEE Communications Surveys and Tutorials, vol.7, pp.72-93, 2005. ,
Service replication in Grids: Ensuring consistency in a dynamic, failure-prone environment, 2008 IEEE International Symposium on Parallel and Distributed Processing, pp.1-7, 2008. ,
DOI : 10.1109/IPDPS.2008.4536211
Highly available and scalable grid services, Proceedings of the Third Workshop on Dependable Distributed Data Management, WDDM '09, pp.18-20, 2009. ,
DOI : 10.1145/1518691.1518697
Fault-tolerant Grid Services Using Primary-Backup: Feasibility and Performance, Proceedings of the 2004 IEEE International Conference on Cluster Computing, pp.105-114, 2004. ,
RPC-V: Toward Fault-Tolerant RPC for Internet Connected Desktop Grids with Volatile Nodes, Proceedings of the ACM/IEEE SC2004 Conference, pp.39-39, 2004. ,
DOI : 10.1109/SC.2004.51
URL : https://hal.archives-ouvertes.fr/in2p3-00457039
Vishwa: A reconfigurable P2P middleware for Grid Computations, 2006 International Conference on Parallel Processing (ICPP'06), pp.381-390, 2006. ,
DOI : 10.1109/ICPP.2006.75
Simple Locality-Aware Coallocation in Peer-to-Peer Supercomputing, Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID '06), pp.14-24, 2006. ,
Total order broadcast and multicast algorithms, ACM Computing Surveys, vol.36, issue.4, pp.372-421, 2004. ,
DOI : 10.1145/1041680.1041682
Unreliable failure detectors for reliable distributed systems, Journal of the ACM, vol.43, issue.2, pp.225-267, 1996. ,
DOI : 10.1145/226643.226647
PaxonDHT: achieving consensus in distributed hash tables, International Symposium on Applications and the Internet (SAINT'06), pp.236-244, 2006. ,
DOI : 10.1109/SAINT.2006.48
The part-time parliament, ACM Transactions on Computer Systems, vol.16, issue.2, pp.133-169, 1998. ,
DOI : 10.1145/279227.279229
Toward Fault- Tolerant P2P Systems: Constructing a Stable Virtual Peer from Multiple Unstable Peers, Proceedings of The First International Conference on Advances in P2P Systems (AP2PS '09), pp.104-110, 2009. ,
Etna: A Fault-tolerant Algorithm for Atomic Mutable DHT Data, 2005. ,
DhtFlex: A Flexible Approach to Enable Efficient Atomic Data Management Tailored for Structured Peer-to-Peer Overlays, 2008 Third International Conference on Internet and Web Applications and Services, pp.377-384, 2008. ,
DOI : 10.1109/ICIW.2008.36
Atomic Commitment in Transactional DHTs, Proceedings of the CoreGRID Symposium, p.151, 2007. ,
DOI : 10.1007/978-0-387-72498-0_14
Deconstructing paxos, ACM SIGACT News, vol.34, issue.1, pp.47-67, 2003. ,
DOI : 10.1145/637437.637447
Comparing the Performance of Two Consensus Algorithms with Centralized and Decentralized Communication Schemes, Tech. Rep. LSR-REPORT, 2004. ,
The weakest failure detector for solving consensus, Journal of the ACM, vol.43, issue.4, pp.685-722, 1996. ,
DOI : 10.1145/234533.234549
Fast Paxos, Distributed Computing, pp.79-103, 2006. ,
DOI : 10.1007/s00446-006-0005-x
Mencius: Building efficient replicated state machines for WANs, Proceedings of the 8th USENIX Symposium on Operating systems Design and Implementation, 2008. ,