Job Submission Description Language (JSDL) Specification, V. 1.0, 2005. ,
Use-Cases and Requirements for Grid Checkpoint and Recovery, 2004. ,
URL : https://hal.archives-ouvertes.fr/hal-01272396
Virtual servers and checkpoint/restart in mainstream Linux, ACM SIGOPS Operating Systems Review, vol.42, issue.5, pp.104-113, 2008. ,
DOI : 10.1145/1400097.1400109
Design of the Architecure for Application Execution Management in XtreemOS, 2007. ,
Blocking vs. Non-Blocking Coordinated Checkpointing for Large-Scale Fault Tolerant MPI, ACM/IEEE SC 2006 Conference (SC'06), 2006. ,
DOI : 10.1109/SC.2006.15
URL : https://hal.archives-ouvertes.fr/hal-00684891
The Globus project: a status report, Future Generation Computer Systems, vol.15, issue.5-6, pp.607-621, 1999. ,
DOI : 10.1016/S0167-739X(99)00013-8
The Anatomy of the Grid: Enabling Scalable Virtual Organizations, International Journal of High Performance Computing Applications, vol.15, issue.3, pp.200-222, 2001. ,
DOI : 10.1177/109434200101500302
Fault Tolerant Checkpointing Solution for Clusters and Grid Systems, 2007. ,
SAGA: A Simple API for Grid Applications. High-level application programming on the Grid, Computational Methods in Science and Technology, vol.12, issue.1, 2006. ,
DOI : 10.12921/cmst.2006.12.01.07-20
The Legion vision of a worldwide virtual computer, Communications of the ACM, vol.40, issue.1, pp.39-45, 1997. ,
DOI : 10.1145/242857.242867
The XtreemFS architecture-a case for object-based file systems in Grids, Concurrency and Computation: Practice and Experience, p.20, 2008. ,
DOI : 10.1002/cpe.1304
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI, 2007 IEEE International Parallel and Distributed Processing Symposium, 2007. ,
DOI : 10.1109/IPDPS.2007.370605
The Design and Implementation of Berkeley Lab's Linux Checkpoint/Restart, 2003. ,
Grid checkpointing architecture -integration of low-level checkpointing capabilites with grid, 2007. ,
The evolution of condor checkpointing. Mobility: processes, computers, and agents, pp.163-164, 1999. ,
Adaptive Checkpoint Replication for Supporting the Fault Tolerance of Applications in the Grid, 2008 Seventh IEEE International Symposium on Network Computing and Applications, 2008. ,
DOI : 10.1109/NCA.2008.38
Checkpoint process groups in a grid environment, International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2008. ,
DOI : 10.1109/pdcat.2008.14
An adaptive checkpointing scheme for peer-to-peer based volunteer computing work flows. ArXiv e-prints, 2007. ,
Libckpt: Transparent Checkpointing under Unix, Proceedings of USENIX Winter 1995 Technical Conference, pp.213-224, 1995. ,
Fault-Tolerant Replication Based on Fragmented Objects, DAIS 2006: 6th IFIP WG 6.1 International Conference on Distributed Applications and Interoperable Systems, 2006. ,
DOI : 10.1007/11773887_20
The UNICORE Grid Infrastructure, Scientific Programming, pp.149-157, 2002. ,
DOI : 10.1155/2002/483253
The Lam/Mpi Checkpoint/Restart Framework: System-Initiated Checkpointing, International Journal of High Performance Computing Applications, vol.19, issue.4, pp.479-493, 2005. ,
DOI : 10.1177/1094342005056139
An Architecture for Grid Checkpoint and Recovery Services, 2007. ,