T. Nguyên and J. Désidéri, Resilience Issues for Application Workflows on Clouds, Proc. 8th Intl. Conf on Networking and Services (ICNS2012), pp.375-382

E. Deelman and Y. Gil, Managing Large-Scale Scientific Workflows in Distributed Environments: Experiences and Challenges, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06), pp.165-172
DOI : 10.1109/E-SCIENCE.2006.261077

H. Simon, Future directions in High-Performance Computing, Lecture given at the ParCFD 2009 Conference, 2009.

P. Dongarra and . Beckman, The International Exascale Software Roadmap Available at, International Journal of High Performance Computer Applications, vol.25, issue.1, pp.77-83, 2011.
DOI : 10.1177/1094342010391989

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.188.5128

R. Gupta and P. Beckman, CIFTS: A Coordinated Infrastructure for Fault-Tolerant Systems, 2009 International Conference on Parallel Processing, pp.145-156, 2009.
DOI : 10.1109/ICPP.2009.20

A. Bachmann, M. Kunde, D. Seider, and A. Schreiber, Advances in Generalization and Decoupling of Software Parts in a Scientific Simulation Workflow System, Proc. 4th Intl. Conf. Advanced Engineering Computing and Applications in Sciences, pp.247-258, 2010.

. Florence, I, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01195470

E. C. Joseph, A Strategic Agenda for European Leadership in Supercomputing: HPC 2020 " , IDC Final Report of the HPC Study for the DG Information Society of the EC, 2010.

T. Nguyên and J. A. Désidéri, A Distributed Workflow Platform for High-Performance Simulation, Intl. Journal on Advances in Intelligent Systems, vol.4, issue.3&4

E. Sindrilaru, A. Costan, and V. Cristea, Fault-Tolerance and Recovery in Grid Workflow Mangement Systems, Proc. 4th Intl. Conf. on Complex, Intelligent and Software Intensive Systems, pp.162-173

. Apache, The Apache Foundation

P. Beckman, Facts and Speculations on Exascale: Revolution or Evolution? " , Keynote Lecture, Proc. 17th European Conf. Parallel and Distributed Computing, pp.135-142, 2011.

P. Kovatch, M. Ezell, and R. Braby, The Malthusian Catastrophe Is Upon Us! Are the Largest HPC Machines Ever Up?, Proc. Resilience Workshop at 17th European Conf. Parallel and Distributed Computing, pp.255-262, 2011.
DOI : 10.1007/978-3-642-29740-3_25

R. Riesen, K. Ferreira, M. R. Varela, M. Taufer, and A. Rodrigues, Simulating Application Resilience at Exascale, Proc. Resilience Workshop at 17th European Conf. Parallel and Distributed Computing, pp.417-425, 2011.
DOI : 10.1007/978-3-642-29740-3_26

P. Bridges, Cooperative Application/OS DRAM Fault Recovery, Proc. Resilience Workshop at 17th European Conf. Parallel and Distributed Computing, pp.213-222, 2011.
DOI : 10.1007/978-3-642-29740-3_28

F. Capello, Toward Exascale Resilience, International Journal of High Performance Computing Applications, vol.23, issue.4, 2009.
DOI : 10.1177/1094342009347767

A. Moody, G. Bronevetsky, K. Mohror, B. De-supinski, and . Design, Modeling and evaluation of a Scalable Multilevel checkpointing System, Proc. ACM/IEEE Intl. Conf. for High Performance Computing, Networking, Storage and Analysis (SC10), pp.73-86, 2010.

M. Adams, A. Ter-hofstede, L. Rosa, and M. , Open Source Software for Workflow Management: The Case of YAWL, IEEE Software, vol.28, issue.3, pp.16-19, 2011.
DOI : 10.1109/MS.2011.58

N. Russell and A. Ter-hofstede, Surmounting BPM challenges: the YAWL story Special Issue Paper on Research and Development on Flexible Process Aware Information Systems, Computer Science, vol.23, issue.2, pp.67-79, 2009.

A. Lachlan, W. Van-der-aalst, M. Dumas, and A. Ter-hofstede, Dimensions of coupling in middleware, Concurrency and Computation: Practice and Experience, pp.233-2269, 2009.

L. Bautista-gomez, FTI, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11
DOI : 10.1145/2063384.2063427

URL : https://hal.archives-ouvertes.fr/hal-00721216

B. Nicolae and F. Capello, BlobCR, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.145-156, 2011.
DOI : 10.1145/2063384.2063429

URL : https://hal.archives-ouvertes.fr/inria-00601865

B. Raghavan, P. Breitkopf, K. Jeffrey, and B. Neidecker-lutz, Asynchronous evolutionary shape optimization based on high-quality surrogates: application to an air-conditioning duct, The Future of Cloud Computing " . Expert Group Report. European Commission. Information Society & Media Directorate- General. Software & Service Architectures, Infrastructures and Engineering Unit, 2010.
DOI : 10.1007/s00366-012-0263-0

URL : https://hal.archives-ouvertes.fr/hal-00982653

P. Latchoumy and P. Khader, Survey On Fault Tolerance In Grid Computing, International Journal of Computer Science & Engineering Survey, vol.2, issue.4, 2011.
DOI : 10.5121/ijcses.2011.2407

R. Garg and A. K. Singh, Fault Tolerance In Grid Computing: State of the Art and Open Issues, International Journal of Computer Science & Engineering Survey, vol.2, issue.1, 2011.
DOI : 10.5121/ijcses.2011.2107

E. Heien, Modeling and tolerating heterogeneous failures in large parallel systems, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-45, 2011.
DOI : 10.1145/2063384.2063444

M. Bougeret, Checkpointing strategies for parallel jobs, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, pp.1-11, 2011.
DOI : 10.1145/2063384.2063428

URL : https://hal.archives-ouvertes.fr/hal-00738504

S. Llnl-sc, The opportunities and Challenges of Exascale Computing " . Summary Report of the Advanced Scientific Computing Advisory Subcommittee, 2010.

X. Liu, The Design of Cloud Workflow Systems, 2012.
DOI : 10.1007/978-1-4614-1933-4