M. Albrecht, P. Donnelly, P. Bui, and D. Thain, Makeflow, Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, SWEET '12, 2012.
DOI : 10.1145/2443416.2443417

I. Assayad, A. Girault, and H. Kalla, A bi-criteria scheduling heuristic for distributed embedded systems under reliability and real-time constraints, International Conference on Dependable Systems and Networks, 2004, 2004.
DOI : 10.1109/DSN.2004.1311904

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.4, pp.187-198, 2011.
DOI : 10.1002/cpe.1631

URL : https://hal.archives-ouvertes.fr/inria-00384363

G. Aupy, A. Benoit, H. Casanova, and Y. Robert, Scheduling Computational Workflows on Failure-Prone Platforms, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, pp.2-26, 2016.
DOI : 10.1109/IPDPSW.2015.33

URL : https://hal.archives-ouvertes.fr/hal-01075100

A. Benoit, A. Cavelan, Y. Robert, and H. Sun, Assessing general-purpose algorithms to cope with fail-stop and silent errors, ACM Trans. Parallel Computing, vol.3, issue.2, 2016.
DOI : 10.1145/2897189

URL : https://hal.archives-ouvertes.fr/hal-01066664

S. Bharathi, A. Chervenak, E. Deelman, G. Mehta, M. Su et al., Characterization of scientific workflows, 2008 Third Workshop on Workflows in Support of Large-Scale Science, pp.1-10, 2008.
DOI : 10.1109/WORKS.2008.4723958

D. Tracy, H. J. Braun, N. Siegel, . Beck, L. Ladislau et al., A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems, Journal of Parallel and Distributed computing, vol.61, issue.6, pp.810-837, 2001.

F. Cappello, A. Geist, W. Gropp, S. Kale, B. Kramer et al., Toward Exascale Resilience, The International Journal of High Performance Computing Applications, vol.23, issue.4, p.1, 2014.
DOI : 10.1515/9781400882618-003

URL : http://institute.lanl.gov/resilience/docs/Toward%20Exascale%20Resilience.pdf

J. Choi, J. Jack, S. Dongarra, . Ostrouchov, P. Antoine et al., Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines, Scientific Programming, vol.5, issue.3, pp.173-184, 1996.
DOI : 10.1155/1996/483083

URL : https://doi.org/10.1155/1996/483083

E. Deelman, K. Vahi, G. Juve, M. Rynge, S. Callaghan et al., Pegasus, a workflow management system for science automation, Future Generation Computer Systems, vol.46, pp.17-35, 2015.
DOI : 10.1016/j.future.2014.10.008

URL : https://manuscript.elsevier.com/S0167739X14002015/pdf/S0167739X14002015.pdf

B. Allen and . Downey, The structural cause of file size distributions In Modeling , Analysis and Simulation of Computer and Telecommunication Systems, Proceedings. Ninth International Symposium on. IEEE, pp.361-370, 2001.

M. Drozdowski, Scheduling for Parallel Processing, 2009.
DOI : 10.1007/978-1-84882-310-5

L. Han, L. Canon, H. Casanova, Y. Robert, and F. Vivien, Checkpointing workflows for fail-stop errors, IEEE Trans. Comput, 2018.
DOI : 10.1109/tc.2018.2801300

URL : https://hal.archives-ouvertes.fr/hal-01559967

L. Han, V. L. Fèvre, L. Canon, Y. Robert, and F. Vivien, A generic approach to scheduling and checkpointing workflows, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01766352

. Pegasus, Pegasus Workflow Generator. https://confluence.pegasus. isi, 2014.

A. Pothen and C. Sun, A Mapping Algorithm for Parallel Sparse Cholesky Factorization, SIAM Journal on Scientific Computing, vol.14, issue.5, pp.1253-1257, 1993.
DOI : 10.1137/0914074

T. Tobita and H. Kasahara, A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.70, issue.5, pp.379-394, 2002.
DOI : 10.1109/TC.1973.5009153

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

URL : http://meseec.ce.rit.edu/eecc722-fall2002/papers/hc/5/l0260.pdf

S. Toueg and Ö. Babao?lu, On the Optimum Checkpoint Selection Problem, SIAM Journal on Computing, vol.13, issue.3, p.3, 1984.
DOI : 10.1137/0213039

URL : http://ecommons.cornell.edu/bitstream/1813/6386/1/83-546.pdf

J. Valdes, R. E. Tarjan, and E. L. Lawler, The Recognition of Series Parallel Digraphs, Proc. of STOC'79, pp.1-12, 1979.
DOI : 10.1145/800135.804393

P. Wang, K. Zhang, R. Chen, H. Chen, and H. Guan, Replication-Based Fault-Tolerance for Large-Scale Graph Processing, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp.562-573, 2014.
DOI : 10.1109/DSN.2014.58

K. Wolstencroft, R. Haines, D. Fellows, A. Williams, D. Withers et al., The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud, Nucleic Acids Research, vol.2011, issue.W1, p.328, 2013.
DOI : 10.1186/1752-0509-6-25

M. Y. Wu and D. D. Gajski, Hypertool: a programming aid for message-passing systems, IEEE Transactions on Parallel and Distributed Systems, vol.1, issue.3, pp.3-330, 1990.
DOI : 10.1109/71.80160

URL : http://www.eece.unm.edu/~shu/lab/paper/htooltrans.pdf

F. Zhang, C. Docan, M. Parashar, S. Klasky, N. Podhorszki et al., Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp.1352-1363, 2012.
DOI : 10.1109/IPDPS.2012.122