Makeflow, Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, SWEET '12, 2012. ,
DOI : 10.1145/2443416.2443417
A bi-criteria scheduling heuristic for distributed embedded systems under reliability and real-time constraints, International Conference on Dependable Systems and Networks, 2004, 2004. ,
DOI : 10.1109/DSN.2004.1311904
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.4, pp.187-198, 2011. ,
DOI : 10.1002/cpe.1631
URL : https://hal.archives-ouvertes.fr/inria-00384363
Scheduling Computational Workflows on Failure-Prone Platforms, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, pp.2-26, 2016. ,
DOI : 10.1109/IPDPSW.2015.33
URL : https://hal.archives-ouvertes.fr/hal-01075100
Assessing general-purpose algorithms to cope with fail-stop and silent errors, ACM Trans. Parallel Computing, vol.3, issue.2, 2016. ,
DOI : 10.1145/2897189
URL : https://hal.archives-ouvertes.fr/hal-01066664
Characterization of scientific workflows, 2008 Third Workshop on Workflows in Support of Large-Scale Science, pp.1-10, 2008. ,
DOI : 10.1109/WORKS.2008.4723958
A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems, Journal of Parallel and Distributed computing, vol.61, issue.6, pp.810-837, 2001. ,
Toward Exascale Resilience, The International Journal of High Performance Computing Applications, vol.23, issue.4, p.1, 2014. ,
DOI : 10.1515/9781400882618-003
URL : http://institute.lanl.gov/resilience/docs/Toward%20Exascale%20Resilience.pdf
Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines, Scientific Programming, vol.5, issue.3, pp.173-184, 1996. ,
DOI : 10.1155/1996/483083
URL : https://doi.org/10.1155/1996/483083
Pegasus, a workflow management system for science automation, Future Generation Computer Systems, vol.46, pp.17-35, 2015. ,
DOI : 10.1016/j.future.2014.10.008
URL : https://manuscript.elsevier.com/S0167739X14002015/pdf/S0167739X14002015.pdf
The structural cause of file size distributions In Modeling , Analysis and Simulation of Computer and Telecommunication Systems, Proceedings. Ninth International Symposium on. IEEE, pp.361-370, 2001. ,
Scheduling for Parallel Processing, 2009. ,
DOI : 10.1007/978-1-84882-310-5
Checkpointing workflows for fail-stop errors, IEEE Trans. Comput, 2018. ,
DOI : 10.1109/tc.2018.2801300
URL : https://hal.archives-ouvertes.fr/hal-01559967
A generic approach to scheduling and checkpointing workflows, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01766352
Pegasus Workflow Generator. https://confluence.pegasus. isi, 2014. ,
A Mapping Algorithm for Parallel Sparse Cholesky Factorization, SIAM Journal on Scientific Computing, vol.14, issue.5, pp.1253-1257, 1993. ,
DOI : 10.1137/0914074
A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.70, issue.5, pp.379-394, 2002. ,
DOI : 10.1109/TC.1973.5009153
Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002. ,
DOI : 10.1109/71.993206
URL : http://meseec.ce.rit.edu/eecc722-fall2002/papers/hc/5/l0260.pdf
On the Optimum Checkpoint Selection Problem, SIAM Journal on Computing, vol.13, issue.3, p.3, 1984. ,
DOI : 10.1137/0213039
URL : http://ecommons.cornell.edu/bitstream/1813/6386/1/83-546.pdf
The Recognition of Series Parallel Digraphs, Proc. of STOC'79, pp.1-12, 1979. ,
DOI : 10.1145/800135.804393
Replication-Based Fault-Tolerance for Large-Scale Graph Processing, 2014 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pp.562-573, 2014. ,
DOI : 10.1109/DSN.2014.58
The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud, Nucleic Acids Research, vol.2011, issue.W1, p.328, 2013. ,
DOI : 10.1186/1752-0509-6-25
Hypertool: a programming aid for message-passing systems, IEEE Transactions on Parallel and Distributed Systems, vol.1, issue.3, pp.3-330, 1990. ,
DOI : 10.1109/71.80160
URL : http://www.eece.unm.edu/~shu/lab/paper/htooltrans.pdf
Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp.1352-1363, 2012. ,
DOI : 10.1109/IPDPS.2012.122