Efficient exploitation of concurrency using graph decomposition, 1990. ,
Starpu: A unified platform for task scheduling on heterogeneous multicore architectures. Concurrency and Computation: Practice and Experience, pp.187-198, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
Cilk: An efficient multithreaded runtime system, 1995. ,
Productive Cluster Programming with OmpSs, Euro-Par 2011 Parallel Processing, pp.555-566, 2011. ,
DOI : 10.1147/rd.515.0593
OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998. ,
DOI : 10.1109/99.660313
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs, 24rd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00735470
The implementation of the Cilk-5 multithreaded language, ACM SIGPLAN Notices, vol.33, issue.5, pp.212-223, 1998. ,
DOI : 10.1145/277652.277725
Bounds for multiprocessor scheduling with resource constraints, SIAM Journal on Computing, vol.4, issue.2, pp.187-200, 1975. ,
KAAPI, Proceedings of the 2007 international workshop on Parallel symbolic computation, PASCO '07, pp.15-23, 2007. ,
DOI : 10.1145/1278177.1278182
URL : https://hal.archives-ouvertes.fr/hal-00647474
XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, 2013. ,
DOI : 10.1109/IPDPS.2013.66
URL : https://hal.archives-ouvertes.fr/hal-00799904
Bounds for Certain Multiprocessing Anomalies, Bell System Technical Journal, vol.45, issue.9, pp.1563-1581, 1966. ,
DOI : 10.1002/j.1538-7305.1966.tb01709.x
Bounds on Multiprocessing Timing Anomalies, SIAM Journal on Applied Mathematics, vol.17, issue.2, pp.416-429, 1969. ,
DOI : 10.1137/0117039
Scheduling Precedence Graphs in Systems with Interprocessor Communication Times, SIAM Journal on Computing, vol.18, issue.2, pp.244-257, 1989. ,
DOI : 10.1137/0218016
A Comparison of Multiprocessor Scheduling Heuristics, 1994 International Conference on Parallel Processing (ICPP'94), pp.243-250, 1994. ,
DOI : 10.1109/ICPP.1994.19
A new clustering algorithm for large communication delays, Proceedings 16th International Parallel and Distributed Processing Symposium, pp.68-73, 2002. ,
DOI : 10.1109/IPDPS.2002.1015571
Mpi: A message-passing interface standard, 2009. ,
Partitioning and scheduling using graph decomposition. Department of Computer Science and Engineering CSE-93-06, 1993. ,
A graph parsing algorithm and implementation, Dept. of Comp. Sci and Eng, 1993. ,
Opencl: A parallel programming standard for heterogeneous computing systems, Computing in science & engineering, vol.12, issue.3, p.66, 2010. ,
Performance-effective and low-complexity task scheduling for heterogeneous computing. Parallel and Distributed Systems, IEEE Transactions on, vol.13, issue.3, pp.260-274, 2002. ,
NP-complete scheduling problems, Journal of Computer and System Sciences, vol.10, issue.3, pp.384-393, 1975. ,
DOI : 10.1016/S0022-0000(75)80008-0
URL : http://doi.org/10.1016/s0022-0000(75)80008-0
Dsc: Scheduling parallel tasks on an unbounded number of processors. Parallel and Distributed Systems, IEEE Transactions on, vol.5, issue.9, pp.951-967, 1994. ,