H. Arabnejad, J. Barbosa, and F. Suter, Fair Resource Sharing for Dynamic Scheduling of Workflows on Heterogeneous Systems, chapter in High-Performance Computing on Complex Environments, Parallel and Distributed Computing Series, 2014.

G. Antoniu, A. Costan, J. Bigot, F. Desprez, G. Fedak, S. Gault et al., Scalable Data Management for Map-Reduce-Based Data-Intensive Applications: a View for Cloud and Hybrid Infrastructures, pp.150-170, 2013.

H. Casanova, F. Desprez, and F. Suter, On cluster resource allocation for multiple parallel task graphs, Journal of Parallel and Distributed Computing, vol.70, issue.12, pp.1193-1203, 2010.
DOI : 10.1016/j.jpdc.2010.08.017

URL : https://hal.archives-ouvertes.fr/hal-00539777

P. Dutot, T. N'Takpé, F. Suter, and H. Casanova, Scheduling Parallel Task Graphs on (Almost) Homogeneous Multicluster Platforms, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.7, pp.940-952, 2009.
DOI : 10.1109/TPDS.2009.11

URL : https://hal.archives-ouvertes.fr/inria-00347273

G. Antoniu, J. Bigot, C. Blanchet, L. Bougé, F. Briant et al., Towards Scalable Data Management for Map-Reduce-based Data-Intensive Applications on Cloud and Hybrid Infrastructures, Proceedings of the First International IBM Cloud Academy Conference, pp.272-290, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00767029

P. Bédaride, A. Degomme, S. Genaud, A. Legrand, G. S. Markomanolis et al., Toward Better Simulation of MPI Applications on Ethernet/TCP Networks, Proceedings of the 4th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), 2013.

L. Bobelin, A. Legrand, D. A. González Márquez, P. Navarro, M. Quinson et al., Scalable Multi-Purpose Network Representation for Large Scale Distributed System Simulation, Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp.220-227, 2012.

E. Caron, F. Desprez, A. Muresan, and F. Suter, Budget Constrained Resource Allocation for Non-deterministic Workflows on an IaaS Cloud, Proceedings of the 12th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), pp.186-201, 2012.
DOI : 10.1007/978-3-642-33078-0_14

H. Casanova, F. Desprez, and F. Suter, From Heterogeneous Task Scheduling to Heterogeneous Mixed Parallel Scheduling, Proceedings of the 10th International Euro-Par Conference, pp.230-237, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00071583

E. Caron, F. Desprez, and F. Suter, Out-of-Core and Pipeline Techniques for Wavefront Algorithms, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.318

URL : https://hal.archives-ouvertes.fr/hal-00008798

H. Casanova, F. Desprez, and F. Suter, Minimizing Stretch and Makespan of Multiple Parallel Task Graphs via Malleable Allocations, 2010 39th International Conference on Parallel Processing, pp.71-80, 2010.
DOI : 10.1109/ICPP.2010.16

URL : https://hal.archives-ouvertes.fr/hal-00533926

H. Casanova, A. Giersch, A. Legrand, M. Quinson, and F. Suter, SimGrid: a Sustained Effort for the Versatile Simulation of Large Scale Distributed Systems, Proceedings of the 1st Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE), 2013.
URL : https://hal.archives-ouvertes.fr/hal-00926437

P. Clauss, J. Gustedt, and F. Suter, Out-of-Core Wavefront Computations with Reduced Synchronization, 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008), pp.293-300, 2008.
DOI : 10.1109/PDP.2008.30

URL : https://hal.archives-ouvertes.fr/inria-00176084

P. Clauss, M. Stillwell, S. Genaud, F. Suter, H. Casanova et al., Single Node On-Line Simulation of MPI Applications with SMPI, 2011 IEEE International Parallel & Distributed Processing Symposium, 2011.
DOI : 10.1109/IPDPS.2011.69

URL : https://hal.archives-ouvertes.fr/inria-00527150

F. Desprez, G. S. Markomanolis, M. Quinson, and F. Suter, Assessing the Performance of MPI Applications through Time-Independent Trace Replay, 2011 40th International Conference on Parallel Processing Workshops, pp.467-476, 2011.
DOI : 10.1109/ICPPW.2011.33

URL : https://hal.archives-ouvertes.fr/inria-00546992

F. Desprez, G. S. Markomanolis, and F. Suter, Improving the Accuracy and Efficiency of Time-Independent Trace Replay, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012.
DOI : 10.1109/SC.Companion.2012.64

URL : https://hal.archives-ouvertes.fr/hal-00739082

F. Desprez and F. Suter, A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp.243-252, 2010.
DOI : 10.1109/CCGRID.2010.43

URL : https://hal.archives-ouvertes.fr/hal-00533904

S. Hunold, H. Casanova, and F. Suter, From Simulation to Experiment: A Case Study on Multiprocessor Task Scheduling, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp.660-667, 2011.
DOI : 10.1109/IPDPS.2011.201

URL : https://hal.archives-ouvertes.fr/hal-00627842

S. Hunold, R. Hoffmann, and F. Suter, Jedule: A Tool for Visualizing Schedules of Parallel Applications, 2010 39th International Conference on Parallel Processing Workshops, pp.169-178, 2010.
DOI : 10.1109/ICPPW.2010.34

URL : https://hal.archives-ouvertes.fr/hal-00533962

S. Hunold, T. Rauber, and F. Suter, Redistribution Aware Two-Step Scheduling for Mixed-Parallel Applications, Proceedings of the IEEE International Conference on Cluster Computing (Cluster), pp.50-58, 2008.

S. Hunold, T. Rauber, and F. Suter, Scheduling Dynamic Workflows onto Clusters of Clusters using Postponing, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID), pp.669-674, 2008.
DOI : 10.1109/CCGRID.2008.44

URL : https://hal.archives-ouvertes.fr/inria-00329779

T. N'Takpé and F. Suter, Critical path and area based scheduling of parallel task graphs on heterogeneous platforms, Proceedings of the 12th International Conference on Parallel and Distributed Systems (ICPADS), pp.3-10, 2006.

T. N'Takpé and F. Suter, Self-Constrained Resource Allocation for Parallel Task Graph Scheduling on Shared Computing Grids, Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS), pp.36-41, 2007.

T. N'Takpé and F. Suter, Concurrent Scheduling of Parallel Task Graphs on Multi-Clusters Using Constrained Resource Allocations, Proceedings of the 10th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC), 2009.

T. N'Takpé, F. Suter, and H. Casanova, A Comparison of Scheduling Approaches for Mixed-Parallel Applications on Heterogeneous Platforms, Proceedings of the 6th International Symposium on Parallel and Distributed Computing (ISPDC), 2007.

M. Quinson, L. Bobelin, and F. Suter, Synthesizing Generic Experimental Environments for Simulation, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp.222-229, 2010.
DOI : 10.1109/3PGCIC.2010.37

URL : https://hal.archives-ouvertes.fr/inria-00502839

F. Desprez, G. S. Markomanolis, and F. Suter, Evaluation of Profiling Tools for the Acquisition of Time-Independent Traces, Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (Grid), Institut National de Recherche en Informatique et en Automatique (INRIA), pp.2-9, 2007.

M. Frincu, M. Quinson, and F. Suter, Handling Very Large Platforms with the New SimGrid Platform Description Formalism, Institut National de Recherche en Informatique et en Automatique (INRIA), 2008.
URL : https://hal.archives-ouvertes.fr/inria-00256883

G. S. Markomanolis and F. Suter, Time-Independent Trace Acquisition Framework - A Grid'5000 How-to, Institut National de Recherche en Informatique et en Automatique (INRIA), 2011.

F. Suter and H. Casanova, Extracting Synthetic Multi-Cluster Platform Configurations from Grid'5000 for Driving Simulation Experiments, Institut National de Recherche en Informatique et en Automatique (INRIA), 2007.
URL : https://hal.archives-ouvertes.fr/inria-00166181

E. Caron, F. Desprez, M. Quinson, and F. Suter, Performance Evaluation of Linear Algebra Routines, Special issue on Clusters and Computational Grids for Scientific Computing (CCGSC'02), pp.373-390, 2004.
DOI : 10.1177/1094342004046046

URL : https://hal.archives-ouvertes.fr/inria-00000234

E. Caron, F. Desprez, and F. Suter, Parallel Extension of a Dynamic Performance Forecasting Tool, Scalable Computing: Practice and Experience, pp.57-69, 2005.

F. Desprez and F. Suter, Impact of Mixed-Parallelism on Parallel Implementations of Strassen and Winograd Matrix Multiplication Algorithms, Concurrency and Computation: Practice and Experience, pp.771-797, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00072106

V. Boudet, F. Desprez, and F. Suter, One-step algorithm for mixed data and task parallel scheduling without data replication, Proceedings International Parallel and Distributed Processing Symposium, 2003.
DOI : 10.1109/IPDPS.2003.1213127

URL : https://hal.archives-ouvertes.fr/lirmm-00269808

E. Caron, F. Desprez, F. Lombard, J.-M. Nicod, M. Quinson et al., A Scalable Approach to Network Enabled Servers, Proceedings of the 8th International EuroPar Conference, pp.907-910, 2002.
DOI : 10.1007/3-540-45706-2_128

URL : https://hal.archives-ouvertes.fr/inria-00072087

P. Combes, F. Lombard, M. Quinson, and F. Suter, A Scalable Approach to Network Enabled Servers, Seventh Asian Computing Science Conference, pp.110-124, 2002.
DOI : 10.1007/3-540-36184-7_12

URL : https://hal.archives-ouvertes.fr/inria-00072087

E. Caron and F. Suter, Parallel Extension of a Dynamic Performance Forecasting Tool, Proceedings of the International Symposium on Parallel and Distributed Computing (ISPDC'02), pp.80-93, 2002.
URL : https://hal.archives-ouvertes.fr/inria-00072118

F. Desprez, M. Quinson, and F. Suter, Dynamic Performance Forecasting for Network-Enabled Servers in a Heterogeneous Environment, Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA), volume III, pp.1421-1427, 2001.
URL : https://hal.archives-ouvertes.fr/inria-00072267

F. Desprez and F. Suter, Mixed parallel implementations of the top level step of Strassen and Winograd matrix multiplication algorithms, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001, 2001.
DOI : 10.1109/IPDPS.2001.924938

V. Adve, R. Bagrodia, E. Deelman, and R. Sakellariou, Compiler-Optimized Simulation of Large-Scale Applications on High Performance Architectures, Journal of Parallel and Distributed Computing, vol.62, issue.3, pp.393-426, 2002.
DOI : 10.1006/jpdc.2001.1800

R. Agrawal and V. Sadaphal, Batch Systems: Optimal Scheduling and Processor Optimization, Proceedings of 18th International Conference on High Performance Computing (HiPC), 2011.

G. Aldering, G. Adam, P. Antilogus, P. Astier, R. Bacon et al., Overview of the Nearby Supernova Factory, Survey and Other Telescope Technologies and Discoveries, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, pp.61-72, 2002.

J. Alimi, V. Bouillot, Y. Rasera, V. Reverdy, P. Corasaniti et al., First-ever full observable universe simulation, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis, 2012.
DOI : 10.1109/SC.2012.58

I. Altintas, S. Bhagwanani, D. Buttler, S. Chandra, Z. Cheng et al., A modeling and execution environment for distributed scientific workflows, 15th International Conference on Scientific and Statistical Database Management, pp.247-250, 2003.
DOI : 10.1109/SSDM.2003.1214989

G. Amdahl, Validity of the single processor approach to achieving large scale computing capabilities, Proceedings of the April 18-20, 1967, Spring Joint Computer Conference, AFIPS '67 (Spring), pp.483-485, 1967.
DOI : 10.1145/1465482.1465560

D. P. Anderson, BOINC: A System for Public-Resource Computing and Storage, Proceedings of the 5th International Workshop on Grid Computing, pp.4-10, 2004.

H. Arabnejad and J. Barbosa, Performance Evaluation of List Based Scheduling on Heterogeneous Systems, Proceedings of the Euro-Par 2011: Parallel Processing Workshops, pp.440-449, 2011.
DOI : 10.1007/978-3-642-29737-3_49

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a Unified platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurrency and Computation: Practice and Experience, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363

R. Badia, J. Labarta, J. Giménez, and F. Escalé, Dimemas: Predicting MPI applications behavior in Grid environments, Proceedings of the Workshop on Grid Applications and Programming Tools, 2003.

R. Bagrodia, E. Deelman, and T. Phan, Parallel Simulation of Large-Scale Parallel Applications, International Journal of High Performance Computing Applications, vol.15, issue.1, pp.3-12, 2001.
DOI : 10.1177/109434200101500101

D. Bailey, E. Barszcz, J. Barton, D. Browning, R. Carter et al., The NAS parallel benchmarks: summary and preliminary results, Proceedings of the 1991 ACM/IEEE conference on Supercomputing, Supercomputing '91, pp.158-165, 1991.
DOI : 10.1145/125826.125925

S. Bansal, P. Kumar, and K. Singh, An improved two-step algorithm for task and data parallel scheduling in distributed memory machines, Parallel Computing, vol.32, issue.10, pp.759-774, 2006.
DOI : 10.1016/j.parco.2006.08.004

A. Barabási and R. Albert, Emergence of Scaling in Random Networks, Science, vol.286, pp.509-512, 1999.

N. Bard, R. Bolze, E. Caron, F. Desprez, M. Heymann et al., Décrypthon Grid - Grid Resources Dedicated to Neuromuscular Disorders, Proceedings of the 8th HealthGrid conference, 2010.

O. Beaumont, L. Eyraud-Dubois, and Y. Won, Using the Last-Mile Model as a Distributed Scheme for Available Bandwidth Prediction, Proceedings of the 17th International European Conference on Parallel and Distributed Computing (EuroPar), pp.103-116, 2011.
DOI : 10.1145/1384529.1375493

URL : https://hal.archives-ouvertes.fr/inria-00588651

W. Bell, D. Cameron, L. Capozza, P. Millar, K. Stockinger et al., Optorsim: A Grid Simulator for Studying Dynamic Data Replication Strategies, International Journal of High Performance Computing Applications, vol.17, issue.4, pp.403-416, 2003.
DOI : 10.1177/10943420030174005

M. A. Bender, S. Chakrabarti, and S. Muthukrishnan, Flow and Stretch Metrics for Scheduling Continuous Job Streams, Proceedings of the Ninth Annual ACM/SIAM Symposium on Discrete Algorithms (SODA), pp.270-279, 1998.

G. B. Berriman, J. Good, A. Laity, A. Bergou, J. Jacob et al., Montage: a Grid Enabled Image Mosaic Service for the National Virtual Observatory, Astronomical Data Analysis Software and Systems (ADASS) XIII, p.593, 2004.

D. Biswas and B. Genest, Minimal Observability for Transactional Hierarchical Services, Proceedings of the Twentieth International Conference on Software Engineering & Knowledge Engineering (SEKE), pp.531-536, 2008.

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, A. Haidar et al., Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with PLASMA, Proceedings of the 12th Workshop on Parallel and Distributed Scientific and Engineering Computing, pp.1432-1441, 2011.

G. Bosilca, A. Bouteiller, A. Danalis, T. Hérault, P. Lemarinier et al., DAGuE: A generic distributed DAG engine for High Performance Computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012.
DOI : 10.1016/j.parco.2011.10.003

T. Braun, H. Siegel, N. Beck, L. Bölöni, M. Maheswaran et al., A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems, Journal of Parallel and Distributed Computing, vol.61, issue.6, pp.810-837, 2001.

S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci, A Portable Programming Interface for Performance Evaluation on Modern Processors, International Journal of High Performance Computing Applications, vol.14, issue.3, pp.189-204, 2000.
DOI : 10.1177/109434200001400303

K. Butler, P. McDaniel, and W. Aiello, Optimizing BGP security by exploiting path stability, Proceedings of the 13th ACM conference on Computer and communications security, CCS '06, pp.298-310, 2006.
DOI : 10.1145/1180405.1180442

R. Buyya and M. Murshed, GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing, Concurrency and Computation: Practice and Experience, issue.13-15, p.1175, 2002.

K. Calvert, M. Doar, and E. Zegura, Modeling Internet topology, IEEE Communications Magazine, vol.35, issue.6, pp.160-168, 1997.
DOI : 10.1109/35.587723

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.54.2169

S. Camarasu-pop, T. Glatard, and H. Benoit-cattin, Simulating Application Workflows and Services Deployed on the European Grid Infrastructure, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, pp.18-25, 2013.
DOI : 10.1109/CCGrid.2013.13

URL : https://hal.archives-ouvertes.fr/hal-00843917

Y. Caniou and J. Gay, Simbatch: An API for Simulating and Predicting the Performance of Parallel Resources Managed by Batch Systems, Proceedings of the Workshop on Secure, Trusted, Manageable and Controllable Grid Services (SGS) in conjunction with EuroPar'08, pp.223-234, 2008.
DOI : 10.1163/1574040054861267

L.-C. Canon, O. Dubuisson, J. Gustedt, and E. Jeannot, Defining and controlling the heterogeneity of a cluster: The Wrekavoc tool, Journal of Systems and Software, vol.83, issue.5, pp.786-802, 2010.
DOI : 10.1016/j.jss.2009.11.734

N. Capit, G. Da Costa, Y. Georgiou, G. Huard, C. Martin et al., A batch scheduler with high level components, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, pp.776-783, 2005.
DOI : 10.1109/CCGRID.2005.1558641

URL : https://hal.archives-ouvertes.fr/hal-00005106

E. Caron and F. Desprez, Diet: A Scalable Toolbox to Build Network Enabled Servers on the Grid, International Journal of High Performance Computing Applications, vol.20, issue.3, pp.335-352, 2006.
DOI : 10.1177/1094342006067472

URL : https://hal.archives-ouvertes.fr/hal-01429867

E. Caron, V. Garonne, and A. Tsaregorodtsev, Definition, modelling and simulation of a grid computing scheduling system for high throughput computing, Future Generation Computer Systems, vol.23, issue.8, pp.968-976, 2007.
DOI : 10.1016/j.future.2007.04.008

URL : https://hal.archives-ouvertes.fr/in2p3-00421380

H. Casanova, Simgrid: a toolkit for the simulation of application scheduling, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.430-437, 2001.
DOI : 10.1109/CCGRID.2001.923223

H. Casanova, A. Legrand, and L. Marchal, Scheduling Distributed Applications: the SimGrid Simulation Framework, Proceedings of the third IEEE International Symposium on Cluster Computing and the Grid (CCGrid), pp.138-145, 2003.
URL : https://hal.archives-ouvertes.fr/hal-00789451

H. Casanova, A. Legrand, and M. Quinson, SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), 2008.
DOI : 10.1109/UKSIM.2008.28

URL : https://hal.archives-ouvertes.fr/inria-00260697

H. Casanova, G. Obertelli, F. Berman, and R. Wolski, The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid, Proceedings of the High Performance Networking and Computing Conference (SC), 2000.

S. Chakrabarti, J. Demmel, and K. Yelick, Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs, Journal of Parallel and Distributed Computing, vol.47, issue.2, pp.168-184, 1997.
DOI : 10.1006/jpdc.1997.1413

G. Chapman, J. Cleese, T. Gilliam, and E. Idle, Monty Python and the Holy Grail, 2002.

H. Chen and M. Maheswaran, Distributed Dynamic Scheduling of Composite Tasks on Grid Systems, Proceedings of the 12th Heterogeneous Computing Workshop (HCW), 2002.

Y. Chen, R. Griffith, J. Liu, R. H. Katz, and A. D. Joseph, Understanding TCP incast throughput collapse in datacenter networks, Proceedings of the 1st ACM workshop on Research on enterprise networking, WREN '09, pp.73-82, 2009.
DOI : 10.1145/1592681.1592693

W. Cirne and F. Berman, A model for moldable supercomputer jobs, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001, 2001.
DOI : 10.1109/IPDPS.2001.925004

J. Collins, D. Tullsen, H. Wong, and J. Shen, Dynamic speculative precomputation, Proceedings. 34th ACM/IEEE International Symposium on Microarchitecture. MICRO-34, pp.306-317, 2001.
DOI : 10.1109/MICRO.2001.991128

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.23.763

D. Cordeiro, G. Mounié, S. Perarnau, D. Trystram, J. Vincent et al., Random graph generation for scheduling simulations, Proceedings of the 3rd International ICST Conference on Simulation Tools and Techniques, 2010.
DOI : 10.4108/ICST.SIMUTOOLS2010.8667

URL : https://hal.archives-ouvertes.fr/hal-00471255

P. Couvares, T. Kosar, A. Roy, J. Weber, and K. Wenger, Workflow Management in Condor, Workflows for e-Science, pp.357-375, 2007.
DOI : 10.1007/978-1-84628-757-2_22

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., LogP: Towards a Realistic Model of Parallel Computation, Proceedings of the fourth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp.1-12, 1993.

F. Dabek, R. Cox, M. F. Kaashoek, and R. Morris, Vivaldi: A Decentralized Network Coordinate System, Proceedings of the ACM SIGCOMM 2004 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pp.15-26, 2004.

S. De Munck, K. Vanmechelen, and J. Broeckhove, Improving the Scalability of SimGrid Using Dynamic Routing, Proceedings of the 9th International Conference on Computational Science (ICCS), pp.406-415, 2009.
DOI : 10.1007/978-3-642-01970-8_40

J. Dean and S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

D. Debels and M. Vanhoucke, The Discrete Time/Cost Trade-off Problem: Extensions and Heuristic Procedures, Journal of Scheduling, vol.10, issue.4-5, 2007.

E. Deelman, G. Singh, M. Su, J. Blythe, Y. Gil et al., Pegasus: A Framework for Mapping Complex Scientific Workflows onto Distributed Systems, Scientific Programming, vol.13, issue.3, pp.219-237, 2005.
DOI : 10.1155/2005/128026

F. Desprez and J. Rouzaud-cornabas, SimGrid Cloud Broker: Simulating the Amazon AWS Cloud, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00909120

R. Dick, D. Rhodes, and W. Wolf, TGFF: Task Graphs for Free, Proceedings of the sixth international workshop on Hardware/software codesign, CODES/CASHE '98, pp.97-101, 1998.
DOI : 10.1145/278241.278309

P. Dickens, P. Heidelberger, and D. Nicol, Parallelized direct execution simulation of message-passing parallel programs, IEEE Transactions on Parallel and Distributed Systems, vol.7, issue.10, pp.1090-1105, 1996.
DOI : 10.1109/71.539740

R. Dolbeau, S. Bihan, and F. Bodin, HMPP: A Hybrid Multi-core Parallel Programming Environment, Proceedings of the First Workshop on General Purpose Processing on Graphics Processing Units, 2007.

B. Donassolo, H. Casanova, A. Legrand, and P. Velho, Fast and scalable simulation of volunteer computing systems using SimGrid, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.605-612, 2010.
DOI : 10.1145/1851476.1851565

URL : https://hal.archives-ouvertes.fr/hal-00690629

B. Donassolo, A. Legrand, and C. Geyer, Non-cooperative Scheduling Considered Harmful in Collaborative Volunteer Computing Environments, 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2011.
DOI : 10.1109/CCGrid.2011.34

URL : https://hal.archives-ouvertes.fr/hal-00788792

A. Dorigo, P. Elmer, F. Furano, and A. Hanushevsky, XROOTD - A Highly Scalable Architecture for Data Access, WSEAS Transactions on Computers, vol.1, issue.4, 2005.

A. B. Downey, A Model For Speedup of Parallel Programs, 1997.

P. Dutot, Hierarchical Scheduling for Moldable Tasks, Proceedings of the 11th International Euro-Par Conference, pp.302-311, 2005.
DOI : 10.1007/11549468_35

URL : https://hal.archives-ouvertes.fr/inria-00001077

P. Dutot, L. Eyraud-Dubois, G. Mounié, and D. Trystram, Bi-criteria algorithm for scheduling jobs on cluster platforms, Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures, SPAA '04, pp.125-132, 2004.
DOI : 10.1145/1007912.1007932

URL : https://hal.archives-ouvertes.fr/hal-00001520

P. Dutot, G. Mounié, and D. Trystram, Scheduling Parallel Tasks - Approximation Algorithms, Handbook of Scheduling, chapter 26, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00003126

M. Faloutsos, P. Faloutsos, and C. Faloutsos, On Power-Law Relationships of the Internet Topology, ACM Annual Conference of the Special Interest Group on Data Communication (SIGCOMM), pp.251-262, 1999.

D. Feitelson, Workload Modeling for Computer Systems Performance Evaluation
DOI : 10.1017/CBO9781139939690

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.643.159

D. Feitelson and L. Rudolph, Toward convergence in job schedulers for parallel supercomputers, Proceedings of the Second Workshop on Job Scheduling Strategies for Parallel Processing, pp.1-26, 1996.
DOI : 10.1007/BFb0022284

D. Feitelson, L. Rudolph, U. Schwiegelshohn, K. Sevcik, and P. Wong, Theory and practice in parallel job scheduling, Proceedings of the Third Workshop on Job Scheduling Strategies for Parallel Processing, pp.1-34, 1997.
DOI : 10.1007/3-540-63574-2_14

D. Feitelson, D. Tsafrir, and D. Krakov, Experience with using the Parallel Workloads Archive, Journal of Parallel and Distributed Computing, vol.74, issue.10, 2012.
DOI : 10.1016/j.jpdc.2014.06.013

I. Foster and C. Kesselman, Globus: a Metacomputing Infrastructure Toolkit, International Journal of High Performance Computing Applications, vol.11, issue.2, pp.115-128, 1997.
DOI : 10.1177/109434209701100205

I. Foster, C. Kesselman, and S. Tuecke, The Anatomy of the Grid: Enabling Scalable Virtual Organizations, International Journal of High Performance Computing Applications, vol.15, issue.3, pp.200-222, 2001.
DOI : 10.1177/109434200101500302

P. Fuhrmann and V. Gülzow, dCache, Storage System for the Future
DOI : 10.1007/11823285_116

E. Gabriel, G. Fagg, G. Bosilca, T. Angskun, J. Dongarra et al., Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, Proceedings of the 11th European PVM/MPI Users' Group Meeting, pp.97-104, 2004.
DOI : 10.1007/978-3-540-30218-6_19

M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, 1979.

M. Geimer, F. Wolf, B. Wylie, and B. Mohr, A scalable tool architecture for diagnosing wait states in massively parallel applications, Parallel Computing, vol.35, issue.7, pp.375-388, 2009.
DOI : 10.1016/j.parco.2009.02.003

W. Gentzsch, Sun Grid Engine: towards creating a compute power grid, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.35-39, 2001.
DOI : 10.1109/CCGRID.2001.923173

T. Glatard and S. Camarasu-pop, Modelling Pilot-Job Applications on Production Grids, Proceedings of the 7th International workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (Heteropar 09), pp.140-149, 2009.
DOI : 10.1007/978-3-642-14122-5_18

T. Glatard and S. Camarasu-Pop, A model of pilot-job resource provisioning on production grids, Parallel Computing, vol.37, issue.10-11, p.684, 2011.
DOI : 10.1016/j.parco.2011.04.001

T. Glatard, J. Montagnat, D. Lingrand, and X. Pennec, Flexible and Efficient Workflow Deployment of Data-Intensive Applications On Grids With MOTEUR, International Journal of High Performance Computing Applications, vol.22, issue.3, pp.347-360, 2008.
DOI : 10.1177/1094342008096067

R. Graves, T. Jordan, S. Callaghan, E. Deelman, E. Field et al., CyberShake: A Physics-Based Seismic Hazard Model for Southern California, Pure and Applied Geophysics, vol.168, issue.3-4, p.367, 2011.
DOI : 10.1007/s00024-010-0161-6

W. Gropp, MPICH2: A New Start for MPI Implementations, Proceedings of the 9th European PVM/MPI Users' Group Meeting, p.7, 2002.
DOI : 10.1007/3-540-45825-5_5

W. Gropp, E. Lusk, and A. Skjellum, Using MPI: Portable Parallel Programming with the Message Passing Interface. Scientific And Engineering Computation Series, 1999.

D. Grove and P. Coddington, Communication Benchmarking and Performance Modelling of MPI Programs on Cluster Computers, The Journal of Supercomputing, vol.23, issue.1/2, pp.201-217, 2005.
DOI : 10.1007/s11227-005-2340-2

J. Gustedt, E. Jeannot, and M. Quinson, Experimental Methodologies for Large-Scale Systems: A Survey, Parallel Processing Letters, vol.19, issue.03, pp.399-418, 2009.
DOI : 10.1142/S0129626409000304

URL : https://hal.archives-ouvertes.fr/inria-00364180

T. Hagras and J. Janecek, A simple scheduling heuristic for heterogeneous computing environments, Proceedings of the Second International Symposium on Parallel and Distributed Computing, pp.104-110, 2003.
DOI : 10.1109/ISPDC.2003.1267650

M. Hermanns, M. Geimer, F. Wolf, and B. Wylie, Verifying Causality between Distant Performance Phenomena in Large-Scale MPI Applications, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp.78-84, 2009.
DOI : 10.1109/PDP.2009.50

T. Hey, S. Tansley, and K. Tolle, The Fourth Paradigm: Data-Intensive Scientific Discovery, 2009.
DOI : 10.1007/978-3-642-33299-9_1

T. Hoefler, C. Siebert, and A. Lumsdaine, LogGOPSim - Simulating Large-Scale Applications in the LogGOPS Model, Proceedings of the ACM Workshop on Large-Scale System and Application Performance, pp.597-604, 2010.

D. Hull, K. Wolstencroft, R. Stevens, C. Goble, M. Pocock et al., Taverna: a tool for building and running workflows of services, Nucleic Acids Research, vol.34, issue.Web Server, pp.729-732, 2006.
DOI : 10.1093/nar/gkl320

S. Hunold, Low-Cost Tuning of Two-Step Algorithms for Scheduling Mixed-Parallel Applications onto Homogeneous Clusters, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp.253-262, 2010.
DOI : 10.1109/CCGRID.2010.52

S. Hunold, T. Rauber, and G. Rünger, Dynamic scheduling of multi-processor tasks on clusters of clusters, 2007 IEEE International Conference on Cluster Computing, pp.507-514, 2007.
DOI : 10.1109/CLUSTR.2007.4629277

F. Ino, N. Fujimoto, and K. Hagihara, LogGPS: a Parallel Computational Model for Synchronization Analysis, Proceedings of the eighth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp.133-142, 2001.

M. Iverson and F. Özgüner, Hierarchical, competitive scheduling of multiple DAGs in a dynamic heterogeneous environment, Distributed Systems Engineering, vol.6, issue.3, pp.112-120, 1999.
DOI : 10.1088/0967-1846/6/3/303

K. Jansen and H. Zhang, An approximation algorithm for scheduling malleable tasks under general precedence constraints, ACM Transactions on Algorithms, vol.2, issue.3, pp.416-434, 2006.
DOI : 10.1145/1159892.1159899

G. Juve, A. L. Chervenak, E. Deelman, S. Bharathi, G. Mehta et al., Characterizing and profiling scientific workflows, Future Generation Computer Systems, vol.29, issue.3, pp.682-692, 2013.
DOI : 10.1016/j.future.2012.08.015

M. Kamruzzaman, S. Swanson, and D. Tullsen, Inter-core Prefetching for Multicore Processors Using Migrating Helper Threads, Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pp.393-404, 2011.

Y. Kee, H. Casanova, and A. Chien, Realistic Modeling and Synthesis of Resources for Computational Grids, Proceedings of ACM, 2004.

D. Klusáček, L. Matyska, and H. Rudová, Alea - Grid Scheduling Simulation Environment, Proceedings of the 7th International Conference on Parallel Processing and Applied Mathematics (PPAM'07), pp.1029-1038, 2007.
DOI : 10.1007/978-3-540-68111-3_109

A. Knüpfer, H. Brunst, J. Doleschal, M. Jurenz, M. Lieber et al., The Vampir Performance Analysis Tool-Set, Proceedings of the 2nd International Workshop on Parallel Tools for High Performance Computing (HLRS), pp.139-155, 2008.

A. Knüpfer, C. Rössel, D. an Mey, S. Biersdorff, K. Diethelm et al., Score-P: A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir, in H. Brunst, M. S. Müller, W. E. Nagel, and M. M. Resch (eds.), Tools for High Performance Computing 2011, pp.79-91, 2012.

W. Kolberg, P. de Botelho Marcos, J. Anjos, A. Miyazaki, C. Geyer et al., MRSG - A MapReduce simulator over SimGrid, Parallel Computing, vol.39, issue.4-5, pp.233-244, 2013.
DOI : 10.1016/j.parco.2013.02.001

URL : https://hal.archives-ouvertes.fr/hal-00931855

R. Kufrin, Perfsuite: An Accessible, Open Source Performance Analysis Environment for Linux, Proceedings of the 6th International Conference on Linux Clusters: The HPC Revolution 2005 (LCI-05), Chapel Hill, NC, 2005.

M. Lassnig, T. Fahringer, V. Garonne, A. Molfetas, and M. Barisits, A similarity measure for time, frequency, and dependencies in large-scale workloads, Proceedings of the 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC '11), 2011.
DOI : 10.1145/2063384.2063441

E. León, R. Riesen, A. Maccabe, and P. Bridges, Instruction-Level Simulation of a Cluster at Scale, Proceedings of the International Conference for High Performance Computing and Communications (SC), 2009.

R. Lepère, D. Trystram, and G. Woeginger, Approximation Algorithms for Scheduling Malleable Tasks under Precedence Constraints, Proceedings of the 9th Annual European Symposium on Algorithms (ESA), number 2161 in Lecture Notes in Computer Science, pp.146-157, 2001.

R. Lepère, D. Trystram, and G. Woeginger, Approximation Algorithms for Scheduling Malleable Tasks under Precedence Constraints, International Journal of Foundations of Computer Science, vol.13, issue.04, pp.613-627, 2002.
DOI : 10.1142/S0129054102001308

H. Li, D. Groep, and L. Wolters, Workload Characteristics of a Multi-cluster Supercomputer, Proceedings of the 10th workshop on Job Scheduling Strategies for Parallel Processing, pp.176-193, 2005.
DOI : 10.1007/11407522_10

D. Lu and P. Dinda, GridG, ACM SIGMETRICS Performance Evaluation Review, vol.30, issue.4, pp.33-40, 2003.
DOI : 10.1145/773056.773063

M. Malawski, G. Juve, E. Deelman, and J. Nabrzyski, Cost- and Deadline-Constrained Provisioning for Scientific Workflow Ensembles in IaaS Clouds, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC), 2012.

A. Mayer, S. Mcgough, N. Furmento, W. Lee, S. Newhouse et al., ICENI Dataflow and Workflow: Composition and Scheduling in Space and Time, UK e-Science All Hands Meeting, pp.627-634, 2003.

A. Medina, A. Lakhina, I. Matta, and J. Byers, BRITE: an approach to universal topology generation, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2001.
DOI : 10.1109/MASCOT.2001.948886

R. W. Moore and A. Rajasekar, iRODS: Data Sharing Technology Integrating Communities of Practice, Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (IGARSS), pp.1984-1987, 2010.

A. Mu'alem and D. Feitelson, Utilization, predictability, workloads, and user runtime estimates in scheduling the IBM SP2 with backfilling, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.6, pp.529-543, 2001.
DOI : 10.1109/71.932708

G. Muller, Y. Padioleau, J. Lawall, and R. Hansen, Semantic patches considered helpful, ACM SIGOPS Operating Systems Review, vol.40, issue.3, pp.90-92, 2006.
DOI : 10.1145/1151374.1151392

M. Noeth, F. Mueller, M. Schulz, and B. de Supinski, Scalable Compression and Replay of Communication Traces in Massively Parallel Environments, Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium, pp.1-11, 2007.

A. Núñez, J. Fernández, J. Garcia, F. Garcia, and J. Carretero, New Techniques for Simulating High Performance MPI Applications on Large Storage Networks, Journal of Supercomputing, vol.51, issue.1, pp.40-57, 2010.

H. Oh and S. Ha, A static scheduling heuristic for heterogeneous processors, Proceedings of the Second International Euro-Par Conference on Parallel Processing (Euro-Par'96), pp.573-577, 1996.
DOI : 10.1007/BFb0024750

S. Ostermann, R. Prodan, T. Fahringer, A. Iosup, and D. Epema, Trace-Based Characteristics of Grid Workflows, From Grids to Service and Pervasive Computing, pp.191-204, 2008.

B. Penoff, A. Wagner, M. Tüxen, and I. Rüngeler, MPI-NeTSim: A Network Simulation Module for MPI, 2009 15th International Conference on Parallel and Distributed Systems, 2009.
DOI : 10.1109/ICPADS.2009.116

S. Prakash, E. Deelman, and R. Bagrodia, Asynchronous parallel simulation of parallel programs, IEEE Transactions on Software Engineering, vol.26, issue.5, pp.385-400, 2000.
DOI : 10.1109/32.846297

M. Quinson, GRAS: a Research and Development Framework for Grid and P2P Infrastructures, Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Systems, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00108389

A. Radulescu, C. Nicolescu, A. van Gemund, and P. Jonker, CPR: mixed task and data parallel scheduling for distributed systems, Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS 2001), 2001.
DOI : 10.1109/IPDPS.2001.924977

A. Radulescu and A. van Gemund, A low-cost approach towards mixed task and data parallel scheduling, International Conference on Parallel Processing, pp.69-76, 2001.
DOI : 10.1109/ICPP.2001.952048

S. Ramaswamy, E. W. Hodges IV, and P. Banerjee, Compiling MATLAB programs to ScaLAPACK: exploiting task and data parallelism, Proceedings of International Conference on Parallel Processing, pp.613-619, 1996.
DOI : 10.1109/IPPS.1996.508120

P. Ratn, F. Mueller, B. de Supinski, and M. Schulz, Preserving Time in Large-Scale Communication Traces, Proceedings of the 22nd Annual International Conference on Supercomputing, pp.46-55, 2008.

T. Rauber and G. Rünger, Compiler support for task scheduling in hierarchical execution models, Journal of Systems Architecture, vol.45, issue.6-7, pp.483-503, 1999.
DOI : 10.1016/S1383-7621(98)00019-8

R. Reussner, P. Sanders, and J. Träff, SKaMPI: A Comprehensive Benchmark for Public Benchmarking of MPI, Scientific Programming, pp.55-65, 2002.
DOI : 10.1155/2002/202839

R. Sakellariou and H. Zhao, A hybrid heuristic for DAG scheduling on heterogeneous systems, Proceedings of the 18th International Parallel and Distributed Processing Symposium, 2004.
DOI : 10.1109/IPDPS.2004.1303065

E. Santos-Neto, W. Cirne, F. Vilar Brasileiro, and A. Lima, Exploiting Replication and Data Reuse to Efficiently Schedule Data-Intensive Applications on Grids, Proceedings of the 10th International Workshop on Job Scheduling Strategies for Parallel Processing, pp.210-232, 2005.
DOI : 10.1007/11407522_12

J. Schaeffer and A. Gómez Casanova, TReqS: The Tape REQuest Scheduler, Journal of Physics: Conference Series, vol.331, issue.4, 2011.
DOI : 10.1088/1742-6596/331/4/042040

URL : https://hal.archives-ouvertes.fr/hal-00684108

L. Mello Schnorr, G. Huard, and P. Navaux, Triva: Interactive 3D visualization for performance analysis of parallel applications, Future Generation Computer Systems, vol.26, issue.3, pp.348-358, 2010.
DOI : 10.1016/j.future.2009.10.006

L. Mello Schnorr, A. Legrand, and J. Vincent, Detection and Analysis of Resource Usage Anomalies in Large Distributed Systems Through Multi-scale Visualization, Concurrency and Computation: Practice and Experience, 2012.

S. Shende and A. Malony, The Tau Parallel Performance System, International Journal of High Performance Computing Applications, vol.20, issue.2, pp.287-311, 2006.
DOI : 10.1177/1094342006064482

G. Sih and E. Lee, A compile-time scheduling heuristic for interconnection-constrained heterogeneous processor architectures, IEEE Transactions on Parallel and Distributed Systems, vol.4, issue.2, pp.175-187, 1993.
DOI : 10.1109/71.207593

M. Skutella, Approximation Algorithms for the Discrete Time-Cost Tradeoff Problem, Mathematics of Operations Research, vol.23, issue.4, pp.909-929, 1998.
DOI : 10.1287/moor.23.4.909

A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia et al., A Framework for Application Performance Modeling and Prediction, Proceedings of the International Conference for High Performance Computing and Communications (SC), 2002.

F. Song, S. Tomov, and J. Dongarra, Enabling and scaling matrix computations on heterogeneous multi-core and multi-GPU systems, Proceedings of the 26th ACM international conference on Supercomputing, ICS '12, pp.365-376, 2012.
DOI : 10.1145/2304576.2304625

H. J. Song, X. Liu, D. Jakobsen, R. Bhagwan, X. Zhang et al., The MicroGrid: a scientific tool for modeling computational grids, Proceedings of the ACM/IEEE conference on Supercomputing, 2000.

B. Stein, J. Chassin de Kergommeaux, and P. Bernard, Pajé, an Interactive Visualization Tool for Tuning Multi-Threaded Parallel Applications, Parallel Computing, vol.26, pp.1253-1274, 2000.

T. Szepieniec and M. Bubak, Investigation of the DAG eligible jobs maximization algorithm in a grid, 2008 9th IEEE/ACM International Conference on Grid Computing, pp.340-345, 2008.
DOI : 10.1109/GRID.2008.4662819

D. Thain, T. Tannenbaum, and M. Livny, Distributed computing in practice: the Condor experience, Concurrency - Practice and Experience, pp.323-356, 2005.

M. Tikir, M. Laurenzano, L. Carrington, and A. Snavely, PSINS: An Open Source Event Tracer and Execution Simulator for MPI Applications, Proceedings of the 15th International EuroPar Conference, pp.135-148, 2009.
DOI : 10.1007/BFb0052218

T. Tobita and H. Kasahara, A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.5, issue.5, pp.379-394, 2002.
DOI : 10.1002/jos.116

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

A. Vahdat, K. Yocum, K. Walsh, P. Mahadevan, D. Kostic et al., Scalability and Accuracy in a Large-Scale Network Emulator, Proceedings of the 5th Symposium on Operating Systems Design and Implementation (OSDI), 2002.

W. van der Aalst, A. Barros, A. ter Hofstede, and B. Kiepuszewski, Advanced Workflow Patterns, Proceedings of the 7th International Conference on Cooperative Information Systems (CoopIS), pp.18-29, 2000.
DOI : 10.1007/10722620_2

N. Vydyanathan, S. Krishnamoorthy, G. M. Sabin, Ü. V. Çatalyürek, T. M. Kurç et al., An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications, 2006 International Conference on Parallel Processing (ICPP'06), pp.443-450, 2006.
DOI : 10.1109/ICPP.2006.22

N. Vydyanathan, S. Krishnamoorthy, G. M. Sabin, Ü. V. Çatalyürek, T. M. Kurç et al., An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.8, pp.1158-1172, 2009.
DOI : 10.1109/TPDS.2008.219

L. Wang, G. von Laszewski, J. Dayal, and F. Wang, Towards Energy Aware Scheduling for Precedence Constrained Parallel Tasks in a Cluster with DVFS, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp.368-377, 2010.
DOI : 10.1109/CCGRID.2010.19

B. Waxman, Routing of multipoint connections, IEEE Journal on Selected Areas in Communications, vol.6, issue.9, pp.1617-1622, 1988.
DOI : 10.1109/49.12889

A. Yoo, M. Jette, and M. Grondona, SLURM: Simple Linux Utility for Resource Management, Proceedings of the 9th International Workshop on Job Scheduling Strategies for Parallel Processing, pp.44-60, 2003.
DOI : 10.1007/10968987_3

E. Zegura, K. Calvert, and M. Donahoo, A Quantitative Comparison of Graph-based Models for Internet Topology, IEEE/ACM Transactions on Networking, vol.5, issue.6, 1997.

J. Zhai, W. Chen, and W. Zheng, PHANTOM: Predicting Performance of Parallel Applications on Large-Scale Parallel Machines Using a Single Node, Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.305-314, 2010.

H. Zhao and R. Sakellariou, Scheduling Multiple DAGs onto Heterogeneous Systems, Proceedings of the 15th Heterogeneous Computing Workshop (HCW), 2006.

G. Zheng, G. Kakulapati, and L. Kale, BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines, Proceedings of the 18th International Parallel and Distributed Processing Symposium, 2004.

G. Zheng, T. Wilmarth, P. Jagadishprasad, and L. Kalé, Simulation-Based Performance Prediction for Large Parallel Machines, International Journal of Parallel Programming, vol.15, issue.2-3, pp.183-207, 2005.
DOI : 10.1007/s10766-005-3582-6