hal-00762008, version 1
Non-Linear Divisible Loads: There is No Free Lunch
IPDPS 2013 - 27th IEEE International Parallel & Distributed Processing Symposium (2013) 20
Abstract: Divisible Load Theory (DLT) has received a lot of attention in the past decade. A divisible load is a perfect parallel task, that can be split arbitrarily and executed in parallel on a set of possibly heterogeneous resources. The success of DLT is strongly related to the existence of many optimal resource allocation and scheduling algorithms, what strongly differs from general scheduling theory. Moreover, recently, close relationships have been underlined between DLT, that provides a fruitful theoretical framework for scheduling jobs on heterogeneous platforms, and \mapreduce, that provides a simple and efficient programming framework to deploy applications on large scale distributed platforms. The success of both have suggested to extend their framework to non-linear complexity tasks. In this paper, we show that both DLT and \mapreduce are better suited to workloads with linear complexity. In particular, we prove that divisible load theory cannot directly be applied to quadratic workloads, such as it has been proposed recently. We precisely state the limits for classical DLT studies and we review and propose solutions based on a careful preparation of the dataset and clever data partitioning algorithms. In particular, through simulations, we show the possible impact of this approach on the volume of communications generated by \mapreduce, in the context of Matrix Multiplication and Outer Product algorithms.
- a – CNRS
- 1:
- INRIA – CNRS : UMR5800 – Université Sciences et Technologies - Bordeaux I – École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)
- 2:
- CNRS : UMR5800 – Université Sciences et Technologies - Bordeaux I – École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB) – Université Victor Segalen - Bordeaux II
- 3:
- Université de Lyon – CNRS : UMR5668 – INRIA – École Normale Supérieure - Lyon – Université Claude Bernard - Lyon I
- 4:
- INRIA – École Normale Supérieure - Lyon – Laboratoire d'informatique du Parallélisme – CNRS : UMR5668
- Domain : Computer Science/Distributed, Parallel, and Cluster Computing
- Internal note : RR-8170
- Comment : accepted for publication in IPDPS 2013
- hal-00762008, version 1
- http://hal.inria.fr/hal-00762008
- oai:hal.inria.fr:hal-00762008
- From:
- Submitted on: Thursday, 6 December 2012 12:13:32
- Updated on: Monday, 18 February 2013 19:19:28





Associated documents
Export