Skip to Main content Skip to Navigation

Scheduling tasks sharing files from distributed repositories

Arnaud Giersch 1 Yves Robert 2 Frédéric Vivien 2 
1 REMAP - Regularity and massive parallel computing
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : This paper is devoted to scheduling a large collection of independent tasks onto a large distributed heterogeneous platform, which is composed of a set of servers. Each server is a processor cluster equipped with a file repository. The tasks to be scheduled depend upon (input) files which initially reside on the server repositories. A given file may well be shared by several tasks. For each task, the problem is to decide on which server to execute it, and to transfer the required files (those which the task depends upon) to that server repository. The objective is to find a task allocation, and to schedule the induced communications, so as to minimize the total execution time. The contribution of this paper is twofold. On the theoretical side, we establish complexity results that assess the difficulty of the problem. On the practical side, we design several new heuristics, including an extension of the heuristic to the decentralized framework, and several lower cost heuristics, which we compare through extensive simulations.
Document type :
Complete list of metadata
Contributor : Rapport De Recherche Inria Connect in order to contact the contributor
Submitted on : Tuesday, May 23, 2006 - 6:09:49 PM
Last modification on : Wednesday, March 2, 2022 - 1:28:05 PM


  • HAL Id : inria-00071602, version 1



Arnaud Giersch, Yves Robert, Frédéric Vivien. Scheduling tasks sharing files from distributed repositories. [Research Report] RR-4976, LIP RR-2003-49, INRIA, LIP. 2003. ⟨inria-00071602⟩



Record views


Files downloads