TakTuk, Adaptive Deployment of Remote Executions

Benoît Claudel 1 Guillaume Huard 2, 3 Olivier Richard 4
1 SARDES - System architecture for reflective distributed computing environments
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
3 MOAIS - PrograMming and scheduling design fOr Applications in Interactive Simulation
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
4 MESCAL - Middleware efficiently scalable
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : This article deals with TakTuk, a middleware that deploys efficiently parallel remote executions on large scale grids (thousands of nodes). This tool is mostly intended for interactive use: distributed machines administration and parallel applications development. Thus, it has to minimize the time required to complete the whole deployment process. To achieve this minimization, we propose and validate a remote execution deployment model inspired by the real world behavior of standard remote execution protocols (rsh and ssh). From this model and from existing works in networking, we deduce an optimal deployment algorithm for the homogeneous case. Unfortunately, this optimal algorithm does not translate directly to the heterogeneous case. Therefore, we derive from the theoretical solution a heuristic based on dynamic work-stealing that adapts to heterogeneities (processors, links, load, ...). The underlying principle of this heuristic is the same as the principle of the optimal algorithm: to deploy nodes as soon as possible. Experiments assess TakTuk efficiency and show that TakTuk scales well to thousands of nodes. Compared to similar tools, TakTuk ranks among the best performers while offering more features and versatility. In particular, TakTuk is the only tool really suited to remote executions deployment on grids or more heterogeneous platforms.
Type de document :
Communication dans un congrès
Proceedings of the International Symposium on High Performance Distributed Computing (HPDC), 2009, Munich, Germany. ACM, pp.91-100, 2009, 〈10.1145/1551609.1551629〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00788923
Contributeur : Arnaud Legrand <>
Soumis le : vendredi 15 février 2013 - 13:46:28
Dernière modification le : jeudi 11 janvier 2018 - 06:22:03

Identifiants

Collections

Citation

Benoît Claudel, Guillaume Huard, Olivier Richard. TakTuk, Adaptive Deployment of Remote Executions. Proceedings of the International Symposium on High Performance Distributed Computing (HPDC), 2009, Munich, Germany. ACM, pp.91-100, 2009, 〈10.1145/1551609.1551629〉. 〈hal-00788923〉

Partager

Métriques

Consultations de la notice

234