Computing the throughput of probabilistic and replicated streaming applications

In this paper, we investigate how to compute the throughput of probabilistic and replicated streaming applications. We are given (i) a streaming application whose dependence graph is a linear chain; (ii) a one-to-many mapping of the application onto a fully heterogeneous target, where a processor is assigned at most one application stage, but where a stage can be replicated onto a set of processors; and (iii) a set of I.I.D. (Independent and Identically-Distributed) variables to model each computation and communication time in the mapping. How can we compute the throughput of the application, i.e., the rate at which data sets can be processed? We consider two execution models, the STRICT model where the actions of each processor are sequentialized, and the OVERLAP model where a processor can compute and communicate in parallel. The problem is easy when application stages are not replicated, i.e., assigned to a single processor: in that case the throughput is dictated by the critical hardware resource. However, when stages are replicated, i.e., assigned to several processors, the problem becomes surprisingly complicated: even in the deterministic case, the optimal throughput may be lower than the smallest internal resource throughput. To the best of our knowledge, the problem has never been considered in the probabilistic case. The first main contribution of the paper is to provide a general method to compute the throughput when mapping parameters are constant or follow I.I.D. exponential laws. The second main contribution is to provide bounds for the throughput when stage parameters are arbitrary I.I.D. and N.B.U.E. (New Better than Used in Expectation) variables: the throughput is bounded from below by the exponential case and bounded from above by the deterministic case.

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

RR-7510.pdf (742.32 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Anne Benoit : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00555890

Soumis le : vendredi 14 janvier 2011-15:45:31

Dernière modification le : jeudi 4 avril 2024-21:15:32

Archivage à long terme le : mardi 6 novembre 2012-11:35:25

Dates et versions

inria-00555890 , version 1 (14-01-2011)

Identifiants

HAL Id : inria-00555890 , version 1

Citer

Anne Benoit, Matthieu Gallet, Bruno Gaujal, Yves Robert. Computing the throughput of probabilistic and replicated streaming applications. [Research Report] RR-7510, INRIA. 2011, pp.33. ⟨inria-00555890⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-LYON UNIV-RENNES1 UGA CNRS INRIA UNIV-LYON1 IRISA INRIA-RRRT LIG INRIA2 LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UDL ANR UR1-MATH-NUM LIG_SIDCH

354 Consultations

146 Téléchargements