Generalized cache tiling for dataflow programs - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Generalized cache tiling for dataflow programs

Résumé

The dataflow programming paradigm has facilitated the expression of a great number of algorithmic applications on embedded platforms in a wide variety of applicative domains. Whether it is a Domain Specific Language (DSL) or a more generalistic one, the dataflow paradigm allows to intuitively state the successive steps of an algorithm and link them through data communications. The optimization of cache-memory in this context has been a subject of interest since the early '90s as the reuse and communication of data between the agents of a dataflow program is a key factor in achieving a high-performance implementation within the reduced limits of embedded architectures. In order to improve data reuse among the dataflow agents we propose a modelisation of the communications and data usage within a dataflow program. Aside from providing an estimate of the amount of cache-misses that a given scheduling generates, this model allows us to specify the associated optimization problem in a manner that is identical to loop-nest tiling. Improving on the existing state-of-the-art methods we extend our tiling technique to include non-uniform dependencies on one of the dimensions of the iteration space. When applying the proposed technique to dataflow programs expressed within the StreamIt framework we are able to showcase significant reductions in the number of cache-misses for a majority of test-cases when compared to existing optimizations.
Fichier non déposé

Dates et versions

hal-01336172 , version 1 (22-06-2016)

Identifiants

Citer

Łukasz Domagała, Duco van Amstel, Fabrice Rastello. Generalized cache tiling for dataflow programs. Conference on Languages, Compilers, Tools, and Theory for Embedded Systems, ACM SIGPLAN/SIGBED, Jun 2016, Santa Barbara, United States. pp.10, ⟨10.1145/2907950.2907960⟩. ⟨hal-01336172⟩
228 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More