Decomposing Meeting Graph Circuits to Minimise Kernel Loop Unrolling

Mounira Bachir 1 Sid Touati 2 Albert Cohen 1
1 ALCHEMY - Architectures, Languages and Compilers to Harness the End of Moore Years
LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France, CNRS - Centre National de la Recherche Scientifique : UMR8623
Abstract : This article studies an important open problem in backend compilation regarding loop unrolling after periodic register allocation. Although software pipelining is a powerful technique to extract fine-grain parallelism, variables can stay alive across more than one kernel iteration, which is challenging for code generation. The classical software solution that does not alter the computation throughput consists in unrolling the loop a posteriori (13; 12). However, the resulting unrolling degree is often unacceptable and may reach absurd levels. Alternatively, loop unrolling can be avoided thanks to software register renaming. This is achieved through the insertion of move operations. However, inserting those operations may increase the initiation interval (II) and nullifies the benefits of software pipelining itself. We propose in this article a new technique to minimise the loop unrolling degree generated after periodic register allocation. In fact, this technique consists on decomposing the generated meeting graph circuits by inserting move instructions without compromising the throughput benefits of software pipelining. The different experiments showed that the execution time is acceptable and good results can be produced when we have many functional units which can execute move operations.
Type de document :
Communication dans un congrès
9th Workshop on Optimizations for DSP and Embedded Systems (ODES-9), Apr 2011, Chamonix, France. pp.8, 2011
Liste complète des métadonnées

Littérature citée [16 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00637938
Contributeur : Sid Touati <>
Soumis le : jeudi 3 novembre 2011 - 12:14:23
Dernière modification le : jeudi 5 avril 2018 - 12:30:12
Document(s) archivé(s) le : samedi 4 février 2012 - 02:25:17

Fichier

Decomposing_Meeting.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00637938, version 1

Collections

Citation

Mounira Bachir, Sid Touati, Albert Cohen. Decomposing Meeting Graph Circuits to Minimise Kernel Loop Unrolling. 9th Workshop on Optimizations for DSP and Embedded Systems (ODES-9), Apr 2011, Chamonix, France. pp.8, 2011. 〈inria-00637938〉

Partager

Métriques

Consultations de la notice

332

Téléchargements de fichiers

106