Cost-Effective Speculative Scheduling in High Performance Processors

Arthur Perais; André Seznec; Pierre Michaud; Andreas Sembrant; Erik Hagersten

doi:10.1145/2749469.2749470

Conference paper, Year: 2015

Arthur Perais

Role: Author
PersonId : 743539
IdHAL : arthur-perais
IdRef : 19174543X

Amdahl's Law is Forever

André Seznec

Role: Author
PersonId : 13729
IdHAL : andre-seznec
ORCID : 0000-0002-3058-6503
IdRef : 033236402

Amdahl's Law is Forever

Pierre Michaud

Role: Author
PersonId : 738135
IdHAL : pmichaud

Amdahl's Law is Forever

Andreas Sembrant

Role: Author
PersonId : 969684

Department of Information Technology

Erik Hagersten

Role: Author
PersonId : 969685

Department of Information Technology

Abstract

To maximize performance, out-of-order execution processors sometimes issue instructions without any guarantee that operands will be available in time; e.g., loads are typically assumed to hit in the L1 cache, and dependent instructions are issued assuming an L1 hit. This form of speculation, which we refer to as speculative scheduling, has been used for two decades in real processors but has received little attention from the research community. In particular, as pipeline depth grows and the distance between the Issue and the Execute stages increases, it becomes critical to issue dependents on variable-latency instructions as soon as possible, rather than to wait for the actual cycle at which the result becomes available. Unfortunately, due to the uncertain nature of speculative scheduling, the scheduler may wrongly issue an instruction that will not have its source(s) on the bypass network when it reaches the Execute stage. This instruction must then be canceled and replayed, which can impair performance and increase energy consumption. In this work, we do not present a new replay mechanism. Rather, we focus on ways to reduce the number of replays that are agnostic of the replay scheme. First, we propose an easily implementable, low-cost solution to reduce the number of replays caused by L1 bank conflicts. Schedule Shifting always assumes that, given a dual-load issue capacity, the second load issued in a given cycle will be delayed because of a bank conflict. Its dependents are thus always issued with a corresponding delay. Second, we improve on existing L1 hit/miss prediction schemes by taking into account instruction criticality. That is, for some criterion of criticality and for loads whose hit/miss behavior is hard to predict, we show that it is more cost-effective to stall dependents if the load is not predicted critical.
In total, in our experiments assuming a 4-cycle issue-to-execute delay, we found that the vast majority of instruction replays due to L1 data cache bank conflicts (78.0%) and L1 hit mispredictions (96.5%) can be avoided, leading to a 3.4% performance gain and a 13.4% decrease in the number of issued instructions over a baseline speculative scheduling scheme.
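
The two policies described in the abstract can be illustrated with a toy decision function. This is only a minimal sketch, not the paper's hardware mechanism: the names `Load` and `dependent_issue_cycle`, the latency constants, and the choice to stall dependents of a confidently predicted miss are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical constants, chosen arbitrarily for the illustration.
L1_HIT_LATENCY = 3        # assumed cycles from load issue to dependent wakeup on a hit
BANK_CONFLICT_DELAY = 1   # assumed extra cycle charged to the second load of a cycle


@dataclass
class Load:
    issue_cycle: int      # cycle at which the load itself is issued
    slot: int             # 0 = first load issued this cycle, 1 = second
    predicted_hit: bool   # output of a hit/miss predictor (hypothetical)
    confident: bool       # False when hit/miss behavior is hard to predict
    critical: bool        # output of some criticality criterion (hypothetical)


def dependent_issue_cycle(load: Load) -> Optional[int]:
    """Return the cycle at which dependents are speculatively woken up,
    or None if dependents should stall until the load outcome is known."""
    # Assumption: a confidently predicted miss never wakes dependents early.
    if load.confident and not load.predicted_hit:
        return None
    # Criticality filter: for hard-to-predict loads, speculate only if critical.
    if not load.confident and not load.critical:
        return None
    # Schedule Shifting: always assume the second load of a cycle suffers a
    # bank conflict, so its dependents are issued with a corresponding delay.
    shift = BANK_CONFLICT_DELAY if load.slot == 1 else 0
    return load.issue_cycle + L1_HIT_LATENCY + shift
```

In this sketch, the second load of a dual-issue cycle always pays the shift, even when no bank conflict actually occurs; that trades a small fixed delay for fewer replays, which is the trade-off the abstract describes.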

Keywords

Microarchitecture; Speculative Scheduling; Banking; Replay; Selective Replay

Domains

Hardware Architecture [cs.AR]

Main file

ISCA'15_Scheduling.pdf (2.32 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01193233

Submitted on: Friday, September 4, 2015, 16:50:58

Last modified on: Friday, March 24, 2023, 14:53:01

Long-term archiving on: Saturday, December 5, 2015, 13:49:09

Dates and versions

hal-01193233 , version 1 (04-09-2015)

Identifiers

HAL Id : hal-01193233 , version 1
DOI : 10.1145/2749469.2749470

Cite

Arthur Perais, André Seznec, Pierre Michaud, Andreas Sembrant, Erik Hagersten. Cost-Effective Speculative Scheduling in High Performance Processors. International Symposium on Computer Architecture, ACM/IEEE, Jun 2015, Portland, United States. pp.247-259, ⟨10.1145/2749469.2749470⟩. ⟨hal-01193233⟩

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC IRISA-D3 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

443 views

1087 downloads