A Path to Complexity-Effective Wide-Issue Superscalar Processors - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2001

A Path to Complexity-Effective Wide-Issue Superscalar Processors

Résumé

The advance of integration allows implementation of very wide issue superscalar processors on a single chip. Aggressive speculative execution as well as simultaneous multithreading can exploit such wide issue superscalar processors. Unfortunately, with the increase of issue width, processor designers are facing new difficulties to enable high clock frequency and to master silicon area and power consumption. Due to performance issues, when doubling the issue width from 4 to 8 instructions per cycle on a superscalar processor, one has also to double the number of physical registers. Combined with the doubling of the number of register ports, this leads to an eight fold increase of the silicon area devoted to the register file on conventional monolithic register file architecture while the silicon area devoted to functional units only doubles. At the same time, the peak power consumption of the register file also raises quasi-quadratically with the issue width. Moreover, read operations on the register file have to be deeply pipelined. Wake-up logic as well the bypass network in the processor are also becoming limiting factors when the issue width increases. In this paper, we present three mechanisms to reduce the number of read and write ports on every individual physical register in a wide-issue clustered superscalar processor, respectively limited read port arbitration, register write specialization and register read specialization. Then we show that combining register write specialization and register read specializa- tion, one can build a 8-way 4-cluster superscalar processor where each individual physical register is implemented as four identical (2-read, 2-write) registers instead of a single copy (16-read, 8-write) register in conventional designs.This dramatically reduces the silicon area, the peak power consumption and the access time of the register file. As a side effect, the complexities of the bypass network and of the wake-up logic are also significantly reduced. In particular, fast-forwarding is simplified on a 8-way 4-cluster processor. Limited read port arbitration can be used to further reduce the complexity of the register file. Such a complexity reduction can not come for free, but only costs some degrees of freedom on the policy for allocating instructions to clusters and some extra complexity in the register renaming process.
Fichier principal
Vignette du fichier
RR-4242.pdf (328.61 Ko) Télécharger le fichier
Loading...

Dates et versions

inria-00072345 , version 1 (23-05-2006)

Identifiants

  • HAL Id : inria-00072345 , version 1

Citer

André Seznec. A Path to Complexity-Effective Wide-Issue Superscalar Processors. [Research Report] RR-4242, INRIA. 2001. ⟨inria-00072345⟩
205 Consultations
411 Téléchargements

Partager

Gmail Facebook X LinkedIn More