A correctly rounded implementationof the exponential function on the Intel Itanium architecture

Christoph Quirin Lauter

Rapport (Rapport De Recherche) Année : 2003

A correctly rounded implementationof the exponential function on the Intel Itanium architecture

(1)

Christoph Quirin Lauter

Fonction : Auteur
PersonId : 13523
IdHAL : christoph-lauter
ORCID : 0000-0001-7335-8220
IdRef : 13979459X

Computer arithmetic

Résumé

This article presents an efficient implementation of a correctly rounded exponential function in double precision on the Intel Itanium processor family. This work combines advanced processor features (like the double-extended precision fused multiply-and-add units of the Itanium processors) with recent research results giving the worst-case precision needed for correctly rounding the exponential function. We give and prove an algorithm which returns a correctly rounded result (in any of the four IEEE-754 rounding modes) within 172 machine cycles on the 2 processor. This is about four times slower than the less accurate function present in the standard Intel mathematical library. The evaluation is performed in one phase only and is therefore fast even in the worst case, contrary to other implementations which use a multilevel strategy: We show that the worst-case required precision of 157 bits can always be stored in the sum of two double-extended floating-point numbers. Another algorithm is given with a 92 cycles execution time, but its proof has to be formally completed.

Cet article présente une implémentation efficace de la fonction exponentielle en double précision avec arrondi correct sur famille de processeurs Intel Itanium. Ce travail combine les caractéristiques avancées de ces processeurs ( en particulier la présence d'unités de multiplication-accumulation en double précisions étendue) avec les résultats de travaux récents surs les pires cas d'arrondi correct pour l'exponentielle. Nous décrivons et prouvons un algorithme qui calcule le résultat arrondi correctement ( dans l'un quelconque des quatre modes d'arrondis spécifiés par la norme IEEE-754) en 172 cycles sur l'Itanium 2 d'Intel, soit environ quatre fois lus lentement que la fonction exponentielle standard, moins précise. L'évaluation est réalisée en une seule étape, et est donc rapide également au pire cas, au contraire d'autres implémentations qui utilisent une stratégie multi-niveaux : nous montrons que les résultats intermédiaires ( sui ont besoin d'une précision jusqu'à 157 bits) peuvent toujours être représentés somme la somme de deux nombres flottants en double précision étendue. Nous présentons également un algorithme amélioré qui réalise le calcul en 92 cycles, mis sa preuve reste à formaliser complètement

Mots clés

ITANIUM EXPONENTIAL CORRECT ROUNDING IEEE-754

ARRONDI CORERCT EXPONENTIELLE

Domaines

Autre [cs.OH]

Fichier principal

RR-5024.pdf (510.68 Ko)

RR2003-54.pdf (682.82 Ko)

Rapport De Recherche Inria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00071560

Soumis le : mardi 23 mai 2006-17:57:13

Dernière modification le : jeudi 11 mai 2023-11:56:10

Dates et versions

inria-00071560 , version 1 (23-05-2006)

Identifiants

HAL Id : inria-00071560 , version 1

Citer

Christoph Quirin Lauter. A correctly rounded implementationof the exponential function on the Intel Itanium architecture. [Research Report] RR-5024, LIP RR 2003-54, INRIA, LIP. 2003. ⟨inria-00071560⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-LYON CNRS INRIA UNIV-LYON1 INRIA-RRRT INRIA2 LARA UDL

144 Consultations

624 Téléchargements

A correctly rounded implementationof the exponential function on the Intel Itanium architecture

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager