Conference paper — Year: 2022

Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs

Abstract

Graphics Processing Units (GPUs) offer the possibility to execute floating-point operations (FLOP) with mixed precisions such as INT8, FP16, bfloat16, FP32, and FP64. For Deep Neural Networks (DNNs), a reduced precision is likely to lower execution time and power consumption, as it requires a smaller hardware area and fewer clock cycles per instruction than the standard FP32 and FP64 precisions. As less area is needed for reduced precision, the circuit error rate is also expected to be lower [1]. NVIDIA GPUs also have tensor cores that perform matrix multiplication in hardware. The tensor cores are capable of performing a 4×4 FP16 matrix multiplication in one clock cycle [2]. The tensor cores can deliver up to 9× higher performance than the software implementation of matrix multiplication (a sequence of sums and multiplications) on GPUs, and up to 47× higher than a CPU-based system [2].
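As an illustration of the mixed-precision tensor-core path mentioned above, the sketch below is a minimal CUDA kernel using the WMMA API (mma.h): a single warp multiplies 16×16 FP16 tiles and accumulates the product in FP32 on the tensor cores (the 4×4×4 per-cycle hardware operation is exposed to software at this larger fragment granularity). It is a generic example, not code from the paper, and the kernel name is hypothetical.

#include <mma.h>
#include <cuda_fp16.h>
using namespace nvcuda;

// Hypothetical example kernel; launch with one warp (one block of 32 threads).
// One warp loads a 16x16 FP16 tile of A and of B, multiplies them on the
// tensor cores, and accumulates the result in FP32 (mixed precision).
__global__ void tensor_core_tile_mma(const half *a, const half *b, float *c) {
    wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::col_major> b_frag;
    wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;

    wmma::fill_fragment(c_frag, 0.0f);        // C := 0
    wmma::load_matrix_sync(a_frag, a, 16);    // leading dimension = 16
    wmma::load_matrix_sync(b_frag, b, 16);
    wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);   // C := A*B + C on tensor cores
    wmma::store_matrix_sync(c, c_frag, 16, wmma::mem_row_major);
}

Launched as tensor_core_tile_mma<<<1, 32>>>(dA, dB, dC) on device pointers to 16×16 tiles, this performs the FP16-input, FP32-accumulate multiply that the hardware executes 4×4 at a time.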

Dates and versions

hal-03903347 , version 1 (16-12-2022)

License

Attribution (CC BY)

Cite

Fernando Fernandes dos Santos, Paolo Rech, Angeliki Kritikakou, Olivier Sentieys. Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs. ISVLSI 2022 - IEEE Computer Society Annual Symposium on VLSI, Jul 2022, Nicosia, Italy. pp.327-327, ⟨10.1109/ISVLSI54635.2022.00071⟩. ⟨hal-03903347⟩