Low-precision logarithmic arithmetic for neural network accelerators

Conference paper, 2022

Abstract

Resource requirements for hardware acceleration of neural network inference are notoriously high, both in terms of computation and storage. One way to mitigate this issue is to quantize parameters and activations. This is usually done by scaling and centering the distributions of weights and activations, on a per-kernel basis, so that a low-precision binary integer representation can be used. This work studies the low-precision logarithmic number system (LNS) as an efficient alternative. Firstly, LNS offers a larger dynamic range than fixed point for the same number of bits. Thus, when quantizing MNIST and CIFAR reference networks without retraining, the smallest format achieving top-1 accuracy comparable to floating point is 1 to 3 bits narrower with LNS than with fixed point. In addition, it is shown that the zero bit of classical LNS is not needed in this context, and that the sign bit can be saved for activations. Secondly, low-precision LNS enables efficient inference architectures where (1) multiplications reduce to additions; (2) the weighted inputs are converted to the classical linear domain, the tables needed for this conversion remaining very small thanks to the low precision; and (3) the conversion of the output activation back to LNS can be merged with an arbitrary activation function. The proposed LNS neuron is detailed, and its FPGA implementation is shown to be smaller and faster than a fixed-point one of comparable accuracy.
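To make the arithmetic concrete, the following is a minimal numerical sketch of the LNS idea summarized above: a product becomes an addition of log-domain codes, and conversion back to the linear domain is cheap because the code has few bits. The names (`to_lns`, `from_lns`), the fractional width `F`, and the rounding scheme are illustrative assumptions, not the paper's actual format or implementation.

```python
# Minimal sketch of low-precision LNS arithmetic (assumed format, not the
# paper's): a value x > 0 is encoded as round-to-nearest log2(x) with F
# fractional bits; a product then becomes an addition of the two codes.
import numpy as np

F = 3  # assumed number of fractional bits in the log-domain code

def to_lns(x):
    """Encode a positive value as a fixed-point base-2 logarithm."""
    return np.round(np.log2(x) * 2**F) / 2**F

def from_lns(e):
    """Decode back to the linear domain. In hardware, since e has very
    few bits, this can be a small table lookup."""
    return 2.0 ** e

# An LNS multiplication reduces to an addition of the log-domain codes.
w, a = 0.75, 1.5
prod_lns = to_lns(w) + to_lns(a)

# Accumulation of weighted inputs happens in the linear domain after the
# table-based conversion; the result is exact up to quantization error.
print(from_lns(prod_lns), w * a)  # ~1.189 vs 1.125 with F = 3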
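```

In an actual accelerator, per the abstract, the final conversion of the accumulated output back to LNS would be fused with the activation function in a single table, so the activation itself costs no extra logic; signs would be handled separately from the log-domain magnitude codes.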
Main file: LNSNeuron_asap2022.pdf (294.01 KB). Source: file(s) produced by the author(s).

Dates and versions

hal-03684585, version 1 (01-06-2022)

Cite

Maxime Christ, Florent de Dinechin, Frédéric Pétrot. Low-precision logarithmic arithmetic for neural network accelerators. 33rd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP 2022), IEEE, July 2022, Gothenburg, Sweden. DOI: 10.1109/ASAP54787.2022.00021. HAL: hal-03684585.