Skip to Main content Skip to Navigation
Reports

Algorithm Level Timing Speculation for Convolutional Neural Network Accelerators

Abstract : In this paper, we propose a technique for improving the efficiency of hardware accelerators based on timing speculation (overclocking) and fault tolerance. We augment the accelerator with a lightweight error detection mechanism to protect against timing errors, enabling aggressive timing speculation. We demonstrate the validity of our approach for the convolution layers in Convolutional Neural Networks (CNN). We present an implementation of a fault-tolerant CNN accelerator combined with the lightweight error detection for convolution layers. The error detection mechanism we have developed works at the algorithm level, based on algebraic properties of the computation, allowing the full implementation to be realized using High-Level Synthesis tools. We use a set of Zybo boards to experimentally demonstrate that overclocking boosts the frequency by 17-36% with low chances of error, and that the infrequent errors can be detected with a negligible overhead (only 1000 LUTs).
Document type :
Reports
Complete list of metadatas

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-01811231
Contributor : Thibaut Marty <>
Submitted on : Friday, June 8, 2018 - 4:38:35 PM
Last modification on : Friday, July 10, 2020 - 4:01:50 PM
Document(s) archivé(s) le : Sunday, September 9, 2018 - 6:58:03 PM

File

RT-0500.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01811231, version 1

Citation

Thibaut Marty, Tomofumi Yuki, Steven Derrien. Algorithm Level Timing Speculation for Convolutional Neural Network Accelerators. [Technical Report] RT-0500, Univ Rennes, Inria, CNRS, IRISA, France. 2018, pp.1-17. ⟨hal-01811231⟩

Share

Metrics

Record views

710

Files downloads

325