An experimental study of reduced-voltage operation in modern FPGAs for neural network acceleration

Salami, Behzad; Onural, Erhan Baturay; Yuksel, Ismail Emir; Koc, Fahrettin; Ergin, Oguz; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Sarbazi-Azad, Hamid; Mutlu, Onur

doi:10.1109/DSN48063.2020.00032

dc.contributor.author	Salami, Behzad
dc.contributor.author	Onural, Erhan Baturay
dc.contributor.author	Yuksel, Ismail Emir
dc.contributor.author	Koc, Fahrettin
dc.contributor.author	Ergin, Oguz
dc.contributor.author	Cristal Kestelman, Adrián
dc.contributor.author	Unsal, Osman Sabri
dc.contributor.author	Sarbazi-Azad, Hamid
dc.contributor.author	Mutlu, Onur
dc.contributor.other	Universitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors
dc.contributor.other	Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.other	Barcelona Supercomputing Center
dc.date.accessioned	2020-10-21T11:39:54Z
dc.date.available	2020-10-21T11:39:54Z
dc.date.issued	2020
dc.identifier.citation	Salami, B. [et al.]. An experimental study of reduced-voltage operation in modern FPGAs for neural network acceleration. In: Annual IEEE/IFIP International Conference on Dependable Systems and Networks. "50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks: 29 June-2 July 2020, Valencia, Spain: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2020, p. 138-149. ISBN 978-1-7281-5809-9. DOI 10.1109/DSN48063.2020.00032.
dc.identifier.isbn	978-1-7281-5809-9
dc.identifier.other	https://arxiv.org/abs/2005.03451
dc.identifier.uri	http://hdl.handle.net/2117/330567
dc.description.abstract	We empirically evaluate an undervolting technique, i.e., underscaling the circuit supply voltage below the nominal level, to improve the power-efficiency of Convolutional Neural Network (CNN) accelerators mapped to Field Programmable Gate Arrays (FPGAs). Undervolting below a safe voltage level can lead to timing faults due to excessive circuit latency increase. We evaluate the reliability-power trade-off for such accelerators. Specifically, we experimentally study the reduced-voltage operation of multiple components of real FPGAs, characterize the corresponding reliability behavior of CNN accelerators, propose techniques to minimize the drawbacks of reduced-voltage operation, and combine undervolting with architectural CNN optimization techniques, i.e., quantization and pruning. We investigate the effect of environmental temperature on the reliability-power trade-off of such accelerators. We perform experiments on three identical samples of modern Xilinx ZCU102 FPGA platforms with five state-of-the-art image classification CNN benchmarks. This approach allows us to study the effects of our undervolting technique for both software and hardware variability. We achieve more than 3X power-efficiency (GOPs/W) gain via undervolting. 2.6X of this gain is the result of eliminating the voltage guardband region, i.e., the safe voltage region below the nominal level that is set by the FPGA vendor to ensure correct functionality in worst-case environmental and circuit conditions. 43% of the power-efficiency gain is due to further undervolting below the guardband, which comes at the cost of accuracy loss in the CNN accelerator. We evaluate an effective frequency underscaling technique that prevents this accuracy loss, and find that it reduces the power-efficiency gain from 43% to 25%.
dc.description.sponsorship	The work done for this paper was partially supported by a HiPEAC Collaboration Grant funded by the H2020 HiPEAC Project under grant agreement No. 779656. The research leading to these results has received funding from the European Union’s Horizon 2020 Programme under the LEGaTO Project (www.legato-project.eu), grant agreement No. 780681.
dc.format.extent	12 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	UPC subject areas::Telecommunications engineering::Telematics and computer networks
dc.subject.lcsh	Field programmable gate arrays
dc.subject.lcsh	Neural networks (Computer science)
dc.subject.other	Reliability
dc.subject.other	Circuit faults
dc.subject.other	Power demand
dc.subject.other	Quantization (signal)
dc.subject.other	Hardware
dc.subject.other	Training
dc.title	An experimental study of reduced-voltage operation in modern FPGAs for neural network acceleration
dc.type	Conference report
dc.subject.lemac	Matrius de portes programables per l'usuari
dc.subject.lemac	Xarxes neuronals (Informàtica)
dc.contributor.group	Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi	10.1109/DSN48063.2020.00032
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	https://ieeexplore.ieee.org/document/9153393
dc.rights.access	Open Access
local.identifier.drac	29342164
dc.description.version	Postprint (author's final draft)
dc.relation.projectid	info:eu-repo/grantAgreement/EC/H2020/779656/EU/High Performance and Embedded Architecture and Compilation/HiPEAC
dc.relation.projectid	info:eu-repo/grantAgreement/EC/H2020/780681/EU/Low Energy Toolset for Heterogeneous Computing/LEGaTO
local.citation.author	Salami, B.; Onural, E.; Yuksel, I.; Koc, F.; Ergin, O.; Cristal, A.; Unsal, O.; Sarbazi-Azad, H.; Mutlu, O.
local.citation.contributor	Annual IEEE/IFIP International Conference on Dependable Systems and Networks
local.citation.publicationName	50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks: 29 June-2 July 2020, Valencia, Spain: proceedings
local.citation.startingPage	138
local.citation.endingPage	149
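
The gain figures quoted in the abstract can be checked with a little arithmetic. The sketch below assumes that the 2.6X guardband gain and the additional 43% (or 25% with frequency underscaling) compose multiplicatively; the abstract does not state this explicitly, so treat the composition rule and the function name `combined_gain` as illustrative assumptions rather than the authors' method.

```python
# Sketch: combine the power-efficiency gains reported in the abstract.
# ASSUMPTION: the guardband gain and the further below-guardband gain
# multiply. The abstract only reports the individual figures.

def combined_gain(guardband_gain: float, extra_fraction: float) -> float:
    """Return the total gain factor relative to nominal-voltage operation.

    guardband_gain: gain factor from eliminating the guardband (e.g. 2.6)
    extra_fraction: further fractional gain below the guardband (e.g. 0.43)
    """
    return guardband_gain * (1.0 + extra_fraction)


GUARDBAND_GAIN = 2.6          # from the abstract
EXTRA_WITH_ACCURACY_LOSS = 0.43   # below guardband, accuracy loss accepted
EXTRA_WITH_FREQ_SCALING = 0.25    # below guardband, frequency underscaled

if __name__ == "__main__":
    lossy = combined_gain(GUARDBAND_GAIN, EXTRA_WITH_ACCURACY_LOSS)
    safe = combined_gain(GUARDBAND_GAIN, EXTRA_WITH_FREQ_SCALING)
    print(f"below guardband, accuracy loss accepted: {lossy:.2f}X")
    print(f"below guardband, frequency underscaled:  {safe:.2f}X")
```

Under this assumption both configurations stay above the "more than 3X" figure reported in the abstract, which is consistent with that statement but does not prove the multiplicative reading.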

Files in this item

Name: 2005.03451.pdf
Size: 3.092 MB
Format: PDF

This item appears in the following collections

Conference papers/presentations [292]
Conference papers/presentations [574]
Conference papers/presentations [784]
Conference papers/presentations [1,954]