Mostra el registre d'ítem simple

dc.contributor.authorShan, Junnan
dc.contributor.authorLazarescu, Mihai T.
dc.contributor.authorCortadella, Jordi
dc.contributor.authorLavagno, Luciano
dc.contributor.authorCasu, Mario R.
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Ciències de la Computació
dc.date.accessioned2021-05-31T10:04:25Z
dc.date.available2021-05-31T10:04:25Z
dc.date.issued2022-04
dc.identifier.citationShan, J. [et al.]. Fast energy-optimal multi-kernel DNN-like application allocation on multi-FPGA platforms. "IEEE transactions on computer-aided design of integrated circuits and systems", Abril 2022, vol. 41, núm. 4, p. 1186-1190.
dc.identifier.issn0278-0070
dc.identifier.urihttp://hdl.handle.net/2117/346425
dc.description.abstractPlatforms with multiple Field Programmable Gate Arrays (FPGAs), such as Amazon Web Services (AWS) F1 instances, can efficiently accelerate multi-kernel pipelined applications, e.g., Convolutional Neural Networks for machine vision tasks or transformer networks for Natural Language Processing tasks. To reduce energy consumption when the FPGAs are underutilized, we propose a model to (1) find off-line the minimum-power solution for given throughput constraints, and (2) dynamically reprogram the FPGA at runtime (which is complementary to dynamic voltage and frequency scaling) to match best the workloads when they change. The off-line optimization model can be solved using a Mixed-Integer Non-Linear Programming (MINLP) solver, but it can be very slow. Hence, we provide two heuristic optimization methods that improve result quality within a bounded time. We use several very large designs to demonstrate that both heuristics obtain comparable results to MINLP, when it can find the best solution, and they obtain much better results than MINLP, when it cannot find the optimum within a bounded amount of time. The heuristic methods can also be thousands of times faster than the MINLP solver.
dc.format.extent5 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Informàtica teòrica
dc.subject.lcshField programmable gate arrays
dc.subject.lcshEnergy consumption
dc.subject.otherCNN
dc.subject.otherNLP
dc.subject.otherTransformer
dc.subject.otherMulti-FPGA
dc.subject.otherAllocation
dc.subject.otherOptimization
dc.subject.otherHeuristic
dc.subject.otherAWS
dc.titleFast energy-optimal multi-kernel DNN-like application allocation on multi-FPGA platforms
dc.typeArticle
dc.subject.lemacMatrius de portes programables per l'usuari
dc.subject.lemacEnergia -- Consum
dc.contributor.groupUniversitat Politècnica de Catalunya. ALBCOM - Algorismia, Bioinformàtica, Complexitat i Mètodes Formals
dc.identifier.doi10.1109/TCAD.2021.3076958
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttps://ieeexplore.ieee.org/document/9419915
dc.rights.accessOpen Access
local.identifier.drac31307389
dc.description.versionPostprint (author's final draft)
local.citation.authorShan, J.; Lazarescu, M.; Cortadella, J.; Lavagno, L.; Casu, M.
local.citation.publicationNameIEEE transactions on computer-aided design of integrated circuits and systems
local.citation.volume41
local.citation.number4
local.citation.startingPage1186
local.citation.endingPage1190


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple