Hamun: an approximate computing method to prolong the lifespan of ReRAM-based accelerators

dc.contributor.authorSabri Abrebekoh, Mohammad
dc.contributor.authorRiera Villanueva, Marc
dc.contributor.authorGonzález Colás, Antonio María
dc.contributor.groupUniversitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.contributor.otherUniversitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2025-06-12T06:51:35Z
dc.date.available2025-06-12T06:51:35Z
dc.date.issued2025-09
dc.description.abstractReRAM-based accelerators exhibit enormous potential to increase computational efficiency for DNN inference tasks, delivering significant performance and energy savings over traditional platforms. By incorporating adaptive scheduling, these accelerators dynamically adjust to DNN requirements, optimizing allocation of constrained hardware resources. However, ReRAM cells have limited endurance cycles due to wear-out from multiple updates for each inference execution, which shortens the lifespan of ReRAM-based accelerators and presents a practical challenge in positioning them as alternatives to conventional platforms like TPUs. Addressing these endurance limitations is essential for making ReRAM-based solutions viable for long-term, high-performance DNN inference. To address the lifespan limitations of ReRAM-based accelerators, we introduce Hamun, an approximate computing method designed to extend the lifespan of ReRAM-based accelerators through a range of optimizations. Hamun incorporates a novel mechanism that detects faulty cells due to wear-out and retires them, avoiding in this way their otherwise adverse impact on DNN accuracy. Moreover, Hamun extends the lifespan of ReRAM-based accelerators by adapting wear-leveling techniques across various abstraction levels of the accelerator and implementing a batch execution scheme to maximize ReRAM cell usage for multiple inferences. Additionally, Hamun introduces a new approximation method that leverages the fault tolerance characteristics of DNNs to delay the retirement of worn-out cells, reducing the performance penalty of retired cells and further extending the accelerator’s lifespan. On average, evaluated on a set of popular DNNs, Hamun demonstrates an improvement in lifespan of 13.2x over a state-of-the-art baseline. The main contributors to this improvement are the fault handling and batch execution schemes, which provide 4.6x and 2.6x lifespan improvements respectively.
dc.description.peerreviewedPeer Reviewed
dc.description.sponsorshipThis work has been supported by the CoCoUnit ERC Advanced Grant of the EU’s Horizon 2020 program (grant No 833057), the Spanish State Research Agency (MCIN/AEI) under grant PID2020-113172RBI00, the Catalan Agency for University and Research (AGAUR) under grant 2021SGR00383, and the ICREA Academia program.
dc.description.versionPostprint (published version)
dc.format.extent13 p.
dc.identifier.citationSabri, M.; Riera, M.; Gonzalez, A. Hamun: an approximate computing method to prolong the lifespan of ReRAM-based accelerators. "Journal of systems architecture", Setembre 2025, vol. 166, article 103444.
dc.identifier.doi10.1016/j.sysarc.2025.103444
dc.identifier.issn1873-6165
dc.identifier.urihttps://hdl.handle.net/2117/431493
dc.language.isoeng
dc.publisherElsevier
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H2020/833057/EU/CoCoUnit: An Energy-Efficient Processing Unit for Cognitive Computing/CoCoUnit
dc.relation.projectidinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113172RB-I00/ES/ARQUITECTURAS DE DOMINIO ESPECIFICO PARA SISTEMAS DE COMPUTACION ENERGETICAMENTE EFICIENTES/
dc.relation.publisherversionhttps://www.sciencedirect.com/science/article/pii/S138376212500116X
dc.rights.accessOpen Access
dc.rights.licensenameAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.otherDeep Neural Networks (DNNs)
dc.subject.otherHardware accelerators
dc.subject.otherProcessing-In-Memory (PIM)
dc.subject.otherReRAM
dc.subject.otherLifespan
dc.subject.otherLifetime
dc.titleHamun: an approximate computing method to prolong the lifespan of ReRAM-based accelerators
dc.typeArticle
dspace.entity.typePublication
local.citation.authorSabri, M.; Riera, M.; Gonzalez, A.
local.citation.numberarticle 103444
local.citation.publicationNameJournal of systems architecture
local.citation.volume166
local.identifier.drac42189522

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
1-s2.0-S138376212500116X-main.pdf
Mida:
2.46 MB
Format:
Adobe Portable Document Format
Descripció: