Mostra el registre d'ítem simple

dc.contributor.authorMoretó Planas, Miquel
dc.contributor.authorCazorla, Francisco
dc.contributor.authorRamírez Bellido, Alejandro
dc.contributor.authorValero Cortés, Mateo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2018-01-22T11:38:22Z
dc.date.available2018-01-22T11:38:22Z
dc.date.issued2007
dc.identifier.citationMoreto, M., Cazorla, F., Ramírez, A., Valero, M. Online prediction of applications cache utility. A: International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation. "2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2007: July 16-19, 2007, Samos, Greece: proceedings". Samos: Institute of Electrical and Electronics Engineers (IEEE), 2007, p. 169-177.
dc.identifier.isbn1-4244-1058-4
dc.identifier.urihttp://hdl.handle.net/2117/113042
dc.description.abstractGeneral purpose architectures are designed to offer average high performance regardless of the particular application that is being run. Performance and power inefficiencies appear as a consequence for some programs. Reconfigurable hardware (cache hierarchy, branch predictor, execution units, bandwidth, etc.) has been proposed to overcome these inefficiencies by dynamically adapting the architecture to the application needs. However, nearly all the proposals use indirect measures or heuristics of performance to decide new configurations, what may lead to inefficiencies. In this paper we propose a runtime mechanism that allows to predict the throughput of an application on an architecture using a reconfigurable L2 cache. L2 cache size varies at a way granularity and we predict the performance of the same application on all other L2 cache sizes at the same time. We obtain for different L2 cache sizes an average error of 3.11%, a maximum error of 16.4% and standard deviation of 3.7%. No profiling or operating system participation is needed in this mechanism. We also give a hardware implementation that allows to reduce the hardware cost under 0.4% of the total L2 size and maintains high accuracy. This mechanism can be used to reduce power consumption in single threaded architectures and improve performance in multithreaded architectures that dynamically partition shared L2 caches.
dc.format.extent9 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshSimultaneous multithreading processors
dc.subject.lcshCompilers (Computer programs)
dc.subject.otherReconfigurable architectures
dc.subject.otherMulti-threading
dc.subject.otherProgram compilers
dc.titleOnline prediction of applications cache utility
dc.typeConference report
dc.subject.lemacMultiprocessadors
dc.subject.lemacCompiladors (Programes d'ordinador)
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/ICSAMOS.2007.4285748
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/document/4285748/
dc.rights.accessOpen Access
local.identifier.drac2325387
dc.description.versionPostprint (published version)
local.citation.authorMoreto, M.; Cazorla, F.; Ramírez, A.; Valero, M.
local.citation.contributorInternational Conference on Embedded Computer Systems: Architectures, Modeling and Simulation
local.citation.pubplaceSamos
local.citation.publicationName2007 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2007: July 16-19, 2007, Samos, Greece: proceedings
local.citation.startingPage169
local.citation.endingPage177


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple