Mostra el registre d'ítem simple

dc.contributor.authorDuric, Milovan
dc.contributor.authorPalomar Pérez, Óscar
dc.contributor.authorSmith, Aaron
dc.contributor.authorStanic, Milan
dc.contributor.authorUnsal, Osman Sabri
dc.contributor.authorCristal Kestelman, Adrián
dc.contributor.authorValero Cortés, Mateo
dc.contributor.authorBurger, Doug
dc.contributor.authorVeidenbaum, Alexander V
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2015-05-06T12:37:41Z
dc.date.created2014
dc.date.issued2014
dc.identifier.citationDuric, M. [et al.]. Dynamic-vector execution on a general purpose EDGE chip multiprocessor. A: International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation. "International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XIV): proceedings: July 14-17, 2014: Samos, Greece". Samos: Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 18-25.
dc.identifier.isbn978-1-4799-3770-7
dc.identifier.urihttp://hdl.handle.net/2117/27788
dc.description.abstractThis paper proposes a cost-effective technique that morphs the available cores of a low power chip multiprocessor (CMP) into an accelerator for data parallel (DLP) workloads. Instead of adding a special-purpose vector architecture as an accelerator, our technique leverages the resources of each CMP core to mimic the functionality of a vector processor. The morphing provides dynamic vector execution (DVX) on a general purpose CMP, by adding minimal hardware for vector control. DVX enhances the vector execution by dynamically configuring the allocation of compute and memory resources to match particular workload requirements. As an energy efficient substrate, we utilize modest dual issue cores based on an Explicit Data Graph Execution (EDGE) architecture. The results show that a DVX enabled 4-core EDGE CMP improves the energy-delay product over 14x, at the cost of only 1.1% of additional area. We compare DVX against a CMP that adds a dedicated DLP accelerator based on a conventional high performance vector design. The vector accelerator increases the area footprint over 74%, which greatly affects the cost of the modest processor. DVX avoids the additional costs and yet gains over 86% of the speedup obtained with the dedicated accelerator.
dc.description.sponsorshipThis work has been partially funded by the Spanish Government (TIN2012-34557), the European Research Council under the European Unions 7th FP (FP/2007-2013) / ERC GA n. 321253. and Microsoft Research
dc.format.extent8 p.
dc.language.isoeng
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.subjectÀrees temàtiques de la UPC::Enginyeria electrònica::Microelectrònica
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshMicroprocessors
dc.subject.lcshComputer architecture
dc.subject.otherDLP accelerator
dc.subject.otherDVX enabled 4-core EDGE CMP
dc.subject.otherEDGE architecture
dc.subject.otherCost-effective technique
dc.subject.otherData parallel workloads
dc.subject.otherDedicated accelerator
dc.subject.otherDynamic vector execution
dc.subject.otherEnergy efficient substrate
dc.subject.otherEnergy-delay product
dc.subject.otherExplicit data graph execution
dc.subject.otherFunctionality
dc.subject.otherGeneral purpose CMP
dc.subject.otherGeneral purpose EDGE chip
dc.subject.otherMultiprocessor
dc.subject.otherHigh performance vector design
dc.subject.otherLow power chip multiprocessor
dc.subject.otherMinimal hardware
dc.subject.otherModest processor
dc.subject.otherSpecial-purpose vector architecture
dc.subject.otherVector accelerator
dc.subject.otherVector control
dc.subject.otherVector processor
dc.subject.otherMicroprocessor chips
dc.subject.otherMultiprocessing systems
dc.subject.otherComputational modeling
dc.subject.otherComputer architecture
dc.subject.otherHardware
dc.subject.otherInstruction sets
dc.subject.otherMessage systems
dc.subject.otherRegisters
dc.subject.otherVectors
dc.titleDynamic-vector execution on a general purpose EDGE chip multiprocessor
dc.typeConference report
dc.subject.lemacMicroprocessadors
dc.subject.lemacArquitectura d'ordinadors
dc.contributor.groupUniversitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.identifier.doi10.1109/SAMOS.2014.6893190
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6893190
dc.rights.accessRestricted access - publisher's policy
local.identifier.drac15430860
dc.description.versionPostprint (published version)
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/321253/EU/Riding on Moore's Law/ROMOL
dc.date.lift10000-01-01
local.citation.authorDuric, M.; Palomar, O.; Smith, A.; Stanic, M.; Unsal, O.; Cristal, A.; Valero, M.; Burger, D.; Veidenbaum, A.V.
local.citation.contributorInternational Conference on Embedded Computer Systems: Architectures, Modeling and Simulation
local.citation.pubplaceSamos
local.citation.publicationNameInternational Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS XIV): proceedings: July 14-17, 2014: Samos, Greece
local.citation.startingPage18
local.citation.endingPage25


Fitxers d'aquest items

Imatge en miniatura

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple