Mostra el registre d'ítem simple

dc.contributor.authorXu, Ying hao
dc.contributor.authorVidal, Miquel
dc.contributor.authorArejita, Beñat
dc.contributor.authorDiaz, Javier
dc.contributor.authorAlvarez, Carlos
dc.contributor.authorJiménez González, Daniel
dc.contributor.authorMartorell Bofill, Xavier
dc.contributor.authorMantovani, Filippo
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2018-03-06T09:36:26Z
dc.date.available2018-03-06T09:36:26Z
dc.date.issued2018
dc.identifier.citationXu, Y. H. [et al.]. Implementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset. A: "Parallel Computing is Everywhere (serie: Advances in Parallel Computing)". IOS Press, 2018, p. 642-651.
dc.identifier.isbn0927-5452
dc.identifier.urihttp://hdl.handle.net/2117/114842
dc.description.abstractThis paper presents and analyzes a heterogeneous implementation of an industrial use case based on K-means that targets symmetric multiprocessing (SMP), GPUs and FPGAs. We present how the application can be optimized from an algorithmic point of view and how this optimization performs on two heterogeneous platforms. The presented implementation relies on the OmpSs programming model, which introduces a simplified pragma-based syntax for the communication between the main processor and the accelerators. Performance improvement can be achieved by the programmer explicitly specifying the data memory accesses or copies. As expected, the newer SMP+GPU system studied is more powerful than the older SMP+FPGA system. However the latter is enough to fulfill the requirements of our use case and we show that uses less energy when considering only the active power of the execution.
dc.description.sponsorshipThis work is partially supported by the European Union H2020 project AXIOM (grant agreement n. 645496), HiPEAC (grant agreement n. 687698), and Mont-Blanc (grant agreements n. 288777, 610402 and 671697), the Spanish Government Programa Severo Ochoa (SEV-2015-0493), the Spanish Ministry of Science and Technology (TIN2015- 65316-P) and the Departament d’Innovació, Universitats i Empresa de la Generalitat de Catalunya, under project MPEXPAR: Models de Programaci´o i Entorns d’Execució Paral·lels (2014-SGR-1051).
dc.format.extent10 p.
dc.language.isoeng
dc.publisherIOS Press
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshCluster analysis--Data processing
dc.subject.lcshDynamic programming
dc.subject.otherFPGA
dc.subject.otherArm
dc.subject.otherClustering
dc.subject.otherHeterogeneous programming
dc.subject.otherOmpSs
dc.subject.otherFPGA automatic toolchain
dc.titleImplementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset
dc.typeConference lecture
dc.subject.lemacProgramari--Disseny
dc.identifier.doi10.3233/978-1-61499-843-3-642
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://ebooks.iospress.nl/volumearticle/48661
dc.rights.accessOpen Access
local.identifier.drac28596125
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H2020/645496/EU/Agile, eXtensible, fast I%2FO Module for the cyber-physical era/AXIOM
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H2020/687698/EU/High Performance and Embedded Architecture and Compilation/HiPEAC
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/288777/EU/Mont-Blanc, European scalable and power efficient HPC platform based on low-power embedded technology/MONT-BLANC
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/FP7/610402/EU/Mont-Blanc 2, European scalable and power efficient HPC platform based on low-power embedded technology/MONT-BLANC 2
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/H2020/671697/EU/Mont-Blanc 3, European scalable and power efficient HPC platform based on low-power embedded technology/Mont-Blanc 3
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/
local.citation.publicationNameParallel Computing is Everywhere (serie: Advances in Parallel Computing)
local.citation.volume32
local.citation.startingPage642
local.citation.endingPage651


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple