Show simple item record

dc.contributorGonzález Colás, Antonio María
dc.contributorArnau Montañés, José María
dc.contributor.authorHuerta Gañán, Rodrigo
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.date.accessioned2022-11-17T07:13:59Z
dc.date.available2022-11-17T07:13:59Z
dc.date.issued2022-06-23
dc.identifier.urihttp://hdl.handle.net/2117/376507
dc.description.abstractGPU architectures have become popular for executing general-purpose programs. Moreover, they are some of the most efficient architectures for machine learning applications which are among the most trendy and demanding applications these days. GPUs rely on having a large number of threads that run concurrently to hide the latency among dependent instructions. This work presents SOCGPU (Simple Out-of-order Core for GPU), a simple out-of-order execution mechanism that does not require register renaming nor scoreboards. It uses a small Instruction Buffer and a tiny Dependence matrix to keep track of dependencies among instructions and avoid data hazards. Evaluations for an Nvidia GTX1080TI-like GPU show that SOCGPU provides a speed-up up to 3.76 in some machine learning programs and 1.58 on average for a variety of benchmarks, while it reduces energy consumption by 17.6%, with only 3.48% area overhead when using the same number of warps as the baseline. Moreover, we show that SOCGPU can reduce the number of concurrently running warps without hardly affecting performance, which can provide significant reductions in area, especially in the register file and the instruction scheduler logic, as well as other hardware structures of the GPU cores.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject.lcshGraphics processing units
dc.subject.otherGPU
dc.subject.otherGPGPU
dc.subject.otheremissió fora d'ordre
dc.subject.otherplanificació d'instruccions
dc.subject.otherout-of-order issue
dc.subject.otherinstruction scheduling
dc.titleImproving instruction scheduling in GPGPUs
dc.typeMaster thesis
dc.subject.lemacInfografia
dc.subject.lemacMicroprocessadors
dc.identifier.slug170405
dc.rights.accessOpen Access
dc.date.updated2022-07-08T04:01:26Z
dc.audience.educationlevelMàster
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeMÀSTER UNIVERSITARI EN INNOVACIÓ I RECERCA EN INFORMÀTICA (Pla 2012)


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record