Mostra el registre d'ítem simple
PMSS: a programmable memory system and scheduler for complex memory patterns
dc.contributor.author | Hussain, Tassadaq |
dc.contributor.author | Haider, Amna |
dc.contributor.author | Ayguadé Parra, Eduard |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2015-05-07T12:11:55Z |
dc.date.created | 2014-10 |
dc.date.issued | 2014-10 |
dc.identifier.citation | Hussain, T.; Haider, A.; Ayguade, E. PMSS: a programmable memory system and scheduler for complex memory patterns. "Journal of parallel and distributed computing", Octubre 2014, vol. 74, núm. 15, p. 2983-2993. |
dc.identifier.issn | 0743-7315 |
dc.identifier.uri | http://hdl.handle.net/2117/27820 |
dc.description.abstract | HPC industry demands more computing units on FPGAs, to enhance the performance by using task/data parallelism. FPGAs can provide its ultimate performance on certain kernels by customizing the hardware for the applications. However, applications are getting more complex, with multiple kernels and complex data arrangements, generating overhead while scheduling/managing system resources. Due to this reason all classes of multi threaded machines–minicomputer to supercomputer–require to have efficient hardware scheduler and memory manager that improves the effective bandwidth and latency of the DRAM main memory. This architecture could be a very competitive choice for supercomputing systems that meets the demand of parallelism for HPC benchmarks. In this article, we proposed a Programmable Memory System and Scheduler (PMSS), which provides high speed complex data access pattern to the multi threaded architecture. This proposed PMSS system is implemented and tested on a Xilinx ML505 evaluation FPGA board. The performance of the system is compared with a microprocessor based system that has been integrated with the Xilkernel operating system. Results show that the modified PMSS based multi-accelerator system consumes 50% less hardware resources, 32% less on-chip power and achieves approximately a 19x speedup compared to the MicroBlaze based system. |
dc.format.extent | 11 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures paral·leles |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Sistemes d'informació::Emmagatzematge i recuperació de la informació |
dc.subject.lcsh | Computer storage devices |
dc.subject.lcsh | Parallel programming (Computer science) |
dc.subject.other | HPC |
dc.subject.other | DRAM |
dc.subject.other | Xilkernel |
dc.subject.other | FPGA |
dc.title | PMSS: a programmable memory system and scheduler for complex memory patterns |
dc.type | Article |
dc.subject.lemac | Ordinadors -- Dispositius de memòria |
dc.subject.lemac | Programació en paral·lel (Informàtica) |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1016/j.jpdc.2014.06.005 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | http://www.sciencedirect.com/science/article/pii/S0743731514001075 |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 15625235 |
dc.description.version | Postprint (published version) |
dc.date.lift | 10000-01-01 |
local.citation.author | Hussain, T.; Haider, A.; Ayguade, E. |
local.citation.publicationName | Journal of parallel and distributed computing |
local.citation.volume | 74 |
local.citation.number | 15 |
local.citation.startingPage | 2983 |
local.citation.endingPage | 2993 |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Articles de revista [1.050]
-
Articles de revista [382]