PMSS: a programmable memory system and scheduler for complex memory patterns
Rights access: Restricted access - publisher's policy
The HPC industry demands more computing units on FPGAs to enhance performance through task and data parallelism. FPGAs can deliver their best performance on certain kernels by customizing the hardware for the application. However, applications are becoming more complex, with multiple kernels and complex data arrangements, which generates overhead when scheduling and managing system resources. For this reason, all classes of multi-threaded machines, from minicomputers to supercomputers, require an efficient hardware scheduler and memory manager that improves the effective bandwidth and latency of the DRAM main memory. Such an architecture could be a very competitive choice for supercomputing systems that must meet the parallelism demands of HPC benchmarks. In this article, we propose a Programmable Memory System and Scheduler (PMSS), which provides high-speed access to complex data patterns for multi-threaded architectures. The proposed PMSS is implemented and tested on a Xilinx ML505 evaluation FPGA board. Its performance is compared with a microprocessor-based system integrated with the Xilkernel operating system. Results show that the modified PMSS-based multi-accelerator system consumes 50% fewer hardware resources and 32% less on-chip power, and achieves approximately a 19x speedup compared to the MicroBlaze-based system.
Citation: Hussain, T.; Haider, A.; Ayguade, E. PMSS: a programmable memory system and scheduler for complex memory patterns. "Journal of parallel and distributed computing", October 2014, vol. 74, no. 15, p. 2983-2993.