Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL
Document typeConference report
Rights accessRestricted access - publisher's policy
European Commisision's projectTERAFLUX - Exploiting dataflow parallelism in Teradevice Computing (EC-FP7-249013)
In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incorporate the use of OpenCL or CUDA kernels. We evaluate the proposal on three different architectures, SMP, Cell/B.E. and GPUs, showing the wide usefulness of the approach. The evaluation is done with four different benchmarks, Matrix Multiply, BlackScholes, Perlin Noise, and Julia Set. We compare the results obtained with the execution of the same benchmarks written in OpenCL, in the same architectures. The results show that OMPSs greatly outperforms the OpenCL environment. It is more flexible to exploit multiple accelerators. And due to the simplicity of the annotations, it increases programmer’s productivity.
CitationFerrer, R. [et al.]. Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL. A: International Workshop on Languages and Compilers for Parallel Computing. "Languages and Compilers for Parallel Computing 23rd International Workshop, LCPC 2010: Houston, TX: USA, October 7-9, 2010: revised selected papers". Houston, Texas: Springer, 2010, p. 215-229.
|Optimizing the ... with OpenMP and OpenCL.pdf||Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL||404,2Kb||Restricted access|