Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL
Rights access: Restricted access - publisher's policy
European Commission's project: TERAFLUX - Exploiting dataflow parallelism in Teradevice Computing (EC-FP7-249013)
In this paper, we present OMPSs, a programming model based on OpenMP and StarSs that can also incorporate OpenCL or CUDA kernels. We evaluate the proposal on three different architectures (SMP, Cell/B.E., and GPUs), showing the wide usefulness of the approach. The evaluation uses four benchmarks: Matrix Multiply, BlackScholes, Perlin Noise, and Julia Set. We compare the results with those obtained by running the same benchmarks written in OpenCL on the same architectures. The results show that OMPSs greatly outperforms the OpenCL environment. It is also more flexible in exploiting multiple accelerators and, thanks to the simplicity of its annotations, it increases programmer productivity.
Citation: Ferrer, R. [et al.]. Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL. "Lecture notes in computer science", 2011, vol. 6548, p. 215-229.
| Optimizing the ... with OpenMP and OpenCL.pdf | Optimizing the Exploitation of Multicore Processors and GPUs with OpenMP and OpenCL | 404.2Kb | Restricted access |