Extending OpenMP* with vector constructs for modern multicore SIMD architectures
Document typeConference report
Rights accessRestricted access - publisher's policy
In order to obtain maximum performance, many applications require to extend parallelism from multi-threading to instruction-level (SIMD) parallelism that exists in many current (and future) multi-core architectures. While auto-vectorization technology has been used to exploit this SIMD level, it is not always enough due to OpenMP semantics and compiler technology limitations. In those cases, programmers need to resort to low-level intrinsics or vendor specific directives. We propose a new OpenMP directive: the simd directive. This directive will allow programmers to guide the vectorization process enabling a more productive and portable exploitation of the SIMD level. Our performance results show significant improvements over current auto-vectorizing technology of the Intel® Composer XE 2011.
CitationKlemm, M. [et al.]. Extending OpenMP* with vector constructs for modern multicore SIMD architectures. A: International Workshop on OpenMP. "OpenMP in a heterogeneous world: 8th International Workshop on OpenMP, IWOMP 2012: Rome, Italy: June 11-13, 2012: proceedings". Roma: Springer, 2012, p. 59-72.
|Extending OpenM ... ore SIMD Architectures.pdf||Extending OpenMP with Vector Constructs for Modern Multicore SIMD Architectures||329.7Kb||Restricted access|