Loop parallelization: revisiting framework of unimodular transformations
Document typeConference report
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessOpen Access
The paper extends the framework of linear loop transformations adding a new nonlinear step at the transformation process. The current framework of linear loop transformation cannot identify a significant fraction of parallelism. For this reason, we present a method to complement it with some basic transformations in order to extract the maximum loop parallelism in perfect nested loops with tight recurrences in the dependence graph. The parallelizing algorithm solves the important problem of deciding the set of transformations to apply in order to maximize the degree of parallelism, the number of parallel loops within a loop nest, and presents a way of generating efficient transformed code that exploits coarse grain parallelism on a MIMD system
CitationTorres, J., Ayguadé, E., Labarta, J., Valero, M. Loop parallelization: revisiting framework of unimodular transformations. A: Euromicro International Conference on Parallel, Distributed, and Network-Based Processing. "Proceedings of the Fourth Euromicro Workshop on Parallel and Distributed Processing, 1996: PDP '96". Braga: Institute of Electrical and Electronics Engineers (IEEE), 1996, p. 420-427.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder