Rights accessRestricted access - publisher's policy
Current heterogeneous multicore architectures, including the Cell/B.E., GPUs, and future developments, like Larrabee,
require enormous programming efforts to efficiently run current parallel applications, achieving high performance. In
this paper, we want to present the results we obtain from the coding with the SARC Programming Model, of two benchmarks, matrix multiply and conjugate gradient (NAS CG),
with respect memory bandwidth. We show some sample loops annotated and the experience that we got trying to have our system executing them efficienly. Results indicate
that the programming model is able to achieve up to 85% of the peak memory bandwidth on the Cell/B.E. processor.
CitationFerrer, R. [et al.]. Achieving high memory performance from heterogeneous architectures with the SARC programming model. A: Workshop on Memory Performance: dealing with Applications, Systems and Architecture. "Proceedings of the 10th MEDEA workshop on MEmory performance: DEaling with Applications, systems and architecture". Raleigh, North Carolina: ACM, 2009, p. 15-21.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: email@example.com