Memory bandwidth has always been a critical factor for the performance of many data intensive applications. The increasing processor performance, and the advert of single chip multiprocessors have increased the memory bandwidth demands beyond what a single commodity memory device can provide.
The immediate solution is to use more than one memory device, and interleave data across them so they can be used in parallel
as if they were a single device of higher bandwidth.
In this paper we showed that fine-grain memory interleaving on the evaluated many-core architectures with many DRAM
channels was critical to achieve high memory bandwidth efficiency. Our results showed that performance can degrade up to 50% due to achievable bandwidths being far from the maximum installed.
CitationCabarcas, F. [et al.]. Interleaving granularity on high bandwidth memory architecture for CMPs. A: International Symposium on Systems, Architectures, Modeling, and Simulation. "SAMOS 2010 : International Symposium on Systems, Architectures, Modeling, and Simulation (SAMOS X)". Samos: IEEE Computer Society Publications, 2010, p. 250-257.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder. If you wish to make any use of the work not provided for in the law, please contact: firstname.lastname@example.org