Performance and power evaluation of clustered VLIW processors with wide functional units
Rights access: Restricted access - publisher's policy
Architectural resources and program recurrences are the main limitations on the amount of Instruction-Level Parallelism (ILP) exploitable from loops. To increase the number of operations per second, current designs use high degrees of resource replication for memory ports and functional units. However, the high cost of this technique in terms of power and cycle time limits the degree of replication. Clustering is a technique aimed at decentralizing the design of future wide-issue cores and enabling them to meet technology constraints in terms of cycle time, area and power. Another way to reduce the complexity of recent cores is to use wide functional units. This technique requires only minor modifications to the underlying hardware, but it also imposes a penalty on the exploitable parallelism. In this paper we evaluate a broad range of VLIW configurations that make use of these two techniques. From this study we conclude that applying both techniques yields configurations with very good power-performance efficiency.
Citation: Pericas, M. [et al.]. Performance and power evaluation of clustered VLIW processors with wide functional units. "Lecture notes in computer science", November 2004, vol. 3133, p. 88-97.
| Performance and ... Wide Functional Units.pdf | Performance and Power Evaluation of Clustered VLIW Processors with Wide Functional Units | 218.5Kb | Restricted access |