Cost-conscious strategies to increase performance of numerical programs on agressive VLIW architectures

López Álvarez, David; Llosa Espuny, José Francisco; Valero Cortés, Mateo; Ayguadé Parra, Eduard

doi:10.1109/12.956090

Visualitza/Obre

Cost-conscious strategies to increase performance of numerical programs on aggressive VLIW architectures (2,418Mb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

López Álvarez, David

Llosa Espuny, José Francisco

Valero Cortés, Mateo

Ayguadé Parra, Eduard

Tipus de documentArticle

Data publicació2001-10

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

Loops are the main time-consuming part of numerical applications. The performance of the loops is limited either by the resources offered by the architecture or by recurrences in the computation. To execute more operations per cycle, current processors are designed with growing degrees of resource replication (replication technique) for memory ports and functional units. However, the high cost in terms of area and cycle time of this technique precludes the use of high degrees of replication. High values for the cycle time may clearly offset any gain in terms of number of execution cycles. High values for the area may lead to an unimplementable configuration. An alternative to resource replication is resource widening (widening technique), which has also been used in some recent designs in which the width of the resources is increased (i.e., a single operation is performed over multiple data). Moreover, several general-purpose superscalar microprocessors have been implemented with multiply-add fused floating-point units (fusion technique), which reduces the latency of the combined operation and the number of resources used. The authors evaluate a broad set of VLIW processor design alternatives that combine the three techniques. We perform a technological projection for the next processor generations in order to foresee the possible implementable alternatives. From this study, we conclude that if the cost is taken into account, combining certain degrees of replication and widening in the hardware resources is more effective than applying only replication. Also, we confirm that multiply-add fused units will have a significant impact in raising the performance of future processor architectures with a reasonable increase in cost

CitacióLópez, D., Llosa, J., Valero, M., Ayguadé, E. Cost-conscious strategies to increase performance of numerical programs on agressive VLIW architectures. "IEEE transactions on computers", Octubre 2001, vol. 50, núm. 10, p. 1033-1051.

URIhttp://hdl.handle.net/2117/85498

DOI10.1109/12.956090

ISSN0018-9340

Versió de l'editorhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=956090

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Cost-conscious ... ive VLIW architectures.pdf	Cost-conscious strategies to increase performance of numerical programs on aggressive VLIW architectures	2,418Mb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Cost-conscious strategies to increase performance of numerical programs on agressive VLIW architectures

Visualitza/Obre

Explora