Effective usage of vector registers in advanced vector architectures

Villa, Luis; Espasa Sans, Roger; Valero Cortés, Mateo

doi:10.1109/PACT.1997.644021

File: 00644021.pdf (1.063 MB, PDF)

Document type: Conference paper

Publication date: 1997

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication, or transformation without the authorization of the rights holder is prohibited.

Abstract

This paper presents data confirming that traditional vector architectures cannot reduce their vector register length without suffering a severe performance penalty. However, we show that by combining vector register length reduction with two different ILP techniques, decoupling and multithreading, the performance penalty can be made very small. We show that each resulting architecture tolerates long memory latencies very well and also makes better use of the available storage space in each vector register. Using decoupling and short vectors, each register can be halved while still providing speedups in the range 1.04-1.49 over a traditional architecture with long registers. Using multithreading, we split the vector register file into two halves and show that two independent threads running on such a machine can yield speedups in the range 1.23-1.29. The paper also explores configurations with 1/4 and 1/8 of the original vector register size, aimed at cost-conscious designs, and shows that even at 1/4 of the original size, the resulting architectures can outperform a traditional machine. We also present results across a wide range of memory latencies and show that the combination of short vectors and ILP techniques results in very good tolerance of slow memory systems.

Citation: Villa, L., Espasa, R., Valero, M. Effective usage of vector registers in advanced vector architectures. In: International Conference on Parallel Architectures and Compilation Techniques. "1997 International Conference on Parallel Architectures and Compilation Techniques: San Francisco, California, November 10-14, 1997: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 1997, p. 250-260.

URI: http://hdl.handle.net/2117/108431

DOI: 10.1109/PACT.1997.644021

ISBN: 0-8186-8090-3

Publisher version: http://ieeexplore.ieee.org/document/644021/

UPCommons. Open knowledge portal of the UPC
