Understanding the overhead of the spin-lock loop in CMT architectures

Cakarevic, Vladimir; Radojković, Petar; Verdú Mulà, Javier; Cazorla Almeida, Francisco Javier; Gioiosa, Roberto; Pajuelo González, Manuel Alejandro; Nemirovsky, Mario; Valero Cortés, Mateo

Visualitza/Obre

Understanding the overhead of the spin-lock loop....pdf (298,7Kb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Cakarevic, Vladimir

Radojković, Petar

Verdú Mulà, Javier

Cazorla Almeida, Francisco Javier

Gioiosa, Roberto

Pajuelo González, Manuel Alejandro

Nemirovsky, Mario

Valero Cortés, Mateo

Tipus de documentText en actes de congrés

Data publicació2008

Condicions d'accésAccés obert

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

Spin locks are a synchronization mechanisms used to provide mutual exclusion to shared software resources. Spin locks are used over other synchronization mechanisms in several situations, like when the average waiting time to obtain the lock is short, in which case the probability of getting the lock is high, or when it is no possible to use other synchronization mechanisms. In this paper, we study the effect that the execution of the Linux spin-lock loop in the Sun UltraSPARC T1 and T2 processors introduces on other running tasks, especially in the worst case scenario where the workload shows high contention on a lock. For this purpose, we create a task that continuously executes the spin-lock loop and execute several instances of this task together with another active tasks. Our results show that, when the spin-lock tasks run with other applications in the same core of a T1 or a T2 processor, they introduce a significant overhead on other applications: 31% in T1 and 42% in T2, on average, respectively. For the T1 and T2 processors, we identify the fetch bandwidth as the main source of interaction between active threads and the spin-lock threads. We, propose 4 different variants of the Linux spin-lock loop that require less fetch bandwidth. Our proposal reduces the overhead of the spin-lock tasks over the other applications down to 3.5% and 1.5% on average, in T1 and T2 respectively. This is a reduction of 28 percentage points with respect to the Linux spin-lock loop for T1. For T2 the reduction is about 40 percentage points.

CitacióCakarevic, V. [et al.]. Understanding the overhead of the spin-lock loop in CMT architectures. A: Workshop on the Interaction between Operating Systems and Computer Architecture. "8th Workshop on the Interaction between Operating Systems and Computer Architecture". 2008, p. 1-10.

URIhttp://hdl.handle.net/2117/9946

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Understanding t ... the spin-lock loop....pdf		298,7Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Understanding the overhead of the spin-lock loop in CMT architectures

Visualitza/Obre

Explora