Dexter: a performance-cost efficient resource allocation manager for serverless data analytics

Carregant...
Miniatura
El pots comprar en digital a:
El pots comprar en paper a:

Projectes de recerca

Unitats organitzatives

Número de la revista

Títol de la revista

ISSN de la revista

Títol del volum

Col·laborador

Editor

Tribunal avaluador

Realitzat a/amb

Tipus de document

Text en actes de congrés

Data publicació

Editor

Association for Computing Machinery (ACM)

Condicions d'accés

Accés obert

Llicència

Creative Commons
Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Llevat que s'hi indiqui el contrari, els seus continguts estan subjectes a la llicència de Creative Commons: Reconeixement 4.0 Internacional

Assignatures relacionades

Assignatures relacionades

Publicacions relacionades

Datasets relacionats

Datasets relacionats

Projecte CCD

Abstract

Leveraging serverless platforms for the efficient execution of distributed data analytics frameworks, such as Apache Spark [ 3], has gained substantial interest since early 2022. The elasticity, free-of-management, and on-demand scalability of serverless have motivated the effort in deploying distributed data analytics applications to serverless platforms. However, effectively auto-scaling resources for such complex workloads so that we can fully benefit from the resource elasticity of serverless remains challenging. Mis-configuration can result in severe performance and cost issues arising from resource under- and over-provisioning. In this paper, we present Dexter, a robust resource allocation manager dynamically allocating resources at a fine-grained level to guarantee performance-cost efficiency (optimizing total runtime cost). Dexter is novel in combining predictive and reactive strategies that fully leverage the elasticity of serverless to enhance the performance-cost efficiency for workflow executions. Unlike blackbox ML models, Dexter quickly reaches a sufficiently good solution, prioritizing simplicity, generality, and ease of understanding. Our experimental evaluation shows that, compared with the default serverless Spark resource allocation that dynamically requests exponentially more executors to accommodate pending tasks, our solution achieves a cost reduction of up to 4.65×, while improving performance-cost efficiency up to 3.50×. Dexter also enables a substantial resource saving, demanding up to 5.75× fewer resources.

Descripció

Persones/entitats

Document relacionat

Versió de

Citació

Nestorov, A. [et al.]. Dexter: a performance-cost efficient resource allocation manager for serverless data analytics. A:  ACM/IFIP International Middleware Conference. "Middleware'24: proceedings of the Twenty-Fifth ACM International Middleware Conference: December 2-6, 2024, Hong Kong, Hong Kong". New York: Association for Computing Machinery (ACM), 2024, p. 117-130. ISBN 979-8-4007-0623-3. DOI 10.1145/3652892.3700753 .

Ajut

Forma part

Dipòsit legal

ISBN

979-8-4007-0623-3

ISSN

Altres identificadors

Referències