Dexter: a performance-cost efficient resource allocation manager for serverless data analytics

dc.contributor.authorNestorov, Anna Maria
dc.contributor.authorMarrón Vida, Diego
dc.contributor.authorGutiérrez Torre, Alberto
dc.contributor.authorWang, Chen
dc.contributor.authorMisale, Claudia
dc.contributor.authorYoussef, Alaa
dc.contributor.authorCarrera Pérez, David
dc.contributor.authorBerral García, Josep Lluís
dc.contributor.groupUniversitat Politècnica de Catalunya. CROMAI - Computing Resources Orchestration and Management for AI
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor.otherBarcelona Supercomputing Center
dc.date.accessioned2025-03-27T11:57:43Z
dc.date.available2025-03-27T11:57:43Z
dc.date.issued2024
dc.description.abstractLeveraging serverless platforms for the efficient execution of distributed data analytics frameworks, such as Apache Spark [ 3], has gained substantial interest since early 2022. The elasticity, free-of-management, and on-demand scalability of serverless have motivated the effort in deploying distributed data analytics applications to serverless platforms. However, effectively auto-scaling resources for such complex workloads so that we can fully benefit from the resource elasticity of serverless remains challenging. Mis-configuration can result in severe performance and cost issues arising from resource under- and over-provisioning. In this paper, we present Dexter, a robust resource allocation manager dynamically allocating resources at a fine-grained level to guarantee performance-cost efficiency (optimizing total runtime cost). Dexter is novel in combining predictive and reactive strategies that fully leverage the elasticity of serverless to enhance the performance-cost efficiency for workflow executions. Unlike blackbox ML models, Dexter quickly reaches a sufficiently good solution, prioritizing simplicity, generality, and ease of understanding. Our experimental evaluation shows that, compared with the default serverless Spark resource allocation that dynamically requests exponentially more executors to accommodate pending tasks, our solution achieves a cost reduction of up to 4.65×, while improving performance-cost efficiency up to 3.50×. Dexter also enables a substantial resource saving, demanding up to 5.75× fewer resources.
dc.description.peerreviewedPeer Reviewed
dc.description.sponsorshipThis work is financed by the EU-HORIZON programme under grant agreements EU-HORIZON GA.101092646, EU-HORIZON MSCA GA.101086248, by Generalitat de Catalunya (AGAUR) GA.2021-SGR-00478, and the Spanish Ministry of Science (MICINN), the Research State Agency (AEI) and European Regional Development Funds (ERDF/FEDER) PID2021-126248OB-I00, MCIN/AEI/10.13039/ 501100011033/FEDER, UE.
dc.description.versionPostprint (author's final draft)
dc.format.extent14 p.
dc.identifier.citationNestorov, A. [et al.]. Dexter: a performance-cost efficient resource allocation manager for serverless data analytics. A:  ACM/IFIP International Middleware Conference. "Middleware'24: proceedings of the Twenty-Fifth ACM International Middleware Conference: December 2-6, 2024, Hong Kong, Hong Kong". New York: Association for Computing Machinery (ACM), 2024, p. 117-130. ISBN 979-8-4007-0623-3. DOI 10.1145/3652892.3700753 .
dc.identifier.doi10.1145/3652892.3700753
dc.identifier.isbn979-8-4007-0623-3
dc.identifier.urihttps://hdl.handle.net/2117/427195
dc.language.isoeng
dc.publisherAssociation for Computing Machinery (ACM)
dc.relation.projectidinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-126248OB-I00/ES/DISTRIBUCION DE ANALISIS DE DATOS Y APRENDIZAJE EN TECNOLOGIAS EDGE-SUPERCOMPUTING/
dc.relation.projectidinfo:eu-repo/grantAgreement/EC/HE/101092646/EU/Adaptive virtualization for AI-enabled Cloud-edge Continuum/CloudSkin
dc.relation.publisherversionhttps://dl.acm.org/doi/10.1145/3652892.3700753
dc.rights.accessOpen Access
dc.rights.licensenameAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectÀrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures distribuïdes
dc.subject.otherServerless
dc.subject.otherResource allocation
dc.subject.otherData analytics
dc.subject.otherSpark
dc.subject.otherStage
dc.titleDexter: a performance-cost efficient resource allocation manager for serverless data analytics
dc.typeConference report
dspace.entity.typePublication
local.citation.authorNestorov, A.; Marrón, D.; Gutierrez-Torre, A.; Wang, C.; Misale, C.; Youssef, A.; Carrera, D.; Berral, J.
local.citation.contributor ACM/IFIP International Middleware Conference
local.citation.endingPage130
local.citation.publicationNameMiddleware’24: proceedings of the Twenty-Fifth ACM International Middleware Conference: December 2-6, 2024, Hong Kong, Hong Kong
local.citation.pubplaceNew York
local.citation.startingPage117
local.identifier.drac40727735

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
DEXTER_Anna_preprint.pdf
Mida:
3.59 MB
Format:
Adobe Portable Document Format
Descripció: