Dexter: a performance-cost efficient resource allocation manager for serverless data analytics
| dc.contributor.author | Nestorov, Anna Maria |
| dc.contributor.author | Marrón Vida, Diego |
| dc.contributor.author | Gutiérrez Torre, Alberto |
| dc.contributor.author | Wang, Chen |
| dc.contributor.author | Misale, Claudia |
| dc.contributor.author | Youssef, Alaa |
| dc.contributor.author | Carrera Pérez, David |
| dc.contributor.author | Berral García, Josep Lluís |
| dc.contributor.group | Universitat Politècnica de Catalunya. CROMAI - Computing Resources Orchestration and Management for AI |
| dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
| dc.contributor.other | Barcelona Supercomputing Center |
| dc.date.accessioned | 2025-03-27T11:57:43Z |
| dc.date.available | 2025-03-27T11:57:43Z |
| dc.date.issued | 2024 |
| dc.description.abstract | Leveraging serverless platforms for the efficient execution of distributed data analytics frameworks, such as Apache Spark [ 3], has gained substantial interest since early 2022. The elasticity, free-of-management, and on-demand scalability of serverless have motivated the effort in deploying distributed data analytics applications to serverless platforms. However, effectively auto-scaling resources for such complex workloads so that we can fully benefit from the resource elasticity of serverless remains challenging. Mis-configuration can result in severe performance and cost issues arising from resource under- and over-provisioning. In this paper, we present Dexter, a robust resource allocation manager dynamically allocating resources at a fine-grained level to guarantee performance-cost efficiency (optimizing total runtime cost). Dexter is novel in combining predictive and reactive strategies that fully leverage the elasticity of serverless to enhance the performance-cost efficiency for workflow executions. Unlike blackbox ML models, Dexter quickly reaches a sufficiently good solution, prioritizing simplicity, generality, and ease of understanding. Our experimental evaluation shows that, compared with the default serverless Spark resource allocation that dynamically requests exponentially more executors to accommodate pending tasks, our solution achieves a cost reduction of up to 4.65×, while improving performance-cost efficiency up to 3.50×. Dexter also enables a substantial resource saving, demanding up to 5.75× fewer resources. |
| dc.description.peerreviewed | Peer Reviewed |
| dc.description.sponsorship | This work is financed by the EU-HORIZON programme under grant agreements EU-HORIZON GA.101092646, EU-HORIZON MSCA GA.101086248, by Generalitat de Catalunya (AGAUR) GA.2021-SGR-00478, and the Spanish Ministry of Science (MICINN), the Research State Agency (AEI) and European Regional Development Funds (ERDF/FEDER) PID2021-126248OB-I00, MCIN/AEI/10.13039/ 501100011033/FEDER, UE. |
| dc.description.version | Postprint (author's final draft) |
| dc.format.extent | 14 p. |
| dc.identifier.citation | Nestorov, A. [et al.]. Dexter: a performance-cost efficient resource allocation manager for serverless data analytics. A: ACM/IFIP International Middleware Conference. "Middleware'24: proceedings of the Twenty-Fifth ACM International Middleware Conference: December 2-6, 2024, Hong Kong, Hong Kong". New York: Association for Computing Machinery (ACM), 2024, p. 117-130. ISBN 979-8-4007-0623-3. DOI 10.1145/3652892.3700753 . |
| dc.identifier.doi | 10.1145/3652892.3700753 |
| dc.identifier.isbn | 979-8-4007-0623-3 |
| dc.identifier.uri | https://hdl.handle.net/2117/427195 |
| dc.language.iso | eng |
| dc.publisher | Association for Computing Machinery (ACM) |
| dc.relation.projectid | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-126248OB-I00/ES/DISTRIBUCION DE ANALISIS DE DATOS Y APRENDIZAJE EN TECNOLOGIAS EDGE-SUPERCOMPUTING/ |
| dc.relation.projectid | info:eu-repo/grantAgreement/EC/HE/101092646/EU/Adaptive virtualization for AI-enabled Cloud-edge Continuum/CloudSkin |
| dc.relation.publisherversion | https://dl.acm.org/doi/10.1145/3652892.3700753 |
| dc.rights.access | Open Access |
| dc.rights.licensename | Attribution 4.0 International |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ |
| dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors::Arquitectures distribuïdes |
| dc.subject.other | Serverless |
| dc.subject.other | Resource allocation |
| dc.subject.other | Data analytics |
| dc.subject.other | Spark |
| dc.subject.other | Stage |
| dc.title | Dexter: a performance-cost efficient resource allocation manager for serverless data analytics |
| dc.type | Conference report |
| dspace.entity.type | Publication |
| local.citation.author | Nestorov, A.; Marrón, D.; Gutierrez-Torre, A.; Wang, C.; Misale, C.; Youssef, A.; Carrera, D.; Berral, J. |
| local.citation.contributor | ACM/IFIP International Middleware Conference |
| local.citation.endingPage | 130 |
| local.citation.publicationName | Middleware’24: proceedings of the Twenty-Fifth ACM International Middleware Conference: December 2-6, 2024, Hong Kong, Hong Kong |
| local.citation.pubplace | New York |
| local.citation.startingPage | 117 |
| local.identifier.drac | 40727735 |
Fitxers
Paquet original
1 - 1 de 1
Carregant...
- Nom:
- DEXTER_Anna_preprint.pdf
- Mida:
- 3.59 MB
- Format:
- Adobe Portable Document Format
- Descripció:



