Empowering automatic data-center management with machine learning
Document typeResearch report
Rights accessOpen Access
The Cloud as computing paradigm has become nowadays crucial for most Internet business models. Managing a cloud and optimizing its performance on a moment-by-moment basis is not easy given as the amount and diversity of elements involved (hardware, applications, workloads, customer needsdots). Here we show how a combination of scheduling algorithms and data mining techniques helps improving the performance and profitability of a data-center running virtualized web-services. We model the data-center's main resources (CPU, memory, I/O), quality of service (viewed as response time), and workloads (incoming streams of requests) from past executions. We show how these models to help scheduling algorithms make better decisions about job and resource allocation, aiming for a balance between throughput, quality of service, and power consumption. We test our approach first with real data and web-services on a data-center simulator, and further validate it in a real execution on a reduced scale cluster running the Open-Nebula virtualization platform.
CitationBerral, J., Gavaldà, R., Torres, J. "Empowering automatic data-center management with machine learning". 2013.
Is part ofLSI-13-1-R
- CAP - Grup de Computació d'Altes Prestacions - Reports de recerca 
- Departament d'Arquitectura de Computadors - Reports de recerca 
- LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge - Reports de recerca 
- Departament de Ciències de la Computació - Reports de recerca [1.044]
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder