dc.contributor.author | Baig, Shuja-ur-Rehman |
dc.contributor.author | Iqbal, Waheed |
dc.contributor.author | Berral García, Josep Lluís |
dc.contributor.author | Erradi, Abdelkarim |
dc.contributor.author | Carrera Pérez, David |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.contributor.other | Barcelona Supercomputing Center |
dc.date.accessioned | 2020-05-05T12:33:26Z |
dc.date.available | 2020-05-05T12:33:26Z |
dc.date.issued | 2019-12 |
dc.identifier.citation | Baig, S. [et al.]. Real-time data center's telemetry reduction and reconstruction using Markov chain models. "IEEE systems journal", Desembre 2019, vol. 13, núm. 4, p. 4039-4050. |
dc.identifier.issn | 1932-8184 |
dc.identifier.uri | http://hdl.handle.net/2117/186371 |
dc.description.abstract | Large-scale data centers are composed of thousands of servers organized in interconnected racks to offer services to users. These data centers continuously generate large amounts of telemetry data streams (e.g., hardware utilization metrics) used for multiple purposes, including resource management, workload characterization, resource utilization prediction, capacity planning, and real-time analytics. These telemetry streams require costly bandwidth utilization and storage space, particularly at medium-long term for large data centers. This paper addresses this problem by proposing and evaluating a system to efficiently reduce bandwidth and storage for telemetry data through real-time modeling using Markov chain based methods. Our proposed solution was evaluated using real telemetry datasets and compared with polynomial regression methods for reducing and reconstructing data. Experimental results show that data can be lossy compressed up to 75% for bandwidth utilization and 95.33% for storage space, with reconstruction accuracy close to 92%. |
dc.description.sponsorship | This work was supported in part by the European Research Council (ERC) under the EU Horizon 2020 programme under Grant GA 639595, in part by the Spanish Ministry of Economy, Industry and Competitiveness under Grant TIN2015-65316-P and Grant IJCI2016-27485, in part by the Generalitat de Catalunya under Grant 2014-SGR-1051, in part by the University of the Punjab, Pakistan, and in part by the Qatar National Research Fund (a member of Qatar Foundation) under NPRP Grant # NPRP9-224-1-049. |
dc.format.extent | 12 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Real-time data processing |
dc.subject.lcsh | Markov processes |
dc.subject.lcsh | Data processing service centers |
dc.subject.other | Data center monitoring |
dc.subject.other | Data reconstruction |
dc.subject.other | Data reduction |
dc.subject.other | Markov chain models (MMs) |
dc.subject.other | Polynomial regression (PR) |
dc.subject.other | Telemetry |
dc.title | Real-time data center's telemetry reduction and reconstruction using Markov chain models |
dc.type | Article |
dc.subject.lemac | Temps real (Informàtica) |
dc.subject.lemac | Markov, Processos de |
dc.subject.lemac | Centres informàtics |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1109/JSYST.2019.2918430 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://ieeexplore.ieee.org/document/8734756 |
dc.rights.access | Open Access |
local.identifier.drac | 28083974 |
dc.description.version | Postprint (author's final draft) |
dc.relation.projectid | info:eu-repo/grantAgreement/EC/H2020/639595/EU/Holistic Integration of Emerging Supercomputing Technologies/Hi-EST |
dc.relation.projectid | info:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/ |
dc.relation.projectid | info:eu-repo/grantAgreement/AGAUR/V PRI/2014 SGR 1051 |
dc.relation.projectid | info:eu-repo/grantAgreement/MINECO/1PE/IJCI-2016-27485 |
local.citation.author | Baig, S.; Iqbal, W.; Berral, J.; Erradi, A.; Carrera, D. |
local.citation.publicationName | IEEE systems journal |
local.citation.volume | 13 |
local.citation.number | 4 |
local.citation.startingPage | 4039 |
local.citation.endingPage | 4050 |