Mostra el registre d'ítem simple
Next stop 'NoOps': enabling cross-system diagnostics through graph-based composition of logs and metrics
dc.contributor.author | Zasadzinski, Michal |
dc.contributor.author | Solé Simó, Marc |
dc.contributor.author | Brandon, Alvaro |
dc.contributor.author | Muntés Mulero, Victor |
dc.contributor.author | Carrera Pérez, David |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors |
dc.date.accessioned | 2018-12-21T10:26:47Z |
dc.date.available | 2018-12-21T10:26:47Z |
dc.date.issued | 2018 |
dc.identifier.citation | Zasadzinski, M., Solé, M., Brandon, A., Muntés, V., Carrera, D. Next stop 'NoOps': enabling cross-system diagnostics through graph-based composition of logs and metrics. A: IEEE International Conference on Cluster Computing. "2018 IEEE International Conference on Cluster Computing: 10–13 September 2018, Belfast, United Kingdom: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2018, p. 212-222. |
dc.identifier.isbn | 978-1-5386-8319-4 |
dc.identifier.other | https://arxiv.org/abs/1809.07687 |
dc.identifier.uri | http://hdl.handle.net/2117/126118 |
dc.description.abstract | Performing diagnostics in IT systems is an increasingly complicated task, and it is not doable in satisfactory time by even the most skillful operators. Systems and their architecture change very rapidly in response to business and user demand. Many organizations see value in the maintenance and management model of NoOps that stands for No Operations. One of the implementations of this model is a system that is maintained automatically without any human intervention. The path to NoOps involves not only precise and fast diagnostics but also reusing as much knowledge as possible after the system is reconfigured or changed. The biggest challenge is to leverage knowledge on one IT system and reuse this knowledge for diagnostics of another, different system. We propose a framework of weighted graphs which can transfer knowledge, and perform high-quality diagnostics of IT systems. We encode all possible data in a graph representation of a system state and automatically calculate weights of these graphs. Then, thanks to the evaluation of similarity between graphs, we transfer knowledge about failures from one system to another and use it for diagnostics. We successfully evaluate the proposed approach on Spark, Hadoop, Kafka and Cassandra systems. |
dc.format.extent | 11 p. |
dc.language.iso | eng |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors |
dc.subject.lcsh | Knowledge management |
dc.subject.lcsh | Big data |
dc.subject.lcsh | Graph theory |
dc.subject.other | Graphs |
dc.subject.other | Similarity |
dc.subject.other | Diagnostics |
dc.subject.other | Root cause classification |
dc.subject.other | Logs |
dc.subject.other | NoOps |
dc.title | Next stop 'NoOps': enabling cross-system diagnostics through graph-based composition of logs and metrics |
dc.type | Conference report |
dc.subject.lemac | Gestió del coneixement |
dc.subject.lemac | Macrodades |
dc.subject.lemac | Grafs, Teoria de |
dc.contributor.group | Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions |
dc.identifier.doi | 10.1109/CLUSTER.2018.00039 |
dc.description.peerreviewed | Peer Reviewed |
dc.relation.publisherversion | https://ieeexplore.ieee.org/document/8514882 |
dc.rights.access | Open Access |
local.identifier.drac | 23551363 |
dc.description.version | Postprint (author's final draft) |
local.citation.author | Zasadzinski, M.; Solé, M.; Brandon, A.; Muntés, V.; Carrera, D. |
local.citation.contributor | IEEE International Conference on Cluster Computing |
local.citation.publicationName | 2018 IEEE International Conference on Cluster Computing: 10–13 September 2018, Belfast, United Kingdom: proceedings |
local.citation.startingPage | 212 |
local.citation.endingPage | 222 |