Automatic generation of workload profiles using unsupervised learning pipelines
Rights accessOpen Access
European Commission's projectHi-EST - Holistic Integration of Emerging Supercomputing Technologies (EC-H2020-639595)
The complexity of resource usage and power consumption on cloud-based applications makes the understanding of application behavior through expert examination difficult. The difficulty increases when applications are seen as “black boxes”, where only external monitoring can be retrieved. Furthermore, given the different amount of scenarios and applications, automation is required. Here we examine and model application behavior by finding behavior phases. We use Conditional Restricted Boltzmann Machines (CRBM) to model time-series containing resources traces measurements like CPU, Memory and IO. CRBMs can be used to map a given given historic window of trace behaviour into a single vector. This low dimensional and time-aware vector can be passed through clustering methods, from simplistic ones like k-means to more complex ones like those based on Hidden Markov Models (HMM). We use these methods to find phases of similar behaviour in the workloads. Our experimental evaluation shows that the proposed method is able to identify different phases of resource consumption across different workloads. We show that the distinct phases contain specific resource patterns that distinguish them.
CitationBuchaca, D., Bernal, J., Carrera, D. Automatic generation of workload profiles using unsupervised learning pipelines. "IEEE transactions on network and service management", Març 2018, vol. 15, núm. 1, p. 142-155.