Evaluating SLURM simulator with real-machine SLURM and vice versa
Document typeConference report
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessOpen Access
Having a precise and a fast job scheduler model that resembles the real-machine job scheduling software behavior is extremely important in the field of job scheduling. The idea behind SLURM simulator is preserving the original code of the core SLURM functions while allowing for all the advantages of a simulator. Since 2011, SLURM simulator has passed through several iterations of improvements in different research centers. In this work, we present our latest improvements of SLURM simulator and perform the first-ever validation of the simulator on the real machine. In particular, we improved the simulator's performance for about 2.6 times, made the simulator deterministic across several same set-up runs, and improved the simulator's accuracy; its deviation from the real-machine is lowered from previous 12% to at most 1.7%. Finally, we illustrate with several use cases the value of the simulator for job scheduling researchers, SLURM-system administrators, and SLURM developers.
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
CitationJokanovic, A.; D'Amico, M.; Corbalán, J. Evaluating SLURM simulator with real-machine SLURM and vice versa. A: IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems. "Proceedings of PMBS 2018: Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems Held in conjunction with SC18: The International Conference for High Performance Computing, Networking, Storage and Analysis Dallas, Texas, November 11-16, 2018". Institute of Electrical and Electronics Engineers (IEEE), p. 72-82.
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder