Modeling and optimizing NUMA effects and prefetching with machine learning
Cita com:
hdl:2117/192494
Document typeConference report
Defense date2020
PublisherAssociation for Computing Machinery (ACM)
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
Both NUMA thread/data placement and hardware prefetcher configuration have significant impacts on HPC performance. Optimizing both together leads to a large and complex design space that has previously been impractical to explore at runtime. In this work we deliver the performance benefits of optimizing both NUMA thread/data placement and prefetcher configuration at runtime through careful modeling and online profiling. To address the large design space, we propose a prediction model that reduces the amount of input information needed and the complexity of the prediction required. We do so by selecting a subset of performance counters and application configurations that provide the richest profile information as inputs, and by limiting the output predictions to a subset of configurations that cover most of the performance. Our model is robust and can choose near-optimal NUMA+Pre-fetcher configurations for applications from only two profile runs. We further demonstrate how to profile online with low overhead, resulting in a technique that delivers an average of 1.68X performance improvement over a locality-optimized NUMA baseline with all prefetchers enabled.
CitationSánchez Barrera, I. [et al.]. Modeling and optimizing NUMA effects and prefetching with machine learning. A: International Conference on Supercomputing. "Proceedings of the 34th ACM International Conference on Supercomputing (ICS-2020): Barcelona, June 29–July 2, 2020". New York: Association for Computing Machinery (ACM), 2020, article 34, p. 1-13.
ISBN978-1-4503-7983-0
Publisher versionhttps://dl.acm.org/doi/10.1145/3392717.3392765
Collections
- Doctorat en Arquitectura de Computadors - Ponències/Comunicacions de congressos [308]
- Computer Sciences - Ponències/Comunicacions de congressos [597]
- CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [784]
- Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.976]
Files | Description | Size | Format | View |
---|---|---|---|---|
3392717.3392765.pdf | Versió publicada pel l'editor. En accés obert a ACM DL | 1,211Mb | View/Open |