Multilevel simulation-based co-design of next generation HPC microprocessors
10.1109/PMBS54543.2021.00008
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/364938
Tipus de documentText en actes de congrés
Data publicació2021
EditorInstitute of Electrical and Electronics Engineers (IEEE)
Condicions d'accésAccés obert
Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i
industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva
reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets
Abstract
This paper demonstrates the combined use of three simulation tools in support of a co-design methodology for an HPC-focused System-on-a-Chip (SoC) design. The simulation tools make different trade-offs between simulation speed, accuracy and model abstraction level, and are shown to be complementary. We apply the MUSA trace-based simulator for the initial sizing of vector register length, system-level cache (SLC) size and memory bandwidth. It has proven to be very efficient at pruning the design space, as its models enable sufficient accuracy without having to resort to highly detailed simulations. Then we apply gem5, a cycle-accurate micro-architecture simulator, for a more refined analysis of the performance potential of our reference SoC architecture, with models able to capture detailed hardware behavior at the cost of simulation speed. Furthermore, we study the network-on-chip (NoC) topology and IP placements using both gem5 for representative small- to medium-scale configurations and SESAM/VPSim, a transaction-level emulator for larger scale systems with good simulation speed and sufficient architectural details. Overall, we consider several system design concerns, such as processor subsystem sizing and NoC settings. We apply the selected simulation tools, focusing on different levels of abstraction, to study several configurations with various design concerns and evaluate them to guide architectural design and optimization decisions. Performance analysis is carried out with a number of representative benchmarks. The obtained numerical results provide guidance and hints to designers regarding SIMD instruction width, SLC sizing, memory bandwidth as well as the best placement of memory controllers and NoC form factor. Thus, we provide critical insights for efficient design of future HPC microprocessors.
CitacióZaourar, L. [et al.]. Multilevel simulation-based co-design of next generation HPC microprocessors. A: International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems. "Proceedings of PMBS 2021: Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems: held in conjunction with SC21: The International Conference for High Performance Computing, Networking, Storage and Analysis: St. Louis, Missouri, USA, November 14-19, 2021". Institute of Electrical and Electronics Engineers (IEEE), 2021, p. 18-29. ISBN 978-1-6654-1118-9. DOI 10.1109/PMBS54543.2021.00008.
ISBN978-1-6654-1118-9
Versió de l'editorhttps://ieeexplore.ieee.org/document/9652667
Col·leccions
- Doctorat en Arquitectura de Computadors - Ponències/Comunicacions de congressos [294]
- Computer Sciences - Ponències/Comunicacions de congressos [574]
- CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [784]
- Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.955]
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
Multilevel simu ... processors_CameraReady.pdf | 1,985Mb | Visualitza/Obre |