Ir al contenido (pulsa Retorno)

Universitat Politècnica de Catalunya

    • Català
    • Castellano
    • English
    • LoginRegisterLog in (no UPC users)
  • mailContact Us
  • world English 
    • Català
    • Castellano
    • English
  • userLogin   
      LoginRegisterLog in (no UPC users)

UPCommons. Global access to UPC knowledge

Banner header
59.551 UPC E-Prints
You are here:
View Item 
  •   DSpace Home
  • E-prints
  • Grups de recerca
  • CAP - Grup de Computació d'Altes Prestacions
  • Ponències/Comunicacions de congressos
  • View Item
  •   DSpace Home
  • E-prints
  • Grups de recerca
  • CAP - Grup de Computació d'Altes Prestacions
  • Ponències/Comunicacions de congressos
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Performance characterization of spark workloads on shared NUMA Systems

Thumbnail
View/Open
Performance Characterization of Spark Workloads.pdf (256,6Kb)
Share:
 
 
10.1109/BigDataService.2018.00015
 
  View Usage Statistics
Cita com:
hdl:2117/123195

Show full item record
Baig, Shuja-ur-Rehman
Amaral, Marcelo
Polo Cantero, JoséMés informacióMés informacióMés informació
Carrera Pérez, DavidMés informació
Document typeConference report
Defense date2018
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder
ProjectCOMPUTACION DE ALTAS PRESTACIONES VII (MINECO-TIN2015-65316-P)
Hi-EST - Holistic Integration of Emerging Supercomputing Technologies (EC-H2020-639595)
Abstract
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, there is also a growing need to optimize them for modern processors. Spark has gained momentum over the last few years among companies looking for high performance solutions that can scale out across different cluster sizes. At the same time, modern processors can be connected to large amounts of physical memory, in the range of up to few terabytes. This opens an enormous range of opportunities for runtimes and applications that aim to improve their performance by leveraging low latencies and high bandwidth provided by RAM. The result is that there are several examples today of applications that have started pushing the in-memory computing paradigm to accelerate tasks. To deliver such a large physical memory capacity, hardware vendors have leveraged Non-Uniform Memory Architectures (NUMA). This paper explores how Spark-based workloads are impacted by the effects of NUMA-placement decisions, how different Spark configurations result in changes in delivered performance, how the characteristics of the applications can be used to predict workload collocation conflicts, and how to improve performance by collocating workloads in scale-up nodes. We explore several workloads run on top of the IBM Power8 processor, and provide manual strategies that can leverage performance improvements up to 40% on Spark workloads when using smart processor-pinning and workload collocation strategies.
CitationBaig, S., Amaral, M., Polo, J., Carrera, D. Performance characterization of spark workloads on shared NUMA Systems. A: International Conference on Big Data Computing Service and Applications. "2018 IEEE Fourth International Conference on Big Data Computing Service and Applications (BigDataService 2018): Bamberg, Germany: 26-29 March 2018". Institute of Electrical and Electronics Engineers (IEEE), 2018, p. 41-48. 
URIhttp://hdl.handle.net/2117/123195
DOI10.1109/BigDataService.2018.00015
ISBN9781538651209
Publisher versionhttps://ieeexplore.ieee.org/document/8405690
Collections
  • CAP - Grup de Computació d'Altes Prestacions - Ponències/Comunicacions de congressos [782]
  • Departament d'Arquitectura de Computadors - Ponències/Comunicacions de congressos [1.841]
  • Departament d'Enginyeria Electrònica - Ponències/Comunicacions de congressos [1.637]
  • GRUP ISI - Grup d'Instrumentació, sensors i interfícies - Ponències/Comunicacions de congressos [70]
Share:
 
  View Usage Statistics

Show full item record

FilesDescriptionSizeFormatView
Performance Cha ... ion of Spark Workloads.pdf256,6KbPDFView/Open

Browse

This CollectionBy Issue DateAuthorsOther contributionsTitlesSubjectsThis repositoryCommunities & CollectionsBy Issue DateAuthorsOther contributionsTitlesSubjects

© UPC Obrir en finestra nova . Servei de Biblioteques, Publicacions i Arxius

info.biblioteques@upc.edu

  • About This Repository
  • Contact Us
  • Send Feedback
  • Cookies policy
  • Inici de la pàgina