Now showing items 1-3 of 3

    • A methodology for Spark parameter tuning 

      Gounaris, Anastasios; Torres Viñals, Jordi (2017-05-19)
      Article
      Open Access
      Spark has been established as an attractive platform for big data analysis, since it manages to hide most of the complexities related to parallelism, fault tolerance and cluster setting from developers. However, this comes ...
    • Dynamic configuration of partitioning in spark applications 

      Gounaris, Anastasios; Kougka, Georgia; Tous Liesa, Rubén; Tripiana, Carlos; Torres Viñals, Jordi (2017-07-01)
      Article
      Open Access
      Spark has become one of the main options for large-scale analytics running on top of shared-nothing clusters. This work aims to make a deep dive into the parallelism configuration and shed light on the behavior of parallel ...
    • Spark deployment and performance evaluation on the MareNostrum supercomputer 

      Tous Liesa, Rubén; Gounaris, Anastasios; Tripiana, Carlos; Torres Viñals, Jordi; Girona Turell, Sergi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Becerra Fontal, Yolanda; Carrera Pérez, David; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Conference report
      Open Access
      In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to ...