    • Dynamic configuration of partitioning in Spark applications

      Gounaris, Anastasios; Kougka, Georgia; Tous Liesa, Rubén; Tripiana, Carlos; Torres Viñals, Jordi (2017-07-01)
      Article
      Open Access
      Spark has become one of the main options for large-scale analytics running on top of shared-nothing clusters. This work aims to make a deep dive into the parallelism configuration and shed light on the behavior of parallel ...
    • Scientific Big Data Visualization: a Coupled Tools Approach 

      Artigues, Antoni; Cucchietti, Fernando; Tripiana, Carlos; Vicente, David; Calmet, Hadrien; Marín, Guillermo; Houzeaux, Guillaume; Vázquez, Mariano (South Ural State University (Chelyabinsk, Russia), 2015-02)
      Article
      Open Access
      We designed and implemented a parallel visualisation system for the analysis of large scale time-dependent particle type data. The particular challenge we address is how to analyse a high performance computation style ...
    • Spark deployment and performance evaluation on the MareNostrum supercomputer 

      Tous Liesa, Rubén; Gounaris, Anastasios; Tripiana, Carlos; Torres Viñals, Jordi; Girona Turell, Sergi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Becerra Fontal, Yolanda; Carrera Pérez, David; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Conference report
      Open Access
      In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to ...