Now showing items 1-6 of 6

    • A survey on the distributed computing stack 

      Ramón Cortés, Cristian; Alvarez Vecino, Pol; Lordan Gomis, Francesc; Álvarez Cid-Fuentes, Javier; Ejarque Artigas, Jorge; Badia Sala, Rosa Maria (Elsevier, 2021-11)
      Restricted access - publisher's policy
      In this paper, we review the background and the state of the art of the Distributed Computing software stack. We aim to provide the readers with a comprehensive overview of this area by supplying a detailed big-picture of ...
    • DDS: integrating data analytics transformations in task-based workflows [version 1; peer review: 1 approved, 2 approved with reservations] 

      Mammadli, Nihad; Ejarque Artigas, Jorge; Álvarez Cid-Fuentes, Javier; Badia Sala, Rosa Maria (2022-05-25)
      Open Access
      High-performance data analytics (HPDA) is a current trend in e-science research that aims to integrate traditional HPC with recent data analytic frameworks. Most of the work done in this field has focused on improving data ...
    • Efficient development of high performance data analytics in Python 

      Álvarez Cid-Fuentes, Javier; Alvarez, Pol; Amela Milian, Ramon; Ishii, Kuninori; Morizawa, Rafael K.; Badia Sala, Rosa Maria (Elsevier, 2020-10)
      Open Access
      Our society is generating an increasing amount of data at an unprecedented scale, variety, and speed. This also applies to numerous research areas, such as genomics, high energy physics, and astronomy, for which large-scale ...
    • Hunting for open clusters in Gaia DR2: 582 new open clusters in the galactic disc 

      Castro Ginard, Alfred; Jordi Nebot, Carme; Luri Carrasco, Xavier; Álvarez Cid-Fuentes, Javier; Casamiquela, Laia; Anders, Friedrich; Cantat Gaudin, Tristan; Monguió Montells, Maria; Balaguer Núñez, Lola; Solà Martinell, Salvi; Badia Sala, Rosa Maria (EDP Sciences, 2020-03-04)
      Open Access
      Context. Open clusters are key targets for studies of Galaxy structure and evolution, and stellar physics. Since the Gaia data release 2 (DR2), the discovery of undetected clusters has shown that previous surveys were ...
    • Managing failures in task-based parallel workflows in distributed computing environments 

      Ejarque, Jorge; Bertran, Marta; Álvarez Cid-Fuentes, Javier; Conejero, Javier; Badia Sala, Rosa Maria (Springer, Cham, 2020)
      Part of book or chapter of book
      Open Access
      Current scientific workflows are large and complex. They normally perform thousands of simulations whose results combined with searching and data analytics algorithms, in order to infer new knowledge, generate a very large ...
    • Workflow environments for advanced cyberinfrastructure platforms 

      Badia Sala, Rosa Maria; Ejarque Artigas, Jorge; Lordan Gomis, Francesc; Lezzi, Daniele; Conejero Bañón, Javier; Álvarez Cid-Fuentes, Javier; Becerra Fontal, Yolanda; Queralt Calafat, Anna (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Open Access
      Progress in science is deeply bound to the effective use of high-performance computing infrastructures and to the efficient extraction of knowledge from vast amounts of data. Such data comes from different sources that ...