Ara es mostren els items 165-184 de 217

    • Scalability prediction for fundamental performance factors 

      Rosas Mendoza, Claudia; Giménez Lucas, Judit; Labarta Mancho, Jesús José (2014-06)
      Article
      Accés obert
      Inferring the expected performance for parallel applications is getting harder than ever; applications need to be modeled for restricted or nonexistent systems and performance analysts are required to identify and extrapolate ...
    • Scaling irregular array-type reductions in OmpSs 

      Ciesko, Jan; Mateo Bellido, Sergi; Teruel, Xavier; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Badia Sala, Rosa Maria; Labarta Mancho, Jesús José (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      Array-type reductions represent a frequently occurring algorithmic pattern in many scientific applications. A special case occurs if array elements are accessed in a non-linear, often random manner, which makes their ...
    • Scaling non-regular shared-memory codes by reusing custom loop schedules 

      Nikolopoulos, Dimitrios; Artiaga Amouroux, Ernest; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2003-06)
      Article
      Accés obert
      In this paper we explore the idea of customizing and reusing loop schedules to improve the scalability of non-regular numerical codes in shared-memory architectures with non-uniform memory access latency. The main objective ...
    • Scheduler-activated dynamic page migration for multiprogrammed DSM multiprocessors 

      Nikolopoulos, Dimitrios S.; Polychronopoulos, Constantine D.; Papatheodorou, Theodore S.; Labarta Mancho, Jesús José; Ayguadé Parra, Eduard (2002-06)
      Article
      Accés restringit per política de l'editorial
      The performance of multiprogrammed shared-memory multiprocessors suffers often from scheduler interventions that neglect data locality. On cache-coherent distributed shared-memory (DSM) multiprocessors, such scheduler ...
    • Self-adaptive OmpSs tasks in heterogeneous environments 

      Planas Carbonell, Judit; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2013)
      Text en actes de congrés
      Accés obert
      As new heterogeneous systems and hardware accelerators appear, high performance computers can reach a higher level of computational power. Nevertheless, this does not come for free: the more heterogeneity the system presents, ...
    • Shared queues in buffered multistage interconnection networks 

      Domingo Pascual, Jordi; Labarta Mancho, Jesús José; Casals, Olga; Llaberia Griñó, José M.; Valero Cortés, Mateo (1988-01)
      Report de recerca
      Accés obert
      This paper analyses the behaviour of a normal buffered delta network and as a result proposes the use of a shared queue instead of the two queues of the usual switching elements. The performance of the networks with ...
    • Short reasons for long vectors in HPC CPUs: a study based on RISC-V 

      Vizcaíno Serrano, Pablo; Ieronymakis, Georgios; Dimou, Nikolaus; Papaefstathiou, Vassilis; Labarta Mancho, Jesús José; Mantovani, Filippo (Association for Computing Machinery (ACM), 2023)
      Text en actes de congrés
      Accés obert
      For years, SIMD/vector units have enhanced the capabilities of modern CPUs in High-Performance Computing (HPC) and mobile technology. Typical commercially-available SIMD units process up to 8 double-precision elements with ...
    • Simulating the behavior of the human brain on GPUS 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Peña, Antonio J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
      Article
      Accés obert
      The simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
    • Simulating whole supercomputer applications 

      Gonzalez, Juan; Casas, Marc; Moretó Planas, Miquel; Giménez, Judit; Labarta Mancho, Jesús José; Valero Cortés, Mateo (2009)
      Report de recerca
      Accés obert
      Architecture simulation tools are extremely useful not only to predict the performance of future system designs, but also to analyze and improve the performance of software running on well know architectures. However, since ...
    • Simulating whole supercomputer applications 

      González García, Juan; Casas, Marc; Giménez Lucas, Judit; Moretó Planas, Miquel; Ramírez Bellido, Alejandro; Labarta Mancho, Jesús José; Valero Cortés, Mateo (2011-06)
      Article
      Accés restringit per política de l'editorial
      Detailed simulations of large scale message-passing interface parallel applications are extremely time consuming and resource intensive. A new methodology that combines signal processing and data mining techniques plus a ...
    • Simulation environment for studying overlap of communication and computation 

      Subotic, Vladimir; Labarta Mancho, Jesús José; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2010)
      Text en actes de congrés
      Accés obert
      Overlapping communication and computation allows both processors and network to be utilized concurrently and leads to two clear benefits: overall speedup and a reduction in network performance requirements. Still, it remains ...
    • Size & shape matters: The need of HPC benchmarks of high resolution image training for deep learning 

      Parés Pont, Ferran; Megias Montsesinos, Pedro; Garcia Gasulla, Dario; Garcia Gasulla, Marta; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2021-03)
      Article
      Accés obert
      One of the purposes of HPC benchmarks is to identify limitations and bottlenecks in hardware. This functionality is particularly influential when assessing performance on emerging tasks, the nature and requirements of which ...
    • sLASs: a fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library) 

      Valero Lara, Pedro; Catalán Pallarés, Sandra; Martorell Bofill, Xavier; Usui, Tetsuzo; Labarta Mancho, Jesús José (Elsevier, 2020-04-01)
      Article
      Accés obert
      In this work we have implemented a novel Linear Algebra Library on top of the task-based runtime OmpSs-2. We have used some of the most advanced OmpSs-2 features; weak dependencies and regions, together with the final ...
    • Soporte para el análisis de workloads en el proyecto eNANOS 

      Rodero Castro, Iván; Corbalán González, Julita; Duran González, Alejandro; Labarta Mancho, Jesús José (2005)
      Text en actes de congrés
      Accés obert
      El proyecto eNANOS plantea la planificación coordinada de trabajos entre varios niveles, desde el entorno heterogéneo y dinámico de un Grid hasta la ejecución de procesos y threads en las CPU’s de un computador o un cluster. ...
    • Spark deployment and performance evaluation on the MareNostrum supercomputer 

      Tous Liesa, Rubén; Gounaris, Anastasios; Tripiana, Carlos; Torres Viñals, Jordi; Girona Turell, Sergi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Becerra Fontal, Yolanda; Carrera Pérez, David; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Text en actes de congrés
      Accés obert
      In this paper we present a framework to enable data-intensive Spark workloads on MareNostrum, a petascale supercomputer designed mainly for compute-intensive applications. As far as we know, this is the first attempt to ...
    • Spatial support vector regression to detect silent errors in the exascale era 

      Subasi, Omer; Di, Sheng; Bautista Gomez, Leonardo; Balaprakash, Prasanna; Unsal, Osman Sabri; Labarta Mancho, Jesús José; Cristal Kestelman, Adrián; Cappello, Franck (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Text en actes de congrés
      Accés obert
      As the exascale era approaches, the increasing capacity of high-performance computing (HPC) systems with targeted power and energy budget goals introduces significant challenges in reliability. Silent data corruptions ...
    • SSMART: smart scheduling of multi-architecture tasks on heterogeneous systems 

      Planas, Judit; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2015)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      High-performance computers can reach higher levels of computational power when combined with accelerators. Nevertheless, the more heterogeneity the system presents, the more complex becomes the programming task in terms ...
    • Studying performance changes with tracking analysis 

      Llort Sánchez, Germán; Servat, Harald; Gonzalez Garcia, Juan; Giménez Lucas, Judit; Labarta Mancho, Jesús José (Springer, 2015)
      Capítol de llibre
      Accés obert
      Numerical simulation and modelling using High Performance Computing has evolved into an established technique in academic and industrial research. At the same time, the High Performance Computing infrastructure is becoming ...
    • Studying the impact of the Full-Network embedding on multimodal pipelines 

      Vilalta Arias, Armand; Garcia-Gasulla, Dario; Pares, Ferran; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Moya-Sánchez, Ulises; Cortés García, Claudio Ulises (IOS Press, 2019-09-26)
      Article
      Accés obert
      The current state of the art for image annotation and image retrieval tasks is obtained through deep neural network multimodal pipelines, which combine an image representation and a text representation into a shared embedding ...
    • Symmetric rank-k update on clusters of multicore processors with SMPSs 

      Badia Sala, Rosa Maria; Labarta Mancho, Jesús José; Marjanovic, Vladimir; Martín Huertas, Alberto Francisco; Mayo, Rafael; Quintana Ortí, Enrique Salvador; Reyes, Ruymán (IOS Press, 2011)
      Comunicació de congrés
      Accés obert
      We investigate the use of the SMPSs programming model to leverage task parallelism in the execution of a message-pas sing implementation of the symmetric rank- k update on clusters equipped with multicore processors. Our ...