Now showing items 1-12 of 784

    • Data prefetching on in-order processors 

      Ortega Carrasco, Cristobal; García Flores, Víctor; Moretó Planas, Miquel; Casas, Marc; Rusitoru, Roxana (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference report
      Open access
      Low-power processors have attracted attention due to their energy-efficiency. A large market, such as the mobile one, relies on these processors for this very reason. Even High Performance Computing (HPC) systems are ...
    • Transparent load balancing of MPI programs using OmpSs-2@Cluster and DLB 

      Aguilar Mena, Jimmy; Ali, Omar Shaaban Ibrahim; López Herrero, Víctor; Garcia Gasulla, Marta; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2022)
      Conference report
      Open access
      Load imbalance is a long-standing source of inefficiency in high performance computing. The situation has only got worse as applications and systems increase in complexity, e.g., adaptive mesh refinement, DVFS, memory ...
    • Automatic aggregation of subtask accesses for nested OpenMP-style tasks 

      Ali, Omar Shaaban Ibrahim; Aguilar Mena, Jimmy; Beltran Querol, Vicenç; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference report
      Open access
      Task-based programming is a high performance and productive model to express parallelism. Tasks encapsulate work to be executed across multiple cores or offloaded to GPUs, FPGAs, other accelerators or other nodes. In order ...
    • An extension of the StarSs programming model for platforms with multiple GPUs 

      Ayguadé Parra, Eduard; Badia Sala, Rosa Maria; Igual Peña, Francisco D.; Labarta Mancho, Jesús José; Mayo Gual, Rafael; Quintana Ortí, Enrique Salvador (Springer, 2009)
      Conference lecture
      Open access
      While general-purpose homogeneous multi-core architectures are becoming ubiquitous, there are clear indications that, for a number of important applications, a better performance/power ratio can be attained using specialized ...
    • Space compression algorithms acceleration on embedded multi-core and GPU platforms 

      Jover Álvarez, Álvaro; Rodríguez Ferrández, Iván; Kosmidis, Leonidas; Steenari, David (Association for Computing Machinery (ACM), 2022)
      Conference lecture
      Open access
      Future space missions will require increased on-board computing power to process and compress massive amounts of data. Consequently, embedded multi-core and GPU platforms are considered, which have been shown beneficial ...
    • Analyzing the performance of hierarchical collective algorithms on ARM-based multicore clusters 

      Utrera Iglesias, Gladys Miriam; Gil, Marisa; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference lecture
      Open access
      MPI is the de facto communication standard library for parallel applications in distributed memory architectures. Collective operations performance is critical in HPC applications as they can become the bottleneck of their ...
    • Tuning dynamic web applications using fine-grain analysis 

      Guitart Fernández, Jordi; Carrera Pérez, David; Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2005)
      Conference report
      Open access
      In this paper we present a methodology to analyze the behavior and performance of Java application servers using a performance analysis framework. This framework considers all levels involved in the application server ...
    • Soporte para el análisis de workloads en el proyecto eNANOS 

      Rodero Castro, Iván; Corbalán González, Julita; Duran González, Alejandro; Labarta Mancho, Jesús José (2005)
      Conference report
      Open access
      The eNANOS project proposes coordinated job scheduling across several levels, from the heterogeneous and dynamic environment of a Grid down to the execution of processes and threads on the CPUs of a computer or a cluster. ...
    • Optimizing NANOS OpenMP for the IBM Cyclops multithreaded architecture 

      Ródenas Picó, David; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Almási, George; Cascaval, Calin; Castaños, José G.; Moreira, Jose E. (Institute of Electrical and Electronics Engineers (IEEE), 2005)
      Conference report
      Open access
      In this paper, we present two approaches to improve the execution of OpenMP applications on the IBM Cyclops multithreaded architecture. The two solutions are independent, and both aim to obtain better performance through ...
    • WAS control center: an autonomic performance-triggered tracing environment for WebSphere 

      Carrera Pérez, David; García, David; Torres Viñals, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2005)
      Conference report
      Open access
      Studying any aspect of an application server with high availability requirements can become a tedious task when a continuous monitoring of the server status is necessary. The creation of performance-driven autonomic systems ...
    • A trace-scaling agent for parallel application tracing 

      Freitag, Fèlix; Caubet Serrabou, Jordi; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2002)
      Conference report
      Open access
      Tracing and performance analysis tools are an important component in the development of high performance applications. Tracing parallel programs with current tracing tools, however, easily leads to large trace files with ...
    • Dynamic load balancing of MPI+OpenMP applications 

      Corbalán González, Julita; Duran González, Alejandro; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2004)
      Conference report
      Open access
      The hybrid programming model MPI+OpenMP is useful for solving load-balancing problems in parallel applications independently of the architecture. Typical approaches to balance parallel applications using two levels ...