Ara es mostren els items 73-92 de 139

    • LEGaTO: Low-energy, secure, and resilient toolset for heterogeneous computing 

      Salami, Behzad; Parasyris, Konstantinos; Cristal Kestelman, Adrián; Unsal, Osman Sabri; Martorell Bofill, Xavier; Carpenter, Paul Matthew; De la Cruz Martínez, Raul; Bautista Gomez, Leonardo; Jiménez González, Daniel; Álvarez Martínez, Carlos; Nabavilarimi, Seyed Saber; Madonar Soria, Sergi (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Text en actes de congrés
      Accés obert
      The LEGaTO project leverages task-based programming models to provide a software ecosystem for Made in-Europe heterogeneous hardware composed of CPUs, GPUs, FPGAs and dataflow engines. The aim is to attain one order of ...
    • Leveraging OmpSs to exploit hardware accelerators 

      Sainz, Florentino; Mateo Bellido, Sergi; Beltran Querol, Vicenç; Bosque, José L.; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host API used to glue the host ...
    • Lightning talks of EduHPC 2022 

      Qasem, Apan; Anzt, Hartwig; Ayguadé Parra, Eduard; Cahil, Katharine; Canal Corretger, Ramon; Chan, Jany; Fosler-Lussier, Eric; Llosa Espuny, José Francisco; Martorell Bofill, Xavier; Sancho Samsó, María Ribera (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Text en actes de congrés
      Accés obert
      The lightning talks at EduHPC provide an opportunity to share early results and insights on parallel and distributed computing (PDC) education and training efforts. The four lightning talks at EduHPC 2022 cover a range of ...
    • Measuring TCP bandwidth on top of a Gigabit and Myrinet network 

      Costa Prats, Juan José; Bueno Hedo, Javier; Martorell Bofill, Xavier; Cortés, Toni (2009)
      Report de recerca
      Accés obert
      In this article we measure the bandwidth achieved by the TCP protocol on top of a Gigabit and Myrinet network. We have created a synthetic benchmark, consisting on a server and a client that both produce and consume data, ...
    • Metodologí­a para la generación y evaluación automática de hardware específico 

      González, Cecilia; Jiménez González, Daniel; Martorell Bofill, Xavier; Álvarez Martínez, Carlos; Gaydadjiev, Georgi (2009-09)
      Text en actes de congrés
      Accés obert
      En el área de la bioinformática podemos encontrar aplicaciones que suponen un reto para el diseño de nuevas arquitecturas de procesadores en términos de rendimiento, ya que sus características difieren de las de las ...
    • Migration of a generic multi-physics framework to HPC environments 

      Dadvand, Pooyan; Rossi, Riccardo; Gil, Marisa; Martorell Bofill, Xavier; Cotela Dalmau, Jordi; Juanpere, E.; Idelsohn Barg, Sergio Rodolfo; Oñate Ibáñez de Navarra, Eugenio (Elsevier, 2013-07)
      Article
      Accés obert
      Creating a highly parallelizable code is a challenge specially for distributed memory machines (DMMs). Moreover, algorithms and data structures suitable for these platforms can be very different from the ones used in serial ...
    • Migration of a generic multi-physics framework to HPC environments 

      Dadvand, Pooyan; Rossi, Riccardo; Gil, Marisa; Martorell Bofill, Xavier; Cotela Dalmau, Jordi; Juanpere, E.; Idelsohn Barg, Sergio Rodolfo; Oñate Ibáñez de Navarra, Eugenio (2011)
      Text en actes de congrés
      Accés obert
      Creating a highly parallelizable code is a challenge and development for distributed memory machines (DMMs) can be very different form developing a serial code in term of algorithms and structure. For this reason, many ...
    • Modeling multi-board communication in the AXIOM cyber-physical system 

      Giorgi, Roberto; Mazumdar, Somnath; Viola, Stefano; Gai, Paolo; Garzarella, Stefano; Morelli, Bruno; Pnevmatikatos, Dionisis; Theodoropoulos, Dimitris; Alvarez, Carlos; Ayguadé Parra, Eduard; Bueno, Javier; Filgueras Izquierdo, Antonio; Jiménez-González, Daniel; Martorell Bofill, Xavier (2016-12-01)
      Article
      Accés restringit per política de l'editorial
      The main goal of the AXIOM project is to design a small board that could be used as a LEGOTM-style module to build systems with more performance while keeping the programming task simple by using a familiar shared-memory ...
    • MPI+OpenMP tasking scalability for the simulation of the human brain 

      Valero-Lara, Pedro; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2018)
      Text en actes de congrés
      Accés obert
      The simulation of the behavior of the Human Brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt ...
    • NanosCompiler: supporting flexible multilevel parallelism exploitation in OpenMP 

      González Tallada, Marc; Ayguadé Parra, Eduard; Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Navarro, Nacho; Oliver Segura, José (2000-10)
      Article
      Accés restringit per política de l'editorial
      This paper describes the support provided by the NanosCompiler to nested parallelism in OpenMP. The NanosCompiler is a source-to-source parallelizing compiler implemented around a hierarchical internal program representation ...
    • O(n) key–value sort with active compute memory 

      Esmaili Dokht, Pouya; Guiot Cusido, Miquel; Radojkovic, Petar; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Adlard, Jason; Amato, Paolo; Sforzin, Marco (Institute of Electrical and Electronics Engineers (IEEE), 2024-02-29)
      Article
      Accés obert
      We propose the Active Compute Memory (ACM), a near-memory-processing architecture capable of performing key–value sort directly in the DRAM. In the ACM architecture, sort is merely the writing of data into memory with one ...
    • OmpSs@cloudFPGA: An FPGA task-based programming model with message passing 

      Haro Ruiz, Juan Miguel de; Cano, Rubén; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Abel, François; Ringlein, Burkhard; Weiss, Beat (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Text en actes de congrés
      Accés obert
      Nowadays, a new parallel paradigm for energy-efficient heterogeneous hardware infrastructures is required to achieve better performance at a reasonable cost on high-performance computing applications. Under this new paradigm, ...
    • OmpSs@FPGA framework for high performance FPGA computing 

      Haro Ruiz, Juan Miguel de; Bosch Pons, Jaume; Filgueras Izquierdo, Antonio; Vidal, Miquel; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2021-12-01)
      Article
      Accés obert
      This paper presents the new features of the OmpSs@FPGA framework. OmpSs is a data-flow programming model that supports task nesting and dependencies to target asynchronous parallelism and heterogeneity. OmpSs@FPGA is the ...
    • OmpSs@Zynq All-Programmable SoC Ecosystem 

      Filgueras Izquierdo, Antonio; Gil Blasco, Eduard; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Langer, Jan; Noguera Serra, Juan José; Vissers, Kees (Association for Computing Machinery (ACM), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      OmpSs is an OpenMP-like directive-based programming model that includes heterogeneous execution (MIC, GPU, SMP, etc.) and runtime task dependencies management. Indeed, OmpSs has largely influenced the recently appeared ...
    • On the instrumentation of OpenMP and OmpSs Tasking constructs 

      Servat, Harald; Teruel, Xavier; Llort Sánchez, Germán; Duran González, Alejandro; Giménez, J.; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2012)
      Text en actes de congrés
      Accés obert
      Parallelism has become more and more commonplace with the advent of the multicore processors. Although different parallel pro- gramming models have arisen to exploit the computing capabilities of such processors, ...
    • OpenMP extensions for FPGA Accelerators 

      Cabrera, Daniel; Martorell Bofill, Xavier; Gaydadjiev, Georgi; Ayguadé Parra, Eduard; Jiménez González, Daniel (2009-07)
      Text en actes de congrés
      Accés obert
      Reconfigurable computing is one of the paths to explore towards low-power supercomputing. However, programming these reconfigurable devices is not an easy task and still requires significant research and development efforts ...
    • OpenMP tasking analysis for programers 

      Teruel, Xavier; Barton, Christopher; Duran González, Alejandro; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Unnikrishnan, Priya; Zhang, Guansong; Silvera, Raul (2009-11)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      As of 2008, the OpenMP 3.0 standard includes task support allowing programmers to exploit irregular parallelism. Although several compilers are providing support for this new feature there has not been extensive investigation ...
    • OpenMP tasks in IBM XL compilers 

      Teruel, Xavier; Unnikrishnan, Priya; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Silvera, Raul; Zhang, Guansong; Tiotto, Ettore (Association for Computing Machinery (ACM), 2008)
      Comunicació de congrés
      Accés obert
      Tasking is the most significant feature included in the new OpenMP 3.0 standard. It was introduced to handle unstructured parallelism and broaden the range of applications that can be parallelized by OpenMP. This paper ...
    • Optimizing NANOS OpenMP for the IBM Cyclops multithreaded architecture 

      Ródenas Picó, David; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Almási, George; Cascaval, Calin; Castaños, José G.; Moreira, Jose E. (Institute of Electrical and Electronics Engineers (IEEE), 2005)
      Text en actes de congrés
      Accés obert
      In this paper, we present two approaches to improve the execution of OpenMP applications on the IBM Cyclops multithreaded architecture. Both solutions are independent and they are focused to obtain better performance through ...
    • Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL 

      Ferrer, Roger; Planas Carbonell, Judit; Bellens, Pieter; Duran González, Alejandro; González Tallada, Marc; Martorell Bofill, Xavier; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2011)
      Article
      Accés restringit per política de l'editorial
      In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incorporate the use of OpenCL or CUDA kernels. We evaluate the proposal on three different architectures, SMP, Cell/B.E. and ...