Ara es mostren els items 102-121 de 140

    • Real-time GPU-based face detection in HD video sequences 

      Oro, David; Fernández, Carles; Rodriguez Saeta, Javier; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier (2011)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      Modern GPUs have evolved into fully programmable parallel stream multiprocessors. Due to the nature of the graphic workloads, computer vision algorithms are in good position to leverage the computing power of these ...
    • Reducing compiler-inserted instrumentation in unified-parallel-C code generation 

      Alvanos, Michail; Amaral, José Nelson; Tiotto, Ettore; Farreras Esclusa, Montserrat; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Programs written in Partitioned Global Address Space (PGAS) languages can access any location of the entire address space via standard read/write operations. However, the compiler have to create the communication mechanisms ...
    • RePP-C: runtime estimation of performance-power with workload consolidation in CMPs 

      Nishtala, Rajiv; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Text en actes de congrés
      Accés obert
      Configuration of hardware knobs in multicore environments for meeting performance-power demands constitutes a desirable feature in modern data centers. At the same time, high energy efficiency (performance per watt) requires ...
    • REPP-H: runtime estimation of power and performance on heterogeneous data centers 

      Nishtala, Rajiv; Martorell Bofill, Xavier; Petrucci, Vinicius; Mossé, Daniel (2016)
      Text en actes de congrés
      Accés obert
      Modern data centers increasingly demand improved performance with minimal power consumption. Managing the power and performance requirements of the applications is challenging because these data centers, incidentally or ...
    • Resource-aware task scheduling 

      Tillenius, Martin; Larsson, Elisabeth; Badia Sala, Rosa Maria; Martorell Bofill, Xavier (Association for Computing Machinery (ACM), 2013)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Dependency-aware task-based parallel programming models have proven to be successful for developing application software for multicore-based computer architectures. Here we consider the problem of scheduling tasks not ...
    • Running OpenMP application efficiently on an everything-shared SDSM 

      Costa Prats, Juan José; Cortés, Toni; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2006-05)
      Article
      Accés restringit per política de l'editorial
      Traditional software distributed shared memory (SDSM) systems modify the semantics of a real hardware shared memory system by relaxing the coherence semantic and by limiting the memory regions that are actually shared. ...
    • Running OpenMp applications efficiently on an everything-shared SDSM 

      Costa Prats, Juan José; Cortés, Toni; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2004)
      Comunicació de congrés
      Accés obert
      Traditional software distributed shared memory (SDSM) systems modify the semantics of a real hardware shared memory system by relaxing the coherence semantic and by limiting the memory regions that are actually shared. ...
    • Runtime address space computation for SDSM systems 

      Balart Tarzan, Jairo; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2007)
      Article
      Accés obert
      This paper explores the benefits and limitations of using a inspector/executor approach for Software Distributed Shared Memory (SDSM) systems. The role of the inspector is to obtain a description of the address space ...
    • Runtime estimation of performance–power in CMPs under QoS constraints 

      Nishtala, Rajiv; Martorell Bofill, Xavier; Carpenter, Paul Matthew (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      One of the main challenges in data center systems is operating under certain Quality of Service (QoS) while minimizing power consumption. Increasingly, data centers are exploring and adopting heterogeneous server architectures ...
    • Runtime-guided management of scratchpad memories in multicore architectures 

      Álvarez Martí, Lluc; Moretó Planas, Miquel; Casas, Marc; Castillo Villar, Emilio; Martorell Bofill, Xavier; Labarta Mancho, Jesús José; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2015)
      Text en actes de congrés
      Accés obert
      The increasing number of cores and the anticipated level of heterogeneity in upcoming multicore architectures cause important problems in traditional cache hierarchies. A good way to alleviate these problems is to add ...
    • Safe Parallelism: Compiler Analysis Techniques for Ada and OpenMP 

      Royuela Alcázar, Sara; Martorell Bofill, Xavier; Quiñones Moreno, Eduardo; Pinho, Luis Miguel (Springer Verlag, 2018-05-26)
      Comunicació de congrés
      Accés obert
      There is a growing need to support parallel computation in Ada to cope with the performance requirements of the most advanced functionalities of safety-critical systems. In that regard, the use of parallel programming ...
    • Scalability evaluation of a polymorphic register file: a CG case study 

      Ciobanu, Catalin; Martorell Bofill, Xavier; Kuzmanov, Georgi; Ramírez Bellido, Alejandro; Gaydadjiev, Georgi (Springer, 2011)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      We evaluate the scalability of a Polymorphic Register File using the Conjugate Gradient method as a case study. We focus on a heterogeneous multi-processor architecture, taking into consideration critical parameters such ...
    • Scaling irregular array-type reductions in OmpSs 

      Ciesko, Jan; Mateo Bellido, Sergi; Teruel, Xavier; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Badia Sala, Rosa Maria; Labarta Mancho, Jesús José (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      Array-type reductions represent a frequently occurring algorithmic pattern in many scientific applications. A special case occurs if array elements are accessed in a non-linear, often random manner, which makes their ...
    • Simulating next-generation Cyber-physical computing platforms 

      Burgio, Paolo; Álvarez Martínez, Carlos; Ayguadé Parra, Eduard; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Martorell Bofill, Xavier; Navarro, Nacho; Giorgi, Roberto (2015)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In specific domains, such as cyber-physical systems, platforms are quickly evolving to include multiple (many-) cores and programmable logic in a single system-on-chip, while includ- ing interfaces to commodity ...
    • Simulating the behavior of the human brain on GPUS 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Peña, Antonio J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
      Article
      Accés obert
      The simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
    • sLASs: a fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library) 

      Valero Lara, Pedro; Catalán Pallarés, Sandra; Martorell Bofill, Xavier; Usui, Tetsuzo; Labarta Mancho, Jesús José (Elsevier, 2020-04-01)
      Article
      Accés obert
      In this work we have implemented a novel Linear Algebra Library on top of the task-based runtime OmpSs-2. We have used some of the most advanced OmpSs-2 features; weak dependencies and regions, together with the final ...
    • Task-based programming models for heterogeneous recurrent workloads 

      Bosch Pons, Jaume; Vidal, Miquel; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (Springer Nature, 2021)
      Text en actes de congrés
      Accés obert
      This paper proposes the extension of task-based programming models with recurrent workloads concepts. The proposal introduces new clauses in the OmpSs task directive to efficiently model recurrent workloads. The clauses ...
    • Task-based programming with OmpSs and its application 

      Fernandez, Alejandro; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Badia,, R.M.; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2014)
      Text en actes de congrés
      Accés obert
      OmpSs is a task-based programming model that aims to provide portability and flexibility for sequential codes while the performance is achieved by the dynamic exploitation of the parallelism at task level. OmpSs targets ...
    • Task-parallel reductions in OpenMP and OmpSs 

      Ciesko, Jan; Mateo, Sergi; Teruel, Xavier; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Badia,, R.M.; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2014)
      Text en actes de congrés
      Accés obert
      The wide adoption of parallel processing hardware in mainstream computing as well as the raising interest for efficient parallel programming in the developer community increase the demand for parallel programming model ...
    • TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale 

      Agosta, Giovanni; Cattaneo, Daniele; Fornaciari, William; Galimberti, Andrea; Massari, Giuseppe; Reghenzani, Federico; Terraneo, Federico; Filgueras Izquierdo, Antonio; Vidal, Miquel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2021)
      Text en actes de congrés
      Accés obert
      To achieve high performance and high energy efficiency on near-future exascale computing systems, three key technology gaps needs to be bridged. These gaps include: energy efficiency and thermal control; extreme computation ...