Ara es mostren els items 26-45 de 51

    • Improving the integration of task nesting and dependencies in OpenMP 

      Pérez, Josep M.; Beltran Querol, Vicenç; Labarta Mancho, Jesús José; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Text en actes de congrés
      Accés obert
      The tasking model of OpenMP 4.0 supports both nesting and the definition of dependences between sibling tasks. A natural way to parallelize many codes with tasks is to first taskify the high-level functions and then to ...
    • Improving the interoperability between MPI and task-based programming models 

      Sala Penadés, Kevin; Bellón, Jorge; Farré, Pau; Teruel, Xavier; Pérez, Josep M.; Peña, Antonio J.; Holmes, Daniel; Beltran Querol, Vicenç; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2018)
      Text en actes de congrés
      Accés obert
      In this paper we propose an API to pause and resume task execution depending on external events. We leverage this generic API to improve the interoperability between MPI synchronous communication primitives and tasks. When ...
    • Improving the performance of classical linear algebra iterative methods via hybrid parallelism 

      Martínez Ferrer, Pedro José; Arslan, Tufan; Beltran Querol, Vicenç (Elsevier, 2023-09)
      Article
      Accés restringit per política de l'editorial
      We propose fork-join and task-based hybrid implementations of four classical linear algebra iterative methods (Jacobi, Gauss–Seidel, conjugate gradient and biconjugate gradient stabilized) on CPUs as well as variations of ...
    • Improving web server efficiency on commodity hardware 

      Beltran Querol, Vicenç (Universitat Politècnica de Catalunya, 2008-10-03)
      Tesi
      Accés obert
      El ràpid creixement de la Web requereix una gran quantitat de recursos computacionals que han de ser utilitzats eficientment. Avui en dia, els servidors basats en hardware estendard son les plataformes preferides per ...
    • Improving web server performance through main memory compression 

      Beltran Querol, Vicenç; Torres Viñals, Jordi; Ayguadé Parra, Eduard (IEEE Computer Society, 2008)
      Comunicació de congrés
      Accés obert
      Current web servers are highly multithreaded applications whose scalability benefits from the current multicore/multiprocessor trend. However, some workloads can not capitalize on this because their performance is limited by ...
    • Introducing the Task-Aware Storage I/O (TASIO) Library 

      Roca Nonell, Aleix; Beltran Querol, Vicenç; Mateo Bellido, Sergi (Springer, 2019-08-09)
      Capítol de llibre
      Accés obert
      Task-based programming models are excellent tools to parallelize and seamlessly load balance an application workload. However, the integration of I/O intensive applications and task-based programming models is lacking. ...
    • Leveraging OmpSs to exploit hardware accelerators 

      Sainz, Florentino; Mateo Bellido, Sergi; Beltran Querol, Vicenç; Bosque, José L.; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host API used to glue the host ...
    • Mitigating the NUMA effect on task-based runtime systems 

      Maroñas Bravo, Marcos; Navarro Muñoz, Antoni; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Springer Nature, 2023-09)
      Article
      Accés obert
      Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processors usually expose a single shared address space. However, due to hardware restrictions, they adopt a NUMA approach, where ...
    • OmpSs-2@Cluster: Distributed memory execution of nested OpenMP-style tasks 

      Aguilar Mena, Jimmy; Ali, Omar Shaaban Ibrahim; Beltran Querol, Vicenç; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer Nature, 2022)
      Text en actes de congrés
      Accés obert
      State-of-the-art programming approaches generally have a strict division between intra-node shared memory parallelism and inter-node MPI communication. Tasking with dependencies offers a clean, dependable abstraction for ...
    • On the adequacy of lightweight thread approaches for high-level parallel programming models 

      Castelló, Adrián; Mayo Gual, Rafael; Sala Penadés, Kevin; Beltran Querol, Vicenç; Balaji, Pavan; Peña, Antonio J. (Elsevier, 2018-07)
      Article
      Accés obert
      High-level parallel programming models (PMs) are becoming crucial in order to extract the computational power of current on-node multi-threaded parallelism. The most popular PMs, such as OpenMP or OmpSs, are directive-based: ...
    • On the applicability of PEBS based online memory access tracking for heterogeneous memory management at scale 

      Roca Nonell, Aleix; Gerofi, Balazs; Bautista-Gomez, Leonardo; Martinet, Dominique; Beltran Querol, Vicenç; Ishikawa, Yutaka (Association for Computing Machinery (ACM), 2018-11)
      Comunicació de congrés
      Accés obert
      Operating systems have historically had to manage only a single type of memory device. The imminent availability of heterogeneous memory devices based on emerging memory technologies confronts the classic single memory ...
    • Optimizing iterative data-flow scientific applications using directed cyclic graphs 

      Álvarez Robert, David; Beltran Querol, Vicenç (Institute of Electrical and Electronics Engineers (IEEE), 2023-04-24)
      Article
      Accés obert
      Data-flow programming models have become a popular choice for writing parallel applications as an alternative to traditional work-sharing parallelism. They are better suited to write applications with irregular parallelism ...
    • Optimizing resource utilization with software-based temporal multi-threading (sTMT) 

      Beltran Querol, Vicenç; Ayguadé Parra, Eduard (2013)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Compute and memory access units are two of the most important resources to appropriately manage in current and future multi–/many–core architectures. Memory bandwidth and computational capacity need to be exploited in ...
    • Performance impact of the grid middleware 

      Carrera Pérez, David; Guitart Fernández, Jordi; Beltran Querol, Vicenç; Torres Viñals, Jordi; Ayguadé Parra, Eduard (American Scientific Publishers, 2006-01)
      Capítol de llibre
      Accés obert
      The Open Grid Services Architecture (OGSA) defines a new vision of the Grid based on the use of Web Services (Grid Services). The standard interfaces, behaviors and schemes that are consistent with the OGSA specification ...
    • Saiph, a domain specific language for computational fluid dynamics simulations 

      Macià, Sandra; Beltran Querol, Vicenç; Mira Martínez, Daniel; Mateo, Sergi (Barcelona Supercomputing Center, 2017-05-04)
      Accés obert
      Nowadays, High-Performance Computing (HPC) is assuming an increasingly central role in scientific research while computer architectures are becoming more and more hetero-geneous and using different parallel programming ...
    • Saiph: towards a DSL for high-performance computational fluid dynamics 

      Macià, Sandra; Mateo, Sergi; Martínez-Ferrer, Pedro J.; Beltran Querol, Vicenç; Mira Martínez, Daniel; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2018)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Nowadays high-performance computing is taking an increasingly central role in scientific research while computer architectures are becoming more heterogeneous and complex with different parallel programming models and ...
    • Scaling irregular array-type reductions in OmpSs 

      Ciesko, Jan; Mateo Bellido, Sergi; Teruel, Xavier; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Badia Sala, Rosa Maria; Labarta Mancho, Jesús José (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      Array-type reductions represent a frequently occurring algorithmic pattern in many scientific applications. A special case occurs if array elements are accessed in a non-linear, often random manner, which makes their ...
    • Seamless optimization of the GEMM kernel for task-based programming models 

      Lorenzon, Arthur F.; Marques, Sandro M. V. N.; Navarro Muñoz, Antoni; Beltran Querol, Vicenç (Association for Computing Machinery (ACM), 2022)
      Text en actes de congrés
      Accés obert
      The general matrix-matrix multiplication (GEMM) kernel is a fundamental building block of many scientific applications. Many libraries such as Intel MKL and BLIS provide highly optimized sequential and parallel versions ...
    • Simulating complex systems with a low-detail model 

      Nou Castell, Ramon; Guitart Fernández, Jordi; Beltran Querol, Vicenç; Carrera Pérez, David; Montero Mercadé, Lídia; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2005-09)
      Text en actes de congrés
      Accés obert
      In this paper we show how modeling and simulating a complex system such as a web-server can help to evaluate di erent metrics and proposals to improve the performance without necessity of using a real system. Many times ...
    • Speeding up distributed MapReduce applications using hardware accelerators 

      Becerra Fontal, Yolanda; Beltran Querol, Vicenç; Carrera Pérez, David; González Tallada, Marc; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2009-09)
      Text en actes de congrés
      Accés obert
      In an attempt to increase the performance/cost ratio, large compute clusters are becoming heterogeneous at multiple levels: from asymmetric processors, to different system architectures, operating systems and networks. ...