Now showing items 1-20 of 273

    • A case study of hybrid dataflow and shared-memory programming models: Dependency-based parallel game engine 

      Gajinov, Vladimir; Eric, Igor; Stojanovic, Saa; Milutinovic, Veljko; Unsal, Osman Sabri; Ayguadé Parra, Eduard; Cristal Kestelman, Adrián (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Conference report
      Restricted access - publisher's policy
      Recently proposed hybrid dataflow and shared memory programming models combine these two underlying models in order to support a wider range of problems naturally. The effectiveness of such hybrid models for parallel ...
    • A CellBE-based HPC application for the analysis of vulnerabilities in cryptographic hash functions 

      Cilardo, Alessandro; Esposito, Luigi; Veniero, Antonio; Mazzeo, Antonino; Beltran Querol, Vicenç; Ayguadé Parra, Eduard (2010)
      Conference report
      Restricted access - publisher's policy
      After some recent breaks presented in the technical literature, it has become of paramount importance to gain a deeper understanding of the robustness and weaknesses of cryptographic hash functions. In particular, in the ...
    • A data flow language to develop high performance computing DSLs 

      Fernandez, Alejandro; Berltran, Vicenç; Mateo, Sergi; Patejko, Thomas; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2014)
      Conference report
      Restricted access - publisher's policy
      Developing complex scientific applications on high performance systems requires both domain knowledge and expertise in parallel and distributed programming models. In addition, modern high performance systems are heterogeneous, ...
    • A directive-based approach to perform persistent checkpoint/restart 

      Maroñas, Marcos; Mateo, Sergi; Beltran Querol, Vicenç; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2017)
      Conference report
      Restricted access - publisher's policy
      Exascale platforms require support for resilience capabilities due to increasing numbers of components and associated error rates. In this paper, we present a new directive-based approach to perform application-level ...
    • A framework for integrating data alignment, distribution, and redistribution in distributed memory multiprocessors 

      García Almiñana, Jordi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2001-04)
      Article
      Restricted access - publisher's policy
      Parallel architectures with physically distributed memory provide a cost-effective scalability to solve many large scale scientific problems. However, these systems are very difficult to program and tune. In these systems, ...
    • A hardware runtime for task-based programming models 

      Tan, Xubin; Bosch, Jaume; Álvarez, Carlos; Jiménez González, Daniel; Ayguadé Parra, Eduard; Valero Cortés, Mateo (2019-09-01)
      Article
      Open Access
      Task-based programming models such as OpenMP 5.0 and OmpSs are simple to use and powerful enough to exploit task parallelism of applications over multicore, manycore and heterogeneous systems. However, their software-only ...
    • A hybrid web server architecture for secure e-business web applications 

      Beltran Querol, Vicenç; Carrera Pérez, David; Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2005-09)
      Article
      Restricted access - publisher's policy
      Nowadays the success of many e-commerce applications, such as on-line banking, depends on their reliability, robustness and security. Designing a web server architecture that keeps these properties under high loads is a ...
    • A module-based cell processor simulator 

      Cabarcas Jaramillo, Felipe; Rico Carro, Alejandro; Rodenas, David; Martorell Bofill, Xavier; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (European Network of Excellence on High Performance and Embedded Architecture and Compilation (HiPEAC), 2006)
      Conference lecture
      Open Access
      An interesting design alternative to replication-based chip multiprocessors is to create heterogeneous chip multiprocessors composed of several different cores, with one or more of them running the operating system and ...
    • A novel asynchronous software cache implementation for the Cell-BE processor 

      Balart, J; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Sura, Z; Chen, T; Zhang, T; O'Brien, Kevin; O'Brien, Kathryn (2008-10)
      Article
      Restricted access - publisher's policy
      This paper describes the implementation of a runtime library for asynchronous communication in the Cell BE processor. The runtime library implementation provides with several services that allow the compiler to generate ...
    • A proposal for error handling in OpenMP 

      Duran González, Alejandro; Ferrer, Roger; Costa Prats, Juan José; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2006-06)
      Article
      Restricted access - publisher's policy
      OpenMP has been focused in performance applied to numerical applications, but when we try to move this focus to other kind of applications, like Web servers, we detect one important lack. In these applications, performance ...
    • A proposal for task parallelism in OpenMP 

      Ayguadé Parra, Eduard; Nawal, Copty; Duran González, Alejandro; Hoeflinger, Jay; Yuan, Lin; Massaioli, Federico; Ernesto, Su; Unnikrishnan, Priya; Guansong, Zhang (2007-06)
      Article
      Restricted access - publisher's policy
      This paper presents a novel proposal to define task parallelism in OpenMP. Task parallelism has been lacking in the OpenMP language for a number of years already. As we show, this makes certain kinds of applications difficult ...
    • A proposal to extend the OpenMP tasking model with dependent tasks 

      Duran Gonzalez, Alejandro; Ferrer, Roger; Ayguadé Parra, Eduard; Badia Sala, Rosa Maria; Labarta Mancho, Jesús José (2009-06)
      Article
      Restricted access - publisher's policy
      Tasking in OpenMP 3.0 has been conceived to handle the dynamic generation of unstructured parallelism. New directives have been added allowing the user to identify units of independent work (tasks) and to define points ...
    • A streaming machine description and programming model 

      Carpenter, Paul; Ródenas Picó, David; Martorell Bofill, Xavier; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (2007-07)
      Article
      Restricted access - publisher's policy
      In this paper we present the initial development of a streaming environment based on a programming model and machine description. The stream programming model consists of an extension to the C language and it’s translation ...
    • A Survey on Performance Management for Internet Applications 

      Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2010-01-01)
      Article
      Restricted access - publisher's policy
      Internet applications have become indispensable for many business and personal processes, turning the performance of these applications into a key issue. For this reason, recent research has comprehensively explored ...
    • A template system for the efficient compilation of domain abstractions onto reconfigurable computers 

      Shafiq, Muhammad; Pericàs Gleim, Miquel; Ayguadé Parra, Eduard (2011)
      Conference report
      Restricted access - publisher's policy
      Past research has addressed the issue of using FPGAs as accelerators for HPC systems. However, writing low level code for an efficient, portable and scalable architecture altogether has been always a ...
    • A transparent runtime data distribution engine for OpenMP 

      Nikolopoulos, Dimitrios; Papatheodorou, Theodore; Polychronopoulos, C D; Labarta Mancho, Jesús José; Ayguadé Parra, Eduard (2001-07)
      Article
      Restricted access - publisher's policy
      This paper makes two important contributions. First, the paper investigates the performance implications of data placement in OpenMP programs running on modern NUMA multiprocessors. Data locality and minimization of the ...
    • A visual embedding for the unsupervised extraction of abstract semantics 

      García Gasulla, Dario; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Béjar Alonso, Javier; Cortés García, Claudio Ulises; Suzumura, Toyotaro; Chen, R (2017-05-01)
      Article
      Open Access
      Vector-space word representations obtained from neural network models have been shown to enable semantic operations based on vector arithmetic. In this paper, we explore the existence of similar information on vector ...
    • Accelerating software memory compression on the Cell/B.E. 

      Beltran Querol, Vicenç; Martorell Bofill, Xavier; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2008)
      Conference report
      Restricted access - publisher's policy
      The idea of transparently compressing and decompressing the content of main memory to virtually enlarge their capacity has been previously proposed and studied in the literature. The rationale behind this idea lies in the ...
    • Access to streams in multiprocessor systems 

      Valero Cortés, Mateo; Peirón Guardia, Montse; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 1993)
      Conference report
      Open Access
      When accessing streams in vector multiprocessor machines, degradation in the interconnection network and conflicts in the memory modules are the factors that reduce the efficiency of the system. In this paper, we present ...
    • Access to vectors in multi-module memories 

      Valero Cortés, Mateo; Peiron Guàrdia, Montse; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 1994)
      Conference report
      Open Access
      The poor bandwidth obtained from memory when conflicts arise in the modules or in the interconnection network degrades the performance of computers. Address transformation schemes, such as interleaving, skewing and linear ...