• Code generation for the openmp 4.0 accelerator model onto ompss 

      Ozen, Guray (Universitat Politècnica de Catalunya, 2014-09-09)
      Projecte Final de Màster Oficial
      Accés obert
      I present a MACC compiler which is partial implementation of this specification in the OmpSs programming model in order to show code generation for hardware accelerators.
    • Compiler and runtime based parallelization & optimization for GPUs 

      Ozen, Guray (Universitat Politècnica de Catalunya, 2018-12-13)
      Tesi
      Accés obert
      Graphics Processing Units (GPU) have been widely adopted to accelerate the execution of HPC workloads due to their vast computational throughput, ability to execute a large number of threads inside SIMD groups in parallel ...
    • Exploring dynamic parallelism in OpenMP 

      Ozen, Guray; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2015)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      GPU devices are becoming a common element in current HPC platforms due to their high performance-per-Watt ratio. However, developing applications able to exploit their dazzling performance is not a trivial task, which ...
    • MACC: Mercurium ACCelerator Model 

      Ozen, Guray; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Barcelona Supercomputing Center, 2015-05-05)
      Text en actes de congrés
      Accés obert
      GPU Offloading is emergent programming model. OpenMP includes in its latest 4.0 specification the accelerator model. In this paper we present a newly implementation of this specification while generationg "native" GPU ...
    • Multiple target task sharing support for the OpenMP accelerator model 

      Ozen, Guray; Mateo, Sergi; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Beyer, James B. (Springer, 2016)
      Text en actes de congrés
      Accés obert
      The use of GPU accelerators is becoming common in HPC platforms due to the their effective performance and energy efficiency. In addition, new generations of multicore processors are being designed with wider vector units ...
    • On the roles of the programmer, the compiler and the runtime system when programming accelerators in OpenMP 

      Ozen, Guray; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2014)
      Text en actes de congrés
      Accés obert
      OpenMP includes in its latest 4.0 specification the accelerator model. In this paper we present a partial implementation of this specification in the OmpSs programming model developed at the Barcelona Supercomputing Center ...
    • POSTER: collective dynamic parallelism for directive based GPU programming languages and compilers 

      Ozen, Guray; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Association for Computing Machinery (ACM), 2016)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Early programs for GPU (Graphics Processing Units) acceleration were based on a flat, bulk parallel programming model, in which programs had to perform a sequence of kernel launches from the host CPU. In the latest releases ...