Now showing items 1-12 of 202

    • Compiler-assisted compaction/restoration of SIMD instructions 

      Cebrián González, Juan Manuel; Balem, Thibaud; Barredo Ferreira, Adrián; Casas Guix, Marc; Moreto Planas, Miquel; Ros Bardisa, Alberto; Jimborean, Alexandra (2021)
      Article
      Open Access
      All the supercomputers in the world exploit data-level parallelism (DLP), for example by using single instructions to operate over several data elements. Improving vector processing is therefore key for exascale computing. ...
    • Size & shape matters: The need of HPC benchmarks of high resolution image training for deep learning 

      Parés Pont, Ferran; Megias Montsesinos, Pedro; García Gasulla, Dario; Garcia Gasulla, Marta; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2021-03)
      Article
      Open Access
      One of the purposes of HPC benchmarks is to identify limitations and bottlenecks in hardware. This functionality is particularly influential when assessing performance on emerging tasks, the nature and requirements of which ...
    • Real-time Issues in the Ada Parallel Model with OpenMP 

      Pinho, Luis Miguel; Royuela Alcázar, Sara; Quiñones, Eduardo (Association for Computing Machinery, 2021)
      Article
      Open Access
      The current proposal for the next revision of the Ada language considers the possibility to map the language parallel features to an underlying OpenMP runtime. As previously presented, and discussed in previous workshops, ...
    • Workload-aware placement strategies to leverage disaggregated resources in the datacenter 

      Call Barreiro, Aaron; Polo Bardés, Jorda; Carrera Pérez, David (2021-07)
      Article
      Open Access
      Disaggregation of resources is a datacenter strategy that aims to decouple the physical location of resources from the place where they are accessed, as opposed to physically attached devices connected to the Peripheral ...
    • Hardware acceleration for query processing: Leveraging FPGAs, CPUs, and memory 

      Arcas Abella, Oriol; Armejach Sanosa, Adrià; Hayes, Timothy; Malazgirt, Görker Alp; Palomar Pérez, Óscar; Salami, Behzad; Sonmez, Nehir (2016-01)
      Article
      Open Access
      Database management systems have become an indispensable tool for industry, government, and academia, and form a significant component of modern datacenters. They can be used in a multitude of scenarios, including online ...
    • OmpSs@FPGA framework for high performance FPGA computing 

      Haro Ruiz, Juan Miguel de; Bosch Pons, Jaume; Filgueras Izquierdo, Antonio; Vidal, Miquel; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2021-06-02)
      Article
      Open Access
      This paper presents the new features of the OmpSs@FPGA framework. OmpSs is a data-flow programming model that supports task nesting and dependencies to target asynchronous parallelism and heterogeneity. OmpSs@FPGA is the ...
    • The Marenostrum experimental exascale platform (MEEP) 

      Fell, Alexander; Mazure, Daniel J.; Garcia, Teresa C.; Perez, Borja; Teruel, Xavier; Wilson, Pete; Davis, John D. (Publishing center of the South Ural State University, 2021)
      Article
      Open Access
      Nascent Open Source Instruction Set Architectures such as OpenPOWER or RISC-V, allow software/hardware co-designers to fully utilize the underlying hardware, modify it or extend it based on their needs. In this paper, we ...
    • On the definition of resource sharing levels to understand and control the impact of contention in multicore processors 

      Mezzetti, Enrico; Abella Ferrer, Jaume; Cazorla Almeida, Francisco Javier; Tabani, Hamid; Kosmidis, Leonidas (SAE International, 2021-06)
      Article
      Open Access
      The trend toward the adoption of a multiprocessor system on a chip (MPSoC) in critical real-time domains, like avionics or automotive, responds to the demand for increased computing performance to support advanced software ...
    • Enabling genomics pipelines in commodity personal computers with flash storage 

      Cadenelli, Nicola; Jun, Sang-Woo; Polo Bardés, Jorda; Wright, Andrew J.; Carrera Pérez, David; Mithal, Arvind (Frontiers Media SA, 2021-04)
      Article
      Open Access
      Analysis of a patient’s genomics data is the first step toward precision medicine. Such analyses are performed on expensive enterprise-class server machines because input data sets are large, and the intermediate data ...
    • Forecastability measures that describe the complexity of a site for deep learning wind predictions 

      Manero Font, Jaume; Béjar Alonso, Javier (2021-05-29)
      Article
      Open Access
      The application of deep learning to wind time series for multi-step prediction obtains good results at short horizons. The accuracy of a wind forecast is highly dependent on the specific structure of wind in the specific ...
    • The OpenMP API for high integrity systems: Moving responsibility from users to vendors 

      Klemm, Michael; Quiñones, Eduardo; Taft, Tucker; Ziegenbein, Dirk; Royuela Alcázar, Sara (Association for Computing Machinery, 2021)
      Article
      Open Access
      OpenMP is traditionally focused on boosting performance in HPC systems. However, other domains are showing an increasing interest in the use of OpenMP by virtue of key aspects introduced in recent versions of the specification: ...
    • Performance characterization of video analytics workloads in heterogeneous edge infrastructures 

      Rivas Barragan, Daniel; Guim Bernat, Francesc; Polo Bardés, Jorda; Carrera Pérez, David (Wiley (John Wiley & Sons), 2021-05-07)
      Article
      Open Access
      Powered by deep learning, video analytic applications process millions of camera feeds in real-time to extract meaningful information from their surroundings. And this number grows by the minute. To avoid saturating the ...