Ara es mostren els items 137-140 de 140

    • Using shared-data localization to reduce the cost of inspector-execution in unified-parallel-C programs 

      Alvanos, Michail; Tiotto, Ettore; Amaral, José Nelson; Farreras Esclusa, Montserrat; Martorell Bofill, Xavier (2016-05-01)
      Article
      Accés obert
      Programs written in the Unified Parallel C (UPC) language can access any location of the entire local and remote address space via read/write operations. However, UPC programs that contain fine-grained shared accesses can ...
    • Variable batched DGEMM 

      Valero-Lara, Pedro; Martinez-Perez, Ivan; Mateo, Sergio; Sirvent Pardell, Raül; Beltran Querol, Vicenç; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Text en actes de congrés
      Accés obert
      Many scientific applications are in need to solve a high number of small-size independent problems. These individual problems do not provide enough parallelism and then, these must be computed as a batch. Today, vendors ...
    • Work-efficient parallel non-maximum suppression for embedded GPU architectures 

      Oro Garcia, David; Fernandez Tena, Carles; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      With the emergence of GPU computing, deep neural networks have become a widely used technique for advancing research in the field of image and speech processing. In the context of object and event detection, slidingwindow ...
    • Work-efficient parallel non-maximum suppression kernels 

      Oro García, David; Fernandez Tena, Carles; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier (Wiley Heyden, 2020-08-21)
      Article
      Accés obert
      In the context of object detection, sliding-window classifiers and single-shot convolutional neural network (CNN) meta-architectures typically yield multiple overlapping candidate windows with similar high scores around ...