Ara es mostren els items 1-11 de 11

    • A Massively Parallel Algorithm for the Approximate Calculation of Inverse p-th Roots of Large Sparse Matrices 

      Lass, Michael; Mohr, Stephan; Wiebeler, Hendrik; Kühne, Thomas D.; Plessl, Christian (Association for Computing Machinery (ACM), 2018-07)
      Comunicació de congrés
      Accés obert
      We present the submatrix method, a highly parallelizable method for the approximate calculation of inverse p-th roots of large sparse symmetric matrices which are required in different scientific applications. Following ...
    • Beyond the socket: NUMA-aware GPUs 

      Ugljesa, Milic; Villa, Oreste; Bolotin, Evgeny; Arunkumar, Akhil; Ebrahimi, Eiman; Jaleel, Aamer; Ramirez, Alex; Nellans, David (Association for Computing Machinery, 2017-10)
      Comunicació de congrés
      Accés obert
      GPUs achieve high throughput and power efficiency by employing many small single instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance variance they utilize a uniform memory system and ...
    • Breast cancer detection using machine learning with thermograms in an edge computing scenario 

      Tahmooresi, Maryam; Remondo Bueno, David; Alcober Segura, Jesús Ángel (Association for Computing Machinery (ACM), 2021)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      The second cause of death in the world is cancer. Although breast cancer is the more common cause of death among women, the chance of survival can be increased by detecting cancer in the early stages. For this aim, there ...
    • Brook Auto: High-Level Certification-Friendly Programming for GPU-powered Automotive Systems 

      Trompouki, Matina M.; Kosmidis, Leonidas (Association for Computing Machinery (ACM), 2018)
      Comunicació de congrés
      Accés obert
      Modern automotive systems require increased performance to implement Advanced Driving Assistance Systems (ADAS). GPU-powered platforms are promising candidates for such computational tasks, however current low-level ...
    • Computational Fluid and Particle Dynamics Simulations for Respiratory System: Runtime Optimization on an Arm Cluster 

      Garcia-Gasulla, Marta; Josep-Fabrego, Marc; Eguzkitza, Beatriz; Mantovani, Filippo (Association for Computing Machinery (ACM), 2018-08-13)
      Comunicació de congrés
      Accés obert
      Computational fluid and particle dynamics simulations (CFPD) are of paramount importance for studying and improving drug effectiveness. Computational requirements of CFPD codes involves high-performance computing (HPC) ...
    • Evaluation of adherence to nutritional intervention through trajectory analysis 

      Sevilla-Villanueva, Beatriz; Gibert, Karina; Sànchez-Marrè, Miquel; Fitó Colomer, Montserrat; Covas, Maria Isabel (2017-05)
      Article
      Accés obert
      Classical Pre-Post Intervention Studies are often analyzed using traditional statistics. Nevertheless, the nutritional interventions have small effects on the metabolism and traditional statistics are not enough to detect ...
    • Iteration-fusing conjugate gradient 

      Zhuang, Sicong; Casas, Marc (Association for Computing Machinery (ACM), 2017-06)
      Comunicació de congrés
      Accés obert
      This paper presents the Iteration-Fusing Conjugate Gradient (IFCG) approach which is an evolution of the Conjugate Gradient method that consists in i) letting computations from different iterations to overlap between them ...
    • libPRISM: an intelligent adaptation of prefetch and SMT levels 

      Ortega, Cristobal; Moretó Planas, Miquel; Casas, Marc; Bertran, Ramon; Buyuktosunoglu, Alper; Eichenberger, Alexandre; Bose, Pradip (Association for Computing Machinery (ACM), 2017)
      Text en actes de congrés
      Accés obert
      Current microprocessors include several knobs to modify the hardware behavior in order to improve performance under different workload demands. An impractical and time consuming offline profiling is needed to evaluate the ...
    • Multidimensional blocking in UPC 

      Barton, Christopher; Cascaval, Calin; Almási, George; Garg, Rahul; Amaral, José Nelson; Farreras Esclusa, Montserrat (Springer, 2008-02)
      Article
      Accés restringit per política de l'editorial
      Partitioned Global Address Space (PGAS) languages offer an attractive, high-productivity programming model for programming large-scale parallel machines. PGAS languages, such as Unified Parallel C (UPC), combine the ...
    • Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL 

      Ferrer, Roger; Planas Carbonell, Judit; Bellens, Pieter; Duran Gonzalez, Alejandro; González Tallada, Marc; Martorell Bofill, Xavier; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Springer, 2010)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In this paper, we present OMPSs, a programming model based on OpenMP and StarSs, that can also incorporate the use of OpenCL or CUDA kernels. We evaluate the proposal on three different architectures, SMP, Cell/B.E. and ...
    • Terrain prickliness: theoretical grounds for high complexity viewsheds 

      Acharyya, Ankush; Jallu, Ramesh; Löffler, M.; Meijer, Geert; Saumell Mendiola, Maria; Silveira, Rodrigo Ignacio; Staals, Frank (2021)
      Text en actes de congrés
      Accés obert
      An important task when working with terrain models is computing viewsheds: the parts of the terrain visible from a given viewpoint. When the terrain is modeled as a polyhedral terrain, the viewshed is composed of the union ...