Now showing items 41-60 of 326

    • An out-of-the-box full-network embedding for convolutional neural networks 

      Garcia-Gasulla, Dario; Vilalta Arias, Armand; Parés, Ferran; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Cortés García, Claudio Ulises; Suzumura, Toyotaro (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference paper
      Open access
      Features extracted through transfer learning can be used to exploit deep learning representations in contexts where there are very few training samples, where there are limited computational resources, or when the tuning ...
    • Analysis of the overheads incurred due to speculation in a task based programming model 

      Gayatri, Rahulkumar; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard (2015)
      Conference paper
      Open access
      In order to efficiently utilize the ever increasing processing power of multi-cores, a programmer must extract as much parallelism as possible from a given application. However with every such attempt there is an associated ...
    • Analyzing performance improvements and energy savings in Infiniband architecture using network compression 

      Dickov, Branimir; Pericas, Miquel; Carpenter, Paul Matthew; Navarro, Nacho; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
      Conference paper
      Restricted access due to publisher policy
      One of the greatest challenges in HPC is total system power and energy consumption. Whereas HPC interconnects have traditionally been designed with a focus on bandwidth and latency, there is an increasing interest in ...
    • Another trip to the wall: how much will stacked DRAM benefit HPC? 

      Radulović, Milan; Živanovič, Darko; Ruiz, Daniel; De Supinski, Bronis; McKee, Sally; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2015)
      Conference paper
      Restricted access due to publisher policy
      First defined two decades ago, the memory wall remains a fundamental limitation to system performance. Recent innovations in 3D-stacking technology enable DRAM devices with much higher bandwidths than traditional DIMMs. ...
    • Application acceleration on FPGAs with OmpSs@FPGA 

      Bosch, Jaume; Tan, Xubin; Filgueras Izquierdo, Antonio; Vidal, Miquel; Mateu, Marc; Jiménez-González, Daniel; Álvarez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference paper
      Open access
      OmpSs@FPGA is the flavor of OmpSs that allows offloading application functionality to FPGAs. Similarly to OpenMP, it is based on compiler directives. While the OpenMP specification also includes support for heterogeneous ...
    • Applying interposition techniques for performance analysis of OpenMP parallel applications 

      González Tallada, Marc; Serra, Albert; Martorell Bofill, Xavier; Oliver Segura, José; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Navarro, Nacho (Institute of Electrical and Electronics Engineers (IEEE), 2000)
      Conference paper
      Open access
      Tuning parallel applications requires the use of effective tools for detecting performance bottlenecks. Along a parallel program execution, many individual situations of performance degradation may arise. We believe that ...
    • Artificial intelligence to identify retinal fundus images, quality validation, laterality evaluation, macular degeneration, and suspected glaucoma 

      Zapata Victori, Miguel Ángel; Royo Fibla, Dídac; Font, Octavi; Vela Segarra, José Ignacio; Marcantonio Santa Cruz, Ivanna Andrea; Moya Sánchez, Eduardo Ulises; Sánchez Pérez, Abraham; Garcia Gasulla, Dario; Cortés García, Claudio Ulises; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2020-02-13)
      Article
      Open access
      Purpose: To assess the performance of deep learning algorithms for different tasks in retinal fundus images: (1) detection of retinal fundus images versus optical coherence tomography (OCT) or other images, (2) evaluation ...
    • Assembling a high-productivity DSL for computational fluid dynamics 

      Macià, Sandra; Martínez-Ferrer, Pedro J.; Mateo, Sergi; Beltran Querol, Vicenç; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
      Conference paper
      Open access
      As we move towards exascale computing, an abstraction for effective parallel computation is increasingly needed to overcome the maintainability and portability of scientific applications while ensuring the efficient and ...
    • Assessing Saiph, a task-based DSL for high-performance computational fluid dynamics 

      Macià Sorrosal, Sandra; Martínez Ferrer, Pedro José; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Elsevier, 2023-10)
      Article
      Restricted access due to publisher policy
      Scientific applications face the challenge of efficiently exploiting increasingly complex parallel and distributed systems. Developing hand-tuned codes is a time-consuming, tedious and hardly reusable task. Reaching high ...
    • Asynchronous and exact forward recovery for detected errors in iterative solvers 

      Jaulmes, Luc; Casas, Marc; Moretó Planas, Miquel; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (2018-03-19)
      Article
      Open access
      Current trends and projections show that faults in computer systems become increasingly common. Such errors may be detected, and possibly corrected transparently, e.g. by Error Correcting Codes (ECC). For a program to be ...
    • Asynchronous runtime with distributed manager for task-based programming models 

      Bosch Pons, Jaume; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2020-09)
      Article
      Open access
      Parallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of ...
    • Atomic quake: using transactional memory in an interactive multiplayer game server 

      Zyulkyarov, Ferad; Gajinov, Vladimir; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard; Harris, Tim; Valero Cortés, Mateo (2009)
      Conference paper
      Open access
      Transactional Memory (TM) is being studied widely as a new technique for synchronizing concurrent accesses to shared memory data structures for use in multi-core systems. Much of the initial work on TM has been evaluated ...
    • Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems 

      Pérez, Borja; Stafford, Esteban; Bosque Orero, José Luis; Beivide Palacio, Ramon; Mateo Bellido, Sergi; Teruel García, Xavier; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2019-03-01)
      Article
      Open access
      The emergence of heterogeneous systems has been very notable recently. The nodes of the most powerful computers integrate several compute accelerators, like GPUs. Profiting from such node configurations is not a trivial ...
    • Automated curation of brand-related social media images with deep learning 

      Tous Liesa, Rubén; Gómez Parada, Mauro; Poveda, Jonatan; Cruz, Leonel; Wust, Otto; Makni, Mouna; Ayguadé Parra, Eduard (2018-10)
      Article
      Open access
      This paper presents a work consisting in using deep convolutional neural networks (CNNs) to facilitate the curation of brand-related social media images. The final goal is to facilitate searching and discovering user-generated ...
    • Automated generation of high-performance computational fluid dynamics codes 

      Macià Sorrosal, Sandra; Martínez Ferrer, Pedro J.; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Elsevier, 2022-05)
      Article
      Open access
      Domain-Specific Languages (DSLs) improve programmers productivity by decoupling problem descriptions from algorithmic implementations. However, DSLs for High-Performance Computing (HPC) have two additional critical ...
    • Automatic aggregation of subtask accesses for nested OpenMP-style tasks 

      Ali, Omar Shaaban Ibrahim; Aguilar Mena, Jimmy; Beltran Querol, Vicenç; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference paper
      Open access
      Task-based programming is a high performance and productive model to express parallelism. Tasks encapsulate work to be executed across multiple cores or offloaded to GPUs, FPGAs, other accelerators or other nodes. In order ...
    • Automatic exploration of potential parallelism in sequential applications 

      Subotic, Vladimir; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (Springer, 2014)
      Conference paper
      Restricted access due to publisher policy
      The multicore era has increased the need for highly parallel software. Since automatic parallelization turned out ineffective for many production codes, the community hopes for the development of tools that may assist ...
    • Automatic multilevel parallelization using OpenMP 

      Jin, H; Jost, G; Yan, J; Ayguadé Parra, Eduard; González Tallada, Marc; Martorell Bofill, Xavier (2004-06)
      Article
      Restricted access due to publisher policy
      In this paper we describe the extension of the CAPO parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler ...
    • Automatic pre-fetch and modulo scheduling transformations for the Cell BE architecture 

      Vujic, N; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2008-01)
      Article
      Restricted access due to publisher policy
      Ease of programming is one of the main impediments for the broad acceptance of multi-core systems with no hardware support for transparent data transfer between local and global memories. Software cache is a robust approach ...
    • Automatic query driven data modelling in Cassandra 

      Hernández, Roger; Becerra Fontal, Yolanda; Torres Viñals, Jordi; Ayguadé Parra, Eduard (Elsevier, 2015)
      Conference paper
      Open access
      Non-relational databases have recently been the preferred choice when it comes to dealing with BigData challenges, but their performance is very sensitive to the chosen data organisations. We have seen differences of over ...