Exploració per autor "Ayguadé Parra, Eduard"
Ara es mostren els items 41-60 de 326
-
An out-of-the-box full-network embedding for convolutional neural networks
Garcia-Gasulla, Dario; Vilalta Arias, Armand; Parés, Ferran; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Cortés García, Claudio Ulises; Suzumura, Toyotaro (Institute of Electrical and Electronics Engineers (IEEE), 2018)
Text en actes de congrés
Accés obertFeatures extracted through transfer learning can be used to exploit deep learning representations in contexts where there are very few training samples, where there are limited computational resources, or when the tuning ... -
Analysis of the overheads incurred due to speculation in a task based programming model
Gayatri, Rahulkumar; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard (2015)
Text en actes de congrés
Accés obertIn order to efficiently utilize the ever increasing processing power of multi-cores, a programmer must extract as much parallelism as possible from a given application. However with every such attempt there is an associated ... -
Analyzing performance improvements and energy savings in Infiniband architecture using network compression
Dickov, Branimir; Pericas, Miquel; Carpenter, Paul Matthew; Navarro, Nacho; Ayguadé Parra, Eduard (Institute of Electrical and Electronics Engineers (IEEE), 2014)
Text en actes de congrés
Accés restringit per política de l'editorialOne of the greatest challenges in HPC is total system power and energy consumption. Whereas HPC interconnects have traditionally been designed with a focus on bandwidth and latency, there is an increasing interest in ... -
Another trip to the wall: how much will stacked DRAM benefit HPC?
Radulović, Milan; Živanovič, Darko; Ruiz, Daniel; De Supinski, Bronis; McKee, Sally; Radojković, Petar; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2015)
Text en actes de congrés
Accés restringit per política de l'editorialFirst defined two decades ago, the memory wall remains a fundamental limitation to system performance. Recent innovations in 3D-stacking technology enable DRAM devices with much higher bandwidths than traditional DIMMs. ... -
Application acceleration on FPGAs with OmpSs@FPGA
Bosch, Jaume; Tan, Xubin; Filgueras Izquierdo, Antonio; Vidal, Miquel; Mateu, Marc; Jiménez-González, Daniel; Álvarez, Carlos; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2019)
Text en actes de congrés
Accés obertOmpSs@FPGA is the flavor of OmpSs that allows offloading application functionality to FPGAs. Similarly to OpenMP, it is based on compiler directives. While the OpenMP specification also includes support for heterogeneous ... -
Applying interposition techniques for performance analysis of OPENMP parallel applications
González Tallada, Marc; Serra, Albert; Martorell Bofill, Xavier; Oliver Segura, José; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Navarro, Nacho (Institute of Electrical and Electronics Engineers (IEEE), 2000)
Text en actes de congrés
Accés obertTuning parallel applications requires the use of effective tools for detecting performance bottlenecks. Along a parallel program execution, many individual situations of performance degradation may arise. We believe that ... -
Artificial intelligence to identify retinal fundus images, quality validation, laterality evaluation, macular degeneration, and suspected glaucoma
Zapata Victori, Miguel Ángel; Royo Fibla, Dídac; Font, Octavi; Vela Segarra, José Ignacio; Marcantonio Santa Cruz, Ivanna Andrea; Moya Sánchez, Eduardo Ulises; Sánchez Pérez, Abraham; Garcia Gasulla, Dario; Cortés García, Claudio Ulises; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2020-02-13)
Article
Accés obertPurpose: To assess the performance of deep learning algorithms for different tasks in retinal fundus images: (1) detection of retinal fundus images versus optical coherence tomography (OCT) or other images, (2) evaluation ... -
Assembling a high-productivity DSL for computational fluid dynamics
Macià, Sandra; Martínez-Ferrer, Pedro J.; Mateo, Sergi; Beltran Querol, Vicenç; Ayguadé Parra, Eduard (Association for Computing Machinery (ACM), 2019)
Text en actes de congrés
Accés obertAs we move towards exascale computing, an abstraction for effective parallel computation is increasingly needed to overcome the maintainability and portability of scientific applications while ensuring the efficient and ... -
Assessing Saiph, a task-based DSL for high-performance computational fluid dynamics
Macià Sorrosal, Sandra; Martínez Ferrer, Pedro José; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Elsevier, 2023-10)
Article
Accés restringit per política de l'editorialScientific applications face the challenge of efficiently exploiting increasingly complex parallel and distributed systems. Developing hand-tuned codes is a time-consuming, tedious and hardly reusable task. Reaching high ... -
Asynchronous and exact forward recovery for detected errors in iterative solvers
Jaulmes, Luc; Casas, Marc; Moretó Planas, Miquel; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (2018-03-19)
Article
Accés obertCurrent trends and projections show that faults in computer systems become increasingly common. Such errors may be detected, and possibly corrected transparently, e.g. by Error Correcting Codes (ECC). For a program to be ... -
Asynchronous runtime with distributed manager for task-based programming models
Bosch Pons, Jaume; Álvarez Martínez, Carlos; Jiménez González, Daniel; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2020-09)
Article
Accés obertParallel task-based programming models, like OpenMP, allow application developers to easily create a parallel version of their sequential codes. The standard OpenMP 4.0 introduced the possibility of describing a set of ... -
Atomic quake: using transactional memory in an interactive mulitplayer game Server
Zyulkyarov, Ferad; Gajinov, Vladimir; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard; Harris, Tim; Valero Cortés, Mateo (2009)
Text en actes de congrés
Accés obertTransactional Memory (TM) is being studied widely as a new technique for synchronizing concurrent accesses to shared memory data structures for use in multi-core systems. Much of the initial work on TM has been evaluated ... -
Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems
Pérez, Borja; Stafford, Esteban; Bosque Orero, José Luis; Beivide Palacio, Ramon; Mateo Bellido, Sergi; Teruel García, Xavier; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2019-03-01)
Article
Accés obertThe emergence of heterogeneous systems has been very notable recently. The nodes of the most powerful computers integrate several compute accelerators, like GPUs. Profiting from such node configurations is not a trivial ... -
Automated curation of brand-related social media images with deep learning
Tous Liesa, Rubén; Gómez Parada, Mauro; Poveda, Jonatan; Cruz, Leonel; Wust, Otto; Makni, Mouna; Ayguadé Parra, Eduard (2018-10)
Article
Accés obertThis paper presents a work consisting in using deep convolutional neural networks (CNNs) to facilitate the curation of brand-related social media images. The final goal is to facilitate searching and discovering user-generated ... -
Automated generation of high-performance computational fluid dynamics codes
Macià Sorrosal, Sandra; Martínez Ferrer, Pedro J.; Ayguadé Parra, Eduard; Beltran Querol, Vicenç (Elsevier, 2022-05)
Article
Accés obertDomain-Specific Languages (DSLs) improve programmers productivity by decoupling problem descriptions from algorithmic implementations. However, DSLs for High-Performance Computing (HPC) have two additional critical ... -
Automatic aggregation of subtask accesses for nested OpenMP-style tasks
Ali, Omar Shaaban Ibrahim; Aguilar Mena, Jimmy; Beltran Querol, Vicenç; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2022)
Text en actes de congrés
Accés obertTask-based programming is a high performance and productive model to express parallelism. Tasks encapsulate work to be executed across multiple cores or offloaded to GPUs, FPGAs, other accelerators or other nodes. In order ... -
Automatic exploration of potential parallelism in sequential applications
Subotic, Vladimir; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Valero Cortés, Mateo (Springer, 2014)
Text en actes de congrés
Accés restringit per política de l'editorialThe multicore era has increased the need for highly parallel software. Since automatic parallelization turned out ineffective for many production codes, the community hopes for the development of tools that may assist ... -
Automatic multilevel parallelization using OpenMP
Jin, H; Jost, G; Yan, J; Ayguadé Parra, Eduard; González Tallada, Marc; Martorell Bofill, Xavier (2004-06)
Article
Accés restringit per política de l'editorialIn this paper we describe the extension of the CAPO parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler ... -
Automatic pre-fetch and modulo scheduling transformations for the cell BE architecture
Vujic, N; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2008-01)
Article
Accés restringit per política de l'editorialEase of programming is one of the main impediments for the broad acceptance of multi-core systems with no hardware support for transparent data transfer between local and global memories. Software cache is a robust approach ... -
Automatic query driven data modelling in cassandra
Hernández, Roger; Becerra Fontal, Yolanda; Torres Viñals, Jordi; Ayguadé Parra, Eduard (Elsevier, 2015)
Text en actes de congrés
Accés obertNon-relational databases have recently been the preferred choice when it comes to dealing with BigData challenges, but their performance is very sensitive to the chosen data organisations. We have seen differences of over ...