Now showing items 1-20 of 99

  • Accelerating boosting-based face detection on GPUs 

    Oro, David; Fernández, Carles; Segura, Carlos; Martorell Bofill, Xavier; Hernando Pericás, Francisco Javier (2012)
    Conference report
    Restricted access - publisher's policy
    The goal of face detection is to determine the presence of faces in arbitrary images, along with their locations and dimensions. As it happens with any graphics workloads, these algorithms benefit from data-level ...
  • Accelerating software memory compression on the Cell/B.E. 

    Beltran Querol, Vicenç; Martorell Bofill, Xavier; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2008)
    Conference report
    Restricted access - publisher's policy
    The idea of transparently compressing and decompressing the content of main memory to virtually enlarge their capacity has been previously proposed and studied in the literature. The rationale behind this idea lies in the ...
  • Achieving high memory performance from heterogeneous architectures with the SARC programming model 

    Ferrer, Roger; Beltran Querol, Vicenç; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (ACM, 2009)
    Conference lecture
    Restricted access - publisher's policy
    Current heterogeneous multicore architectures, including the Cell/B.E., GPUs, and future developments, like Larrabee, require enormous programming efforts to efficiently run current parallel applications, achieving high ...
  • ACOTES project: Advanced compiler technologies for embedded streaming 

    Duranton, M.; Munk, H.; Ayguadé Parra, Eduard; Bastoul, C.; Carpenter, Paul; Chamski, Z.; Cohen, A.; Cornero, M.; Dumont, P.; Pop, S.; Pop, A.; Ornstein, A.; Nuzman, D.; Miranda, C.; Martorell Bofill, Xavier; Lindwer, M.; Ladelsky, R.; Ferrer, Roger; Fellahi, M.; Pouchet, L. N; Zaks, A.; Shvadron, U.; Trifunovic, K.; Rohou, E.; Rosen, I.; Ramírez Bellido, Alejandro; Ródenas, D. (2011-04)
    Article
    Open access
    Streaming applications are built of data-driven, computational components, consuming and producing unbounded data streams. Streaming oriented systems have become dominant in a wide range of domains, including embedded ...
  • A module-based cell processor simulator 

    Cabarcas Jaramillo, Felipe; Rico Carro, Alejandro; Rodenas, David; Martorell Bofill, Xavier; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (European Network of Excellence on High Performance and Embedded Architecture and Compilation (HiPEAC), 2006)
    Conference lecture
    Open access
    An interesting design alternative to replication-based chip multiprocessors is to create heterogeneous chip multiprocessors composed of several different cores, with one or more of them running the operating system and ...
  • Analyzing the impact of communication imbalance in high-speed networks 

    Utrera Iglesias, Gladys Miriam; Gil, Marisa; Martorell Bofill, Xavier (2017-12-21)
    Article
    Open access
    In this work we analyze the communication load imbalance generated by irregular-data applications running in a multi-node cluster. Experimental approaches to diminish communication load imbalance are evaluated using a ...
  • An OpenMP* barrier using SIMD instructions for Intel® Xeon Phi™ coprocessor 

    Caballero, Diego; Duran González, Alejandro; Martorell Bofill, Xavier (Springer, 2013)
    Conference report
    Restricted access - publisher's policy
    Barrier synchronisation is a widely-studied topic since the supercomputer era due to its significant impact on the overall performance of parallel applications. With the current shift to many-core architectures, such as ...
  • A novel asynchronous software cache implementation for the Cell-BE processor 

    Balart, J; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Sura, Z; Chen, T; Zhang, T; O'Brien, Kevin; O'Brien, Kathryn (2008-10)
    Article
    Restricted access - publisher's policy
    This paper describes the implementation of a runtime library for asynchronous communication in the Cell BE processor. The runtime library implementation provides with several services that allow the compiler to generate ...
  • A proposal for error handling in OpenMP 

    Duran González, Alejandro; Ferrer, Roger; Costa Prats, Juan José; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José (2006-06)
    Article
    Restricted access - publisher's policy
    OpenMP has been focused in performance applied to numerical applications, but when we try to move this focus to other kind of applications, like Web servers, we detect one important lack. In these applications, performance ...
  • A proposal for task-generating loops in OpenMP 

    Teruel, Xavier; Klemm, Michael; Li, Kelvin; Martorell Bofill, Xavier; Olivier, Stephen; Terboven, Christian (Springer, 2013)
    Conference report
    Restricted access - publisher's policy
    With the addition of the OpenMP* tasking model, programmers are able to improve and extend the parallelization opportunities of their codes. Programmers can also distribute the creation of tasks using a worksharing construct, ...
  • A streaming machine description and programming model 

    Carpenter, Paul; Ródenas Picó, David; Martorell Bofill, Xavier; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (2007-07)
    Article
    Restricted access - publisher's policy
    In this paper we present the initial development of a streaming environment based on a programming model and machine description. The stream programming model consists of an extension to the C language and its translation ...
  • Automatic communication coalescing for irregular computations in UPC language 

    Alvanos, Michail; Tiotto, Ettore; Farreras Esclusa, Montserrat; Martorell Bofill, Xavier (IBM, 2012)
    Conference report
    Restricted access - publisher's policy
    Partitioned Global Address Space (PGAS) languages appeared to address programmer productivity in large scale parallel machines. However, fine grain accesses on shared structures have been identified as one of the main ...
  • Automatic generation and testing of application specific hardware accelerators on a new reconfigurable OpenSPARC platform 

    González Álvarez, Cecilia; Fernández, Mikel; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier (2011)
    Conference report
    Open access
    Specific hardware customization for scientific applications has shown a big potential to address the current holy grail in computer architecture: reducing power consumption while increasing performance. In particular, the ...
  • Automatic multilevel parallelization using OpenMP 

    Jin, H; Jost, G; Yan, J; Ayguadé Parra, Eduard; González Tallada, Marc; Martorell Bofill, Xavier (2004-06)
    Article
    Restricted access - publisher's policy
    In this paper we describe the extension of the CAPO parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler ...
  • Automatic pre-fetch and modulo scheduling transformations for the cell BE architecture 

    Vujic, N; González Tallada, Marc; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2008-01)
    Article
    Restricted access - publisher's policy
    Ease of programming is one of the main impediments for the broad acceptance of multi-core systems with no hardware support for transparent data transfer between local and global memories. Software cache is a robust approach ...
  • Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems 

    Pérez, Borja; Stafford, Esteban; Bosque Orero, José Luis; Beivide Palacio, Ramon; Mateo Bellido, Sergi; Teruel García, Xavier; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2019-03-01)
    Article
    Open access
    The emergence of heterogeneous systems has been very notable recently. The nodes of the most powerful computers integrate several compute accelerators, like GPUs. Profiting from such node configurations is not a trivial ...
  • AXIOM: a flexible platform for the smart home 

    Giorgi, Roberto; Bettin, Nicola; Gai, Paolo; Martorell Bofill, Xavier; Rizzo, Antonio (Springer, 2016-09-24)
    Book chapter
    Restricted access - publisher's policy
    The AXIOM hardware/software platform aims at bringing easy programmability on top of a cluster of processors by using a fast interconnect and FPGA as a basis for building a scalable embedded system. The Smart Home is one ...
  • AXIOM: a hardware-software platform for cyber physical systems 

    Mazumdar, Somnath; Ayguadé Parra, Eduard; Bettin, Nicola; Bueno Hedo, Javier; Ermini, Sara; Filgueras Izquierdo, Antonio; Jiménez González, Daniel; Álvarez Martínez, Carlos; Martorell Bofill, Xavier; Montefoschi, Francesco; Oro Garcia, David; Pnevmatikatos, Dionisis; Rizzo, Antonio; Theodoropoulos, Dimitris; Giorgi, Roberto (2016)
    Conference report
    Restricted access - publisher's policy
    Cyber-Physical Systems (CPSs) are widely necessary for many applications that require interactions with the humans and the physical environment. A CPS integrates a set of hardware-software components to distribute, execute ...
  • Barcelona OpenMP tasks suite: a set of benchmarks targeting the exploitation of task parallelism in OpenMP 

    Duran González, Alejandro; Teruel, Xavier; Ferrer, Roger; Martorell Bofill, Xavier; Ayguadé Parra, Eduard (2009)
    Conference lecture
    Open access
  • CellSim: a validated modular heterogeneous multiprocessor simulator 

    Cabarcas Jaramillo, Felipe; Rico Carro, Alejandro; Ródenas Picó, David; Martorell Bofill, Xavier; Ramírez Bellido, Alejandro; Ayguadé Parra, Eduard (Thomson Editores Spain, 2007)
    Conference report
    Open access
    As the number of transistors on a chip continues increasing the power consumption has become the most important constraint in processors design. Therefore, to increase performance, computer architects have decided to use ...