Now showing items 1-20 of 293

    • 2018 International Symposium on Computer Architecture influential paper award 

      González Colás, Antonio María (2018-07-01)
      Article
      Open Access
      The International Symposium on Computer Architecture (ISCA) recognizes every year the most influential paper published in this conference 15 years earlier, based on its impact on research, development, products or ideas. ...
    • A case study for the verification of complex timed circuits: IPCMOS 

      Peña Basurto, Marco Antonio; Cortadella, Jordi; Pastor Llorens, Enric; Smirnov, Alexandre (Institute of Electrical and Electronics Engineers (IEEE), 2002)
      Conference report
      Open Access
      The verification of a n-stage pulse-driven IPCMOS pipeline, for any n>0, is presented. The complexity of the system is 32n transistors and delay information is provided at the level of transistor The correctness of the ...
    • A control and management architecture supporting autonomic NFV services 

      Velasco Esteban, Luis Domingo; Casellas Regi, Ramón; Llana, Sergio; Gifre Renom, Lluís; Martínez Rivera, Ricardo Victor; Vilata, Ricard; Muñoz González, Raül; Ruiz, Marc (2019-02)
      Article
      Open Access
      The proposed control, orchestration and management (COM) architecture is presented from a high-level point of view; it enables the dynamic provisioning of services such as network data connectivity or generic network slicing ...
    • A decoupled KILO-instruction processor 

      Pericàs Gleim, Miquel; Cristal Kestelman, Adrián; González García, Rubén; Jiménez, Daniel A.; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 2006)
      Conference report
      Open Access
      Building processors with large instruction windows has been proposed as a mechanism for overcoming the memory wall, but finding a feasible and implementable design has been an elusive goal. Traditional processors are ...
    • A fast and efficient algorithm to identify clusters in networks 

      Comellas Padró, Francesc de Paula; Miralles de la Asunción, Alicia (2010-11-01)
      Article
      Restricted access - publisher's policy
      A characteristic feature of many relevant real life networks, like the WWW, Internet, transportation and communication networks, or even biological and social networks, is their clustering structure. We discuss in this ...
    • A general guide to applying machine learning to computer architecture 

      Nemirovsky, Daniel; Arkose, Tugberk; Markovic, Nikola; Nemirovsky, Mario; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Valero Cortés, Mateo (2018)
      Article
      Open Access
      The resurgence of machine learning since the late 1990s has been enabled by significant advances in computing performance and the growth of big data. The ability of these algorithms to detect complex patterns in data which ...
    • A hardware accelerator for ORB-SLAM 

      Taranco, Raúl (Universitat Politècnica de Catalunya, 2019-10-17)
      Master thesis
      Open Access
      Simultaneous Localization And Mapping (SLAM) is a key component of self-driving cars. We study ORB-SLAM, a SLAM state-of-the-art solution, and develop a hardware accelerator for a critical part of it: ORB feature extraction. ...
    • A hardware evaluation of cache partitioning to improve utilization and energy-efficiency while preserving responsiveness 

      Cook, Henry; Moretó Planas, Miquel; Bird, Sarah L.; Dao, Khanh; Patterson, David; Asanovic, Krste (ACM, 2013)
      Conference report
      Open Access
      Computing workloads often contain a mix of interactive, latency-sensitive foreground applications and recurring background computations. To guarantee responsiveness, interactive and batch applications are often run on ...
    • A hybrid web server architecture for secure e-business web applications 

      Beltran Querol, Vicenç; Carrera Pérez, David; Guitart Fernández, Jordi; Torres Viñals, Jordi; Ayguadé Parra, Eduard (2005-09)
      Article
      Restricted access - publisher's policy
      Nowadays the success of many e-commerce applications, such as on-line banking, depends on their reliability, robustness and security. Designing a web server architecture that keeps these properties under high loads is a ...
    • A multi-radix approach to asynchronous division 

      Cornetta, Gianluca; Cortadella, Jordi (Institute of Electrical and Electronics Engineers (IEEE), 2001)
      Conference report
      Open Access
      The speed of high-radix digit-recurrence dividers is mainly determined by the hardware complexity of the quotient-digit selection function. In this paper we present a scheme that combines the area efficiency of bundled ...
    • A novel register renaming technique for out-of-order processors 

      Tabani, Hamid; Arnau Montañés, José María; Tubella Murgadas, Jordi; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference report
      Restricted access - publisher's policy
      Modern superscalar processors support a large number of in-flight instructions, which requires sizeable register files. Conventional register renaming techniques allocate a new storage location, i.e. physical register, for ...
    • A radix-16 SRT division unit with speculation of the quotient digits 

      Gianluca, Cornetta; Cortadella, Jordi (Institute of Electrical and Electronics Engineers (IEEE), 1999)
      Conference report
      Open Access
      The speed of a divider based on a digit-recurrence algorithm depends mainly on the latency of the quotient digit generation function. In this paper we present an analytical approach that extends the theory developed for ...
    • A resource identity management strategy for combined fog-to-cloud systems 

      Gómez Cárdenas, Alejandro; Masip Bruin, Xavier; Marín Tordera, Eva; Kahvazadeh, Sarang; García Almiñana, Jordi (2018)
      Conference report
      Restricted access - publisher's policy
      Fog-to-Cloud (F2C) is an emerging architecture intended to manage the resources continuum from far datacenters up to the near edge, putting together the cloud and fog concepts. It aims to obtain both, an efficient utilization ...
    • A Rudimentary Machine. Experiences in the Design of a Pedagogic Computer 

      Pastor Llorens, Enric; Sánchez Carracedo, Fermín; Corral González, Anna M. del (Publicat en web, 1999-07)
      Article
      Open Access
      This paper describes a pedagogic computer named \M aquina Rudimentaria". This computer has been de- signed to be used in a rst course on logic design or com- puter architecture; orthogonality and simplicity have been the ...
    • A self-adaptive hardware architecture with fault tolerance capabilities 

      Soto, Javier; Moreno Aróstegui, Juan Manuel; Cabestany Moncusí, Joan (2013-12-09)
      Article
      Restricted access - publisher's policy
      This paper describes a Fault Tolerance System (FTS) implemented in a new self-adaptive hardware architecture. This architecture is based on an array of cells that implements in a distributed way self-adaptive capabilities. ...
    • A sensor web architecture for integrating smart oceanographic sensors into the semantic sensor web 

      Río Fernández, Joaquín del; Toma, Daniel; Martínez Padró, Enoc; O'Reilly, Thomas C.; Delory, Eric; Pearlman, Jay S; Waldmann, Christoph; Jirka, Simon (2018-10)
      Article
      Open Access
      Effective ocean and coastal data management are needed to manage marine ecosystem health. Past ocean and coastal data management systems were often very specific to a particular application and region, but this focused ...
    • A software-hardware hybrid steering mechanism for clustered microarchitectures 

      Cai, Qiong; Codina Viñas, Josep M.; González González, José; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2008)
      Conference report
      Open Access
      Clustered microarchitectures provide a promising paradigm to solve or alleviate the problems of increasing microprocessor complexity and wire delays. High- performance out-of-order processors rely on hardware-only steering ...
    • A systolic algorithm for the fast computation of the connected components of a graph 

      Núñez, Fernando J.; Valero Cortés, Mateo (Institute of Electrical and Electronics Engineers (IEEE), 1988)
      Conference report
      Open Access
      The authors consider the description of a systolic algorithm to solve the connected-component problem. It is executed in a ring topology with N processors, requiring O(Nlog N) time without regard to the graph's sparsity. ...
    • A Tensor Marshaling Unit for sparse tensor algebra on general-purpose processors 

      Siracusa, Marco; Soria Pardos, Víctor; Sgherzi, Francesco; Randall, Joshua; Joseph, Douglas J.; Moretó Planas, Miquel; Armejach Sanosa, Adrià (Association for Computing Machinery (ACM), 2023)
      Conference report
      Open Access
      This paper proposes the Tensor Marshaling Unit (TMU), a near-core programmable dataflow engine for multicore architectures that accelerates tensor traversals and merging, the most critical op-erations of sparse tensor ...
    • A trace-scaling agent for parallel application tracing 

      Freitag, Fèlix; Caubet Serrabou, Jordi; Labarta Mancho, Jesús José (Institute of Electrical and Electronics Engineers (IEEE), 2002)
      Conference report
      Open Access
      Tracing and performance analysis tools are an important component in the development of high performance applications. Tracing parallel programs with current tracing tools, however, easily leads to large trace files with ...