Now showing items 1-20 of 23

    • A cost-effective clustered architecture 

      Canal Corretger, Ramon; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1999)
      Conference report
      Open Access
      In current superscalar processors, all floating-point resources are idle during the execution of integer programs. As previous works show, this problem can be alleviated if the floating-point cluster is extended to execute ...
    • An energy-efficient memory unit for clustered microarchitectures 

      Bieschewski, Stefan; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2016-08-01)
      Article
      Open Access
      Whereas clustered microarchitectures themselves have been extensively studied, the memory units for these clustered microarchitectures have received relatively little attention. This article discusses some of the inherent ...
    • Design of Clustered Superscalar Microarchitectures 

      Parcerisa Bundó, Joan Manuel (Universitat Politècnica de Catalunya, 2004-06-17)
      Doctoral thesis
      Open Access
      L'objectiu d'aquesta tesi és proposar noves tècniques per al disseny de microarquitectures clúster superescalars eficients. Les microarquitectures clúster particionen el disseny de diversos components crítics del hardware ...
    • Dynamic cluster assignment mechanisms 

      Canal Corretger, Ramon; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2000)
      Conference report
      Open Access
      Clustered microarchitectures are an effective approach to reducing the penalties caused by wire delays inside a chip. Current superscalar processors have in fact a two-cluster microarchitecture with a naive code partitioning ...
    • Early register release for out-of-order processors with register windows 

      Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2007)
      Conference report
      Open Access
      Register windows is an architectural technique that reduces memory operations required to save and restore registers across procedure calls. Its effectiveness depends on the size of the register file. Such register ...
    • Early visibility resolution for removing ineffectual computations in the graphics pipeline 

      Anglada Sánchez, Martí; de Lucas Casamayor, Enrique; Parcerisa Bundó, Joan Manuel; Aragón Alcaraz, Juan Luis; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Restricted access - publisher's policy
      GPUs' main workload is real-time image rendering. These applications take a description of a (animated) scene and produce the corresponding image(s). An image is rendered by computing the colors of all its pixels. It is ...
    • Efficient interconnects for clustered microarchitectures 

      Parcerisa Bundó, Joan Manuel; Sahuquillo, Julio; González Colás, Antonio María; Duato, José (Institute of Electrical and Electronics Engineers (IEEE), 2002)
      Conference report
      Open Access
      Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection ...
    • Eliminating redundant fragment shader executions on a mobile GPU via hardware memoization 

      Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (2014)
      Conference report
      Restricted access - publisher's policy
      Redundancy is at the heart of graphical applications. In fact, generating an animation typically involves the succession of extremely similar images. In terms of rendering these images, this behavior translates into the ...
    • Improving branch prediction and predicated execution in out-of-order processors 

      Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2007)
      Conference report
      Open Access
      If-conversion is a compiler technique that reduces the misprediction penalties caused by hard-to-predict branches, transforming control dependencies into data dependencies. Although it is globally beneficial, it has a ...
    • Improving latency tolerance of multithreading through decoupling 

      Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2001-10)
      Article
      Open Access
      The increasing hardware complexity of dynamically scheduled superscalar processors may compromise the scalability of this organization to make an efficient use of future increases in transistor budget. SMT processors, ...
    • La Influencia del orden de las preguntas en los exámenes de primer curso 

      López Álvarez, David; Cortés Martínez, Jordi; Fernández Barta, Montserrat; Parcerisa Bundó, Joan Manuel; Tous Liesa, Rubén; Tubella Murgadas, Jordi (Universitat Jaume I. Escola Superior de Tecnologia i Ciències Experimentals, 2013-07-10)
      Conference report
      Open Access
      El orden de las preguntas en un examen no debería tener influencia en sus resultados. Sin embargo, los autores tenemos la sensación de que los estudiantes de primero suelen ser secuenciales a la hora de resolver los ...
    • Leveraging register windows to reduce physical registers to the bare minimum 

      Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2010-12)
      Article
      Open Access
      Register window is an architectural technique that reduces memory operations required to save and restore registers across procedure calls. Its effectiveness depends on the size of the register file. Such register requirements ...
    • Memory bank predictors 

      Bieschewski, Stefan; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2005)
      Conference report
      Open Access
      Cache memories are commonly implemented through multiple memory banks to improve bandwidth and latency. The early knowledge of the data cache bank that an instruction will access can help to improve the performance in ...
    • On-chip interconnects and instruction steering schemes for clustered microarchitectures 

      Parcerisa Bundó, Joan Manuel; Sahuquillo, Julio; González Colás, Antonio María; Duato, José (2005-02)
      Article
      Open Access
      Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection ...
    • Parallel frame rendering: trading responsiveness for energy on a mobile GPU 

      Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (2013)
      Conference report
      Restricted access - publisher's policy
      Perhaps one of the most important design aspects for smartphones and tablets is improving their energy efficiency. Unfortunately, rich media content applications typically put significant pressure to the GPU's memory ...
    • Reducing wire delay penalty through value prediction 

      Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2000)
      Conference report
      Open Access
      In this paper we show that value prediction can be used to avoid the penalty of long wire delays by predicting the data that is communicated through these long wires and validating the prediction locally where the value ...
    • Rendering elimination: early discard of redundant tiles in the graphics pipeline 

      Anglada Sánchez, Martí; de Lucas Casamayor, Enrique; Parcerisa Bundó, Joan Manuel; Aragón, Juan Luis; Marcuello Pascual, Pedro; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference report
      Restricted access - publisher's policy
      GPUs are one of the most energy-consuming components for real-time rendering applications, since a large number of fragment shading computations and memory accesses are involved. Main memory bandwidth is especially taxing ...
    • TEAPOT: a toolset for evaluating performance, power and image quality on mobile graphics systems 

      Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (ACM, 2013)
      Conference report
      Open Access
      In this paper we present TEAPOT, a full system GPU simulator, whose goal is to allow the evaluation of the GPUs that reside in mobile phones and tablets. To this extent, it has a cycle accurate GPU model for evaluating ...
    • The latency hiding effectiveness of decoupled access/execute processors 

      Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1998)
      Conference report
      Open Access
      Several studies have demonstrated that out-of-order execution processors may not be the most adequate organization for wide-issue processors due to the increasing penalties that wire delays cause in the issue logic. The ...
    • The synergy of multithreading and access/execute decoupling 

      Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1999)
      Conference report
      Open Access
      This work presents and evaluates a novel processor microarchitecture which combines two paradigms: access/execute decoupling and simultaneous multithreading. We investigate how both techniques complement each other: while ...