Now showing items 1-20 of 23

  • A cost-effective clustered architecture 

    Canal Corretger, Ramon; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Conference report
    Open Access
    In current superscalar processors, all floating-point resources are idle during the execution of integer programs. As previous works show, this problem can be alleviated if the floating-point cluster is extended to execute ...
  • An energy-efficient memory unit for clustered microarchitectures 

    Bieschewski, Stefan; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2016-08-01)
    Article
    Open Access
    Whereas clustered microarchitectures themselves have been extensively studied, the memory units for these clustered microarchitectures have received relatively little attention. This article discusses some of the inherent ...
  • Design of Clustered Superscalar Microarchitectures 

    Parcerisa Bundó, Joan Manuel (Universitat Politècnica de Catalunya, 2004-06-17)
    Doctoral thesis
    Open Access
    L'objectiu d'aquesta tesi és proposar noves tècniques per al disseny de microarquitectures clúster superescalars eficients. Les microarquitectures clúster particionen el disseny de diversos components crítics del hardware ...
  • Dynamic cluster assignment mechanisms 

    Canal Corretger, Ramon; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2000)
    Conference report
    Open Access
    Clustered microarchitectures are an effective approach to reducing the penalties caused by wire delays inside a chip. Current superscalar processors have in fact a two-cluster microarchitecture with a naive code partitioning ...
  • Early register release for out-of-order processors with register windows 

    Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2007)
    Conference report
    Open Access
    Register windows is an architectural technique that reduces memory operations required to save and restore registers across procedure calls. Its effectiveness depends on the size of the register file. Such register ...
  • Early visibility resolution for removing ineffectual computations in the graphics pipeline 

    Anglada Sánchez, Martí; de Lucas Casamayor, Enrique; Parcerisa Bundó, Joan Manuel; Aragón Alcaraz, Juan Luis; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference report
    Restricted access - publisher's policy
    GPUs' main workload is real-time image rendering. These applications take a description of a (animated) scene and produce the corresponding image(s). An image is rendered by computing the colors of all its pixels. It is ...
  • Efficient interconnects for clustered microarchitectures 

    Parcerisa Bundó, Joan Manuel; Sahuquillo, Julio; González Colás, Antonio María; Duato, José (Institute of Electrical and Electronics Engineers (IEEE), 2002)
    Conference report
    Open Access
    Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection ...
  • Eliminating redundant fragment shader executions on a mobile GPU via hardware memoization 

    Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (2014)
    Conference report
    Restricted access - publisher's policy
    Redundancy is at the heart of graphical applications. In fact, generating an animation typically involves the succession of extremely similar images. In terms of rendering these images, this behavior translates into the ...
  • Improving branch prediction and predicated execution in out-of-order processors 

    Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2007)
    Conference report
    Open Access
    If-conversion is a compiler technique that reduces the misprediction penalties caused by hard-to-predict branches, transforming control dependencies into data dependencies. Although it is globally beneficial, it has a ...
  • Improving latency tolerance of multithreading through decoupling 

    Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2001-10)
    Article
    Open Access
    The increasing hardware complexity of dynamically scheduled superscalar processors may compromise the scalability of this organization to make an efficient use of future increases in transistor budget. SMT processors, ...
  • La Influencia del orden de las preguntas en los exámenes de primer curso 

    López Álvarez, David; Cortés Martínez, Jordi; Fernández Barta, Montserrat; Parcerisa Bundó, Joan Manuel; Tous Liesa, Rubén; Tubella Murgadas, Jordi (Universitat Jaume I. Escola Superior de Tecnologia i Ciències Experimentals, 2013-07-10)
    Conference report
    Open Access
    El orden de las preguntas en un examen no debería tener influencia en sus resultados. Sin embargo, los autores tenemos la sensación de que los estudiantes de primero suelen ser secuenciales a la hora de resolver los ...
  • Leveraging register windows to reduce physical registers to the bare minimum 

    Quiñones, Eduardo; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (2010-12)
    Article
    Open Access
    Register window is an architectural technique that reduces memory operations required to save and restore registers across procedure calls. Its effectiveness depends on the size of the register file. Such register requirements ...
  • Memory bank predictors 

    Bieschewski, Stefan; Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2005)
    Conference report
    Open Access
    Cache memories are commonly implemented through multiple memory banks to improve bandwidth and latency. The early knowledge of the data cache bank that an instruction will access can help to improve the performance in ...
  • On-chip interconnects and instruction steering schemes for clustered microarchitectures 

    Parcerisa Bundó, Joan Manuel; Sahuquillo, Julio; González Colás, Antonio María; Duato, José (2005-02)
    Article
    Open Access
    Clustering is an effective microarchitectural technique for reducing the impact of wire delays, the complexity, and the power requirements of microprocessors. In this work, we investigate the design of on-chip interconnection ...
  • Parallel frame rendering: trading responsiveness for energy on a mobile GPU 

    Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (2013)
    Conference report
    Restricted access - publisher's policy
    Perhaps one of the most important design aspects for smartphones and tablets is improving their energy efficiency. Unfortunately, rich media content applications typically put significant pressure to the GPU's memory ...
  • Reducing wire delay penalty through value prediction 

    Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2000)
    Conference report
    Open Access
    In this paper we show that value prediction can be used to avoid the penalty of long wire delays by predicting the data that is communicated through these long wires and validating the prediction locally where the value ...
  • Rendering elimination: early discard of redundant tiles in the graphics pipeline 

    Anglada Sánchez, Martí; de Lucas Casamayor, Enrique; Parcerisa Bundó, Joan Manuel; Aragón, Juan Luis; Marcuello Pascual, Pedro; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference report
    Restricted access - publisher's policy
    GPUs are one of the most energy-consuming components for real-time rendering applications, since a large number of fragment shading computations and memory accesses are involved. Main memory bandwidth is especially taxing ...
  • TEAPOT: a toolset for evaluating performance, power and image quality on mobile graphics systems 

    Arnau Montañés, José María; Parcerisa Bundó, Joan Manuel; Xekalakis, Polychronis (ACM, 2013)
    Conference report
    Open Access
    In this paper we present TEAPOT, a full system GPU simulator, whose goal is to allow the evaluation of the GPUs that reside in mobile phones and tablets. To this extent, it has a cycle accurate GPU model for evaluating ...
  • The latency hiding effectiveness of decoupled access/execute processors 

    Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1998)
    Conference report
    Open Access
    Several studies have demonstrated that out-of-order execution processors may not be the most adequate organization for wide-issue processors due to the increasing penalties that wire delays cause in the issue logic. The ...
  • The synergy of multithreading and access/execute decoupling 

    Parcerisa Bundó, Joan Manuel; González Colás, Antonio María (Institute of Electrical and Electronics Engineers (IEEE), 1999)
    Conference report
    Open Access
    This work presents and evaluates a novel processor microarchitecture which combines two paradigms: access/execute decoupling and simultaneous multithreading. We investigate how both techniques complement each other: while ...