Articles de revista
Recent Submissions
-
SHARP: An adaptable, energy-efficient accelerator for recurrent neural networks
(Association for Computing Machinery (ACM), 2023-01-24)
Article
Open AccessThe effectiveness of Recurrent Neural Networks (RNNs) for tasks such as Automatic Speech Recognition has fostered interest in RNN inference acceleration. Due to the recurrent nature and data dependencies of RNN computations, ... -
A survey of near-data processing architectures for neural networks
(2022-01-17)
Article
Open AccessData-intensive workloads and applications, such as machine learning (ML), are fundamentally limited by traditional computing systems based on the von-Neumann architecture. As data movement operations and energy consumption ... -
LOCATOR: Low-power ORB accelerator for autonomous cars
(Elsevier, 2023-04)
Article
Open AccessSimultaneous Localization And Mapping (SLAM) is crucial for autonomous navigation. ORB-SLAM is a state-of-the-art Visual SLAM system based on cameras used for self-driving cars. In this paper, we propose a high-performance, ... -
Triangle Dropping: An occluded-geometry predictor for energy-efficient mobile GPUs
(Association for Computing Machinery (ACM), 2022-09)
Article
Open AccessThis article proposes a novel micro-architecture approach for mobile GPUs aimed at early removing the occluded geometry in a scene by leveraging frame-to-frame coherence, thus reducing the overall energy consumption. Mobile ... -
Irregular accesses reorder unit: improving GPGPU memory coalescing for graph-based workloads
(2023-01)
Article
Open AccessGPGPU architectures have become the dominant platform for massively parallel workloads, delivering high performance and energy efficiency for popular applications such as machine learning, computer vision or self-driving ... -
E-BATCH: Energy-efficient and high-throughput RNN batching
(2022-03)
Article
Open AccessRecurrent Neural Network (RNN) inference exhibits low hardware utilization due to the strict data dependencies across time-steps. Batching multiple requests can increase throughput. However, RNN batching requires a large ... -
CREW: Computation reuse and efficient weight storage for hardware-accelerated MLPs and RNNs
(2022-08-01)
Article
Open AccessDeep Neural Networks (DNNs) have achieved tremendous success for cognitive applications. The core operation in a DNN is the dot product between quantized inputs and weights. Prior works exploit the weight/input repetition ... -
Vector extensions in COTS processors to increase guaranteed performance in real-time systems
(2023-03)
Article
Open AccessThe need for increased application performance in high-integrity systems like those in avionics is on the rise as software continues to implement more complex functionalities. The prevalent computing solution for future ... -
Dynamic sampling rate: harnessing frame coherence in graphics applications for energy-efficient GPUs
(Springer Nature, 2022)
Article
Open AccessIn real-time rendering, a 3D scene is modelled with meshes of triangles that the GPU projects to the screen. They are discretized by sampling each triangle at regular space intervals to generate fragments which are then ... -
Fast and accurate SER estimation for large combinational blocks in early stages of the design
(Institute of Electrical and Electronics Engineers (IEEE), 2021-07-01)
Article
Open AccessSoft Error Rate (SER) estimation is an important challenge for integrated circuits because of the increased vulnerability brought by technology scaling. This paper presents a methodology to estimate in early stages of the ... -
Energy-efficient stream compaction through filtering and coalescing accesses in GPGPU memory partitions
(Institute of Electrical and Electronics Engineers (IEEE), 2022-07-01)
Article
Open AccessGraph-based applications are essential in emerging domains such as data analytics or machine learning. Data gathering in a knowledge-based society requires great data processing efficiency. High-throughput GPGPU architectures ... -
DNN pruning with principal component analysis and connection importance estimation
(2022-01)
Article
Open AccessDNN pruning reduces memory footprint and computational work of DNN-based solutions to improve performance and energy-efficiency. An effective pruning scheme should be able to systematically remove connections and/or neurons ...