Recent Submissions

  • Studying the impact of the Full-Network embedding on multimodal pipelines 

    Vilalta, Armand; Garcia-Gasulla, Dario; Pares, Ferran; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José; Moya-Sánchez, Ulises; Cortés García, Claudio Ulises (IOS Press, 2018)
    Article
    Open Access
    The current state of the art for image annotation and image retrieval tasks is obtained through deep neural network multimodal pipelines, which combine an image representation and a text representation into a shared embedding ...
  • Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD 

    Rodríguez Sánchez, Rafael; Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Tomás Domínguez, Andrés Enrique (2019-02-01)
    Article
    Open Access
    We address the reduction to compact band forms, via unitary similarity transformations, for the solution of symmetric eigenvalue problems and the computation of the singular value decomposition (SVD). Concretely, in the ...
  • Two-sided orthogonal reductions to condensed forms on asymmetric multicore processors 

    Alonso Jordá, Pedro; Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael (2018-10)
    Article
    Restricted access - publisher's policy
    We investigate how to leverage the heterogeneous resources of an Asymmetric Multicore Processor (AMP) in order to deliver high performance in the reduction to condensed forms for the solution of dense eigenvalue and ...
  • Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors 

    Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael (2018-08)
    Article
    Restricted access - publisher's policy
    We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct “asymmetric” multicore scenarios. The first one corresponds to an actual hardware-asymmetric ...
  • On the maturity of parallel applications for asymmetric multi-core processors 

    Chronaki, Kallia; Moreto Planas, Miquel; Casas, Marc; Rico, Alejandro; Badia Sala, Rosa Maria; Ayguadé Parra, Eduard; Valero Cortés, Mateo (Elsevier, 2019-05-01)
    Article
    Restricted access - publisher's policy
    Asymmetric multi-cores (AMCs) are a successful architectural solution for both mobile devices and supercomputers. By maintaining two types of cores (fast and slow) AMCs are able to provide high performance under the facility ...
  • Sampled simulation of task-based programs 

    Grass, Thomas; Carlson, Trevor E.; Rico Carro, Alejandro; Ceballos, Germán; Ayguadé Parra, Eduard; Casas Guix, Marc; Moreto Planas, Miquel (Institute of Electrical and Electronics Engineers (IEEE), 2019-02-01)
    Article
    Open Access
    Sampled simulation is a mature technique for reducing simulation time of single-threaded programs. Nevertheless, current sampling techniques do not take advantage of other execution models, like task-based execution, to ...
  • Memory controller for vector processor 

    Hussain, Tassadaq; Palomar, Oscar; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Ayguadé Parra, Eduard (Springer, 2018-11)
    Article
    Open Access
    To manage power and memory wall effects, the HPC industry supports FPGA reconfigurable accelerators and vector processing cores for data-intensive scientific applications. FPGA-based vector accelerators are used to increase ...
  • A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting 

    Catalán Pallarés, Sandra; Herrero Zaragoza, José Ramón; Quintana Ortí, Enrique Salvador; Rodríguez Sánchez, Rafael; Van De Geijn, Robert (Institute of Electrical and Electronics Engineers (IEEE), 2019-01-31)
    Article
    Open Access
    We propose two novel techniques for overcoming load-imbalance encountered when implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for the solution of linear systems. Both techniques target ...
  • Advances in the Hierarchical Emergent Behaviors (HEB) approach to autonomous vehicles 

    Roca, Damian; Milito, Rodolfo; Nemirovsky, Mario; Valero Cortés, Mateo (2018-11-13)
    Article
    Open Access
    Widespread deployment of autonomous vehicles (AVs) presents formidable challenges in terms of handling scalability and complexity, particularly regarding vehicular reaction in the face of unforeseen corner cases. Hierarchical ...
  • Exploring the capabilities of support vector machines in detecting silent data corruptions 

    Subasi, Omer; Di, Sheng; Bautista-Gomez, Leonardo; Balaprakash, Prasanna; Unsal, Osman Sabri; Labarta Mancho, Jesús José; Cristal Kestelman, Adrián; Krishnamoorthy, Sriram; Cappello, Franck (Elsevier, 2018-09)
    Article
    Restricted access - publisher's policy
    As the exascale era approaches, the increasing capacity of high-performance computing (HPC) systems with targeted power and energy budget goals introduces significant challenges in reliability. Silent data corruptions ...
  • Simulating the behavior of the human brain on GPUs 

    Valero-Lara, Pedro; Martinez-Perez, Ivan; Sirvent, Raul; Pena, A. J.; Martorell Bofill, Xavier; Labarta Mancho, Jesús José (2018-01-01)
    Article
    Open Access
    The simulation of the behavior of the Human Brain is one of the most important challenges in computing today. The main problem consists of finding efficient ways to manipulate and compute the huge volume of data that this ...
  • ReD: A reuse detector for content selection in exclusive shared last-level caches 

    Díaz, Javier; Monreal Arnal, Teresa; Ibáñez Marín, Pablo Enrique; Llaberia Griñó, José M.; Viñals Yúfera, Víctor (Elsevier, 2019-03)
    Article
    Restricted access - publisher's policy
    The reference stream reaching a chip multiprocessor Shared Last-Level Cache (SLLC) shows poor temporal locality, making conventional cache management policies inefficient. Few proposals address this problem for exclusive ...
