• A city of cities: Measuring how 15-minutes urban accessibility shapes human mobility in Barcelona 

      Graells Garrido, Eduardo; Serra Burriel, Feliu; Rowe, Francisco; Cucchietti, Fernando; Reyes Valenzuela, Patricio Alejandro (Public Library of Science (PLOS), 2021-05-05)
      Artículo
      Acceso abierto
      As cities expand, human mobility has become a central focus of urban planning and policy making to make cities more inclusive and sustainable. Initiatives such as the “15-minutes city” have been put in place to shift the ...
    • A comparative study of traditional and data-driven approaches in the project management performance 

      Hannemann, Ian-Hendrik Steffen (Universitat Politècnica de Catalunya, 2024-02-09)
      Projecte Final de Màster Oficial
      Acceso abierto
      This thesis undertook a comprehensive exploration, weaving together theoretical foundations, practical applications, and real-world implications at the intersection of project management and advanced technologies. The ...
    • A compromise archive platform for monitoring infrastructures 

      García Calatrava, Carlos; Cucchietti, Fernando; Becerra Fontal, Yolanda (Barcelona Supercomputing Center, 2020-05)
      Texto en actas de congreso
      Acceso abierto
      The great advancement in the technological field has led to an explosion in the amount of generated data. Many different sectors have understood the opportunity that acquiring, storing, and analyzing further information ...
    • A cost-based storage format selector for materialized results in big data frameworks 

      Munir, Rana Faisal; Abelló Gamazo, Alberto; Romero Moral, Óscar; Thiele, Maik; Lehner, Wolfgang (2019-05-08)
      Artículo
      Acceso abierto
      Modern big data frameworks (such as Hadoop and Spark) allow multiple users to do large-scale analysis simultaneously, by deploying data-intensive workflows (DIWs). These DIWs of different users share many common tasks (i.e, ...
    • A data quality framework for graph-based virtual data integration systems 

      Li, Yalei; Nadal Francesch, Sergi; Romero Moral, Óscar (Springer, 2022)
      Texto en actas de congreso
      Acceso abierto
      Data Quality (DQ) plays a critical role in data integration. Up to now, DQ has mostly been addressed from a single database perspective. Popular DQ frameworks rely on Integrity Constraints (IC) to enforce valid application ...
    • A fast supervised density-based discretization algorithm for classification tasks in the medical domain 

      Aristodimou, Aristos; Diavastos, Andreas; Pattichis, Constantinos (2022-02-01)
      Artículo
      Acceso abierto
      Discretization is a preprocessing technique used for converting continuous features into categorical. This step is essential for processing algorithms that cannot handle continuous data as input. In addition, in the big ...
    • A general guide to applying machine learning to computer architecture 

      Nemirovsky, Daniel; Arkose, Tugberk; Markovic, Nikola; Nemirovsky, Mario; Unsal, Osman Sabri; Cristal Kestelman, Adrián; Valero Cortés, Mateo (2018)
      Artículo
      Acceso abierto
      The resurgence of machine learning since the late 1990s has been enabled by significant advances in computing performance and the growth of big data. The ability of these algorithms to detect complex patterns in data which ...
    • A Graph Convolutional Network-based Framework for Federated Data Product Discovery 

      Bisquert Parés, Aniol (Universitat Politècnica de Catalunya, 2024-10-23)
      Projecte Final de Màster Oficial
      Acceso abierto
      Big data presents a novel opportunity for data-driven decision-making, and modern organizations acknowledge its potential. Being able to use these data collaboratively through advanced data-driven applications that use ...
    • A methodology for Spark parameter tuning 

      Gounaris, Anastasios; Torres Viñals, Jordi (2017-05-19)
      Artículo
      Acceso abierto
      Spark has been established as an attractive platform for big data analysis, since it manages to hide most of the complexities related to parallelism, fault tolerance and cluster setting from developers. However, this comes ...
    • A new reliability-based data-driven approach to simulation-based models 

      Ayensa Jiménez, Jacobo; Doweidar, Mohamed Hamdy; Doblaré Castellano, Manuel (Barcelona Supercomputing Center, 2017-05-04)
      Texto en actas de congreso
      Acceso abierto
      Data Science has burst into simulation-based en-gineering sciences with an impressive impulse. However, data are never uncertainty-free and a suitable approach is needed to face data measurement errors and their intrinsic ...
    • A New role of ontologies and advanced scientific visualization in big data analytics 

      Chuprina, Svetlana (Barcelona Supercomputing Center, 2016-09-10)
      Texto en actas de congreso
      Acceso abierto
      Accessing and contextual semantic searching structured, semi-structured and unstructured information resources and their ontology based analysis in a uniform way across text-free Big Data query implementation is a main ...
    • A programming model for hybrid workflows: combining task-based workflows and dataflows all-in-one 

      Ramón Cortés, Cristian; Lordan Gomis, Francesc; Ejarque Artigas, Jorge; Badia Sala, Rosa Maria (Elsevier, 2020-12)
      Artículo
      Acceso abierto
      In the past years, e-Science applications have evolved from large-scale simulations executed in a single cluster to more complex workflows where these simulations are combined with High-Performance Data Analytics (HPDA). ...
    • A Quick View on Current Techniques and Machine Learning Algorithms for Big Data Analytics 

      Berral García, Josep Lluís (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Texto en actas de congreso
      Acceso abierto
      Big-data is an excellent source of knowledge and information from our systems and clients, but dealing with such amount of data requires automation, and this brings us to data mining and machine leaming techniques. In ...
    • A resilient and distributed near real-time traffic forecasting application for Fog computing environments 

      Pérez, Juan Luis; Gutiérrez Torre, Alberto; Berral García, Josep Lluís; Carrera Pérez, David (Elsevier, 2018-10-01)
      Artículo
      Acceso abierto
      In this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ...
    • A resilient and distributed near real-time traffic forecasting application for Fog computing environments 

      Pérez, Juan L.; Gutierrez-Torre, Alberto; Berral García, Josep Lluís; Carrera, David (Elsevier, 2018-10)
      Artículo
      Acceso abierto
      In this paper we propose an architecture for a city-wide traffic modeling and prediction service based on the Fog Computing paradigm. The work assumes an scenario in which a number of distributed antennas receive data ...
    • A scalable synthetic traffic model of Graph500 for computer networks analysis 

      Fuentes Sáez, Pablo; Benito, Mariano; Vallejo, Enrique; Bosque Orero, José Luis; Beivide Palacio, Ramon; Anghel, Andreea; Rodríguez Herrera, Germán; Gusat, Mitch; Minkenberg, Cyriel; Valero Cortés, Mateo (2017-12-25)
      Artículo
      Acceso abierto
      The Graph500 benchmark attempts to steer the design of High-Performance Computing systems to maximize the performance under memory-constricted application workloads. A realistic simulation of such benchmarks for architectural ...
    • A software reference architecture for semantic-aware big data systems 

      Nadal Francesch, Sergi; Herrero Otal, Víctor; Romero Moral, Óscar; Abelló Gamazo, Alberto; Franch Gutiérrez, Javier; Vansummeren, Stijn; Valerio, Danilo (2016-06-13)
      Artículo
      Acceso abierto
      Context: Big Data systems are a class of software systems that ingest, store, process and serve massive amounts of heterogeneous data, from multiple sources. Despite their undisputed impact in current society, their ...
    • A study of data-driven decision making in the insurance industry 

      Vilà Calopa, Josep Maria (Universitat Politècnica de Catalunya, 2023-02-06)
      Projecte Final de Màster Oficial
      Acceso restringido por decisión del autor
      Realizado en/con:   Politecnico di Torino
      En aquest estudi es tractarà de explicar el potencial que el big data té en el món de les asseguradores. S’iniciarà amb una mica d’història per entendre la progressió que ha tingut la indústria. Això ajudarà a projectar ...
    • A systematic approach to assess the suitability of data for process mining 

      Salvan, Emma (Universitat Politècnica de Catalunya, 2023-06-30)
      Projecte Final de Màster Oficial
      Acceso restringido por acuerdo de confidencialidad
      In today's data-driven business landscape, the volume of information available and generated is unprecedented. The growing availability of data has sparked increased interest in process mining - a powerful approach for ...
    • About offline evaluation of ML/DL models in Amazon search engine 

      De La Hoz González, Iván (Universitat Politècnica de Catalunya, 2022-06-28)
      Trabajo final de grado
      Acceso restringido por acuerdo de confidencialidad