Now showing items 21-40 of 87

    • Configuring parallelism for hybrid layouts using multi-objective optimization 

      Munir, Rana Faisal; Abelló Gamazo, Alberto; Romero Moral, Óscar; Thiele, Maik; Lehner, Wolfgang (2020-06-01)
      Article
      Open Access
      Modern organizations typically store their data in a raw format in data lakes. These data are then processed and usually stored under hybrid layouts, because they allow projection and selection operations. Thus, they allow ...
    • Data engineering for data science: two sides of the same coin 

      Romero Moral, Óscar; Wrembel, Robert (Springer, 2020)
      Conference report
      Open Access
      A de facto technological standard of data science is based on notebooks (e.g., Jupyter), which provide an integrated environment to execute data workflows in different languages. However, from a data engineering point of ...
    • Dimensional enrichment of statistical linked open data 

      Varga, Jovan; Vaisman, Alejandro; Romero Moral, Óscar; Etcheverry, Lorena; Bach Pedersen, Torben; Thomsen, Christian (2016-10-01)
      Article
      Open Access
      On-Line Analytical Processing (OLAP) is a data analysis technique typically used for local and well-prepared data. However, initiatives like Open Data and Open Government bring new and publicly available data on the web ...
    • Discovering functional dependencies from ontologies 

      Romero Moral, Óscar; Calvanese, Diego; Abelló Gamazo, Alberto; Rodríguez Muro, Mariano (2009-04)
      Research report
      Open Access
      Discovering functional dependencies is a fundamental step in the design of relational databases and in most system reengineering processes, such as system maintainability and redesign. Typically, this task has been performed ...
    • Discovering functional dependencies from ontologies 

      Romero Moral, Óscar; Calvanese, Diego; Abelló Gamazo, Alberto; Rodríguez Muro, Mariano (Association for Computing Machinery (ACM), 2009)
      Conference report
      Restricted access - publisher's policy
      Nowadays, it is widely accepted that the data warehouse design task should be largely automated. Furthermore, the data warehouse conceptual schema must be structured according to the multidimensional model and as a ...
    • Discovering meaningful keys from ontologies 

      Romero Moral, Óscar; Abelló Gamazo, Alberto; Montesó, Joan Marc (2009-07)
      Research report
      Open Access
      Object identification is a crucial step in most information systems. Nowadays, we have many different ways to identify entities such as surrogates, keys and object identifiers. However, not all of them guarantee the entity ...
    • Distributed databases 

      Romero Moral, Óscar; Oliva, Marta (Universitat Politècnica de Catalunya, 2012)
      Lecture notes
      Open Access
    • DS-Prox : dataset proximity mining for governing the data lake 

      Al-serafi, Ayman Mounir Mohamed; Calders, Toon; Abelló Gamazo, Alberto; Romero Moral, Óscar (Springer, 2017)
      Conference report
      Open Access
      With the arrival of Data Lakes (DL) there is an increasing need for efficient dataset classification to support data analysis and information retrieval. Our goal is to use meta-features describing datasets to detect whether ...
    • DSS from an RE perspective: A systematic mapping 

      García, Stephany; Romero Moral, Óscar; Raventós Pagès, Ruth (2016-07)
      Article
      Open Access
      Decision support systems (DSS) provide a unified analytical view of business data to better support decision-making processes. Such systems have shown a high level of user satisfaction and return on investment. However, ...
    • E-assessment of relational database skills by means of LearnSQL 

      Quer, Carme; Abelló Gamazo, Alberto; Burgués Illa, Xavier; Casany Guerrero, María José; Martín Escofet, Carme; Rodríguez González, M. Elena; Romero Moral, Óscar; Urpí Tubella, Antoni (International Association of Technology, Education and Development (IATED), 2017)
      Conference report
      Open Access
      LearnSQL is a software system that allows the automatic and efficient e-learning and e-assessment of relational database skills. It has been used at the Barcelona School of Informatics for 18 semesters with an average of ...
    • Effective and scalable data discovery with NextiaJD 

      Flores Herrera, Javier de Jesús; Nadal Francesch, Sergi; Romero Moral, Óscar (OpenProceedings, 2021)
      Conference lecture
      Open Access
      We present NextiaJD, a data discovery system with high predictive performance and computational efficiency. NextiaJD aids data scientists in the discovery of datasets that can be crossed. To that end, it proposes a ranking ...
    • GEM: requirement-driven generation of ETL and multidimensional conceptual designs 

      Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2010-10-21)
      Research report
      Open Access
      At the early stages of a data warehouse design project, the main objective is to collect the business requirements and needs, and translate them into an appropriate conceptual, multidimensional design. Typically, this ...
    • Graph BI & analytics: current state and future challenges 

      Ghrab, Amine; Romero Moral, Óscar; Jouili, Salim; Skhiri, Sabri (Springer, 2018)
      Conference report
      Open Access
      In an increasingly competitive market, making well-informed decisions requires the analysis of a wide range of heterogeneous, large and complex data. This paper focuses on the emerging field of graph warehousing. Graphs ...
    • Graph-driven federated data management 

      Nadal Francesch, Sergi; Abelló Gamazo, Alberto; Romero Moral, Óscar; Vansummeren, Stijn; Vassiliadis, Panos (2023-01-01)
      Article
      Open Access
      Modern data analysis applications, require the ability to provide on-demand integration of data sources while offering a flexible and user-friendly query interface. Traditional techniques for answering queries using views, ...
    • Graph-driven federated data management (extended abstract) 

      Nadal Francesch, Sergi; Abelló Gamazo, Alberto; Romero Moral, Óscar; Vansummeren, Stijn; Vassiliadis, Panos (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Conference report
      Open Access
      Modern data analysis applications require the ability to provide on-demand integration of data sources while offering a user-friendly query interface. Traditional methods for answering queries using views, focused on a ...
    • H-word: Supporting job scheduling in Hadoop with workload-driven data redistribution 

      Jovanovic, Petar; Romero Moral, Óscar; Calders, Toon; Abelló Gamazo, Alberto (2016)
      Conference report
      Open Access
      Today’s distributed data processing systems typically follow a query shipping approach and exploit data locality for reducing network traffic. In such systems the distribution of data over the cluster resources plays a ...
    • HealthMesh: An architectural framework for federated healthcare data management 

      Bisquert Parés, Aniol; Hmimou Ham Man, Achraf; Berral García, Josep Lluís; Gutiérrez Torre, Alberto; Romero Moral, Óscar (CEUR-WS.org, 2024)
      Conference report
      Open Access
      Recently, significant milestones have been achieved in the field of healthcare data analysis. However, alongside these accomplishments, substantial data-related challenges have emerged in the domain of big data management. ...
    • High-level ETL for semantic data warehouses 

      Deb Nath, Rudra Pratap; Romero Moral, Óscar; Pedersen, Torben Bach; Hose, Katja (2022)
      Article
      Open Access
      The popularity of the Semantic Web (SW) encourages organizations to organize and publish semantic data using the RDF model. This growth poses new requirements to Business Intelligence (BI) technologies to enable On-Line ...
    • Incremental consolidation of data-intensive multi-flows 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2016-05-01)
      Article
      Open Access
      Business intelligence (BI) systems depend on efficient integration of disparate and often heterogeneous data. The integration of data is governed by data-intensive flows and is driven by a set of information requirements. ...
    • Incremental schema integration for data wrangling via knowledge graphs 

      Flores Herrera, Javier de Jesús; Rabbani, Kashif; Nadal Francesch, Sergi; Gómez Seoane, Cristina; Romero Moral, Óscar; Jamin, Emmanuel; Dasiopoulou, Stamatia (2024-05-14)
      Article
      Open Access
      Virtual data integration is the current approach to go for data wrangling in data-driven decision-making. In this paper, we focus on automating schema integration, which extracts a homogenised representation of the data ...