Now showing items 1-20 of 24

    • A data-driven approach to measure the usability of Web APIs 

      Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Institute of Electrical and Electronics Engineers (IEEE), 2020)
      Conference report
      Open Access
      Application Programming Interfaces (APIs) are means of communication between applications, hence they can be seen as user interfaces, just with different kind of users, i.e., software or computers. However, the very first ...
    • A requirement-driven approach to the design and evolution of data warehouses 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Mayorova, Daria (2014-08-01)
      Article
      Restricted access - publisher's policy
      Designing data warehouse (DW) systems in highly dynamic enterprise environments is not an easy task. At each moment, the multidimensional (MD) schema needs to satisfy the set of information requirements posed by the business ...
    • A unified view of data-intensive flows in business intelligence systems : a survey 

      Jovanovic, Petar; Romero Moral, Óscar; Abelló Gamazo, Alberto (2016-12)
      Article
      Open Access
      Data-intensive flows are central processes in today’s business intelligence (BI) systems, deploying different technologies to deliver data, from a multitude of data sources, in user-preferred and analysis-ready formats. ...
    • BabbleFlow : a translator for analytic data flow programs 

      Jovanovic, Petar; Simitsis, Alkis; Wilkinson, Kevin (Association for Computing Machinery (ACM), 2014)
      Conference lecture
      Restricted access - publisher's policy
      A complex analytic data flow may perform multiple, inter-dependent tasks where each task uses a different processing engine. Such a multi-engine flow, termed a hybrid flow, may comprise subflows written in more than one ...
    • Bijoux : data generator for evaluating ETL process quality 

      Nakuçi, Emona; Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto (2014)
      Conference report
      Restricted access - publisher's policy
      Obtaining the right set of data for evaluating the fulfillment of different quality standards in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to ...
    • Classification of changes in API evolution 

      Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Institute of Electrical and Electronics Engineers (IEEE), 2019)
      Conference lecture
      Open Access
      Applications typically communicate with each other, accessing and exposing data and features by using Application Programming Interfaces (APIs). Even though API consumers expect APIs to be steady and well established, APIs ...
    • Data generator for evaluating ETL process quality 

      Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto; Nakuçi, Emona (Elsevier, 2017-01-01)
      Article
      Open Access
      Obtaining the right set of data for evaluating the fulfillment of different quality factors in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to different ...
    • H-word: Supporting job scheduling in Hadoop with workload-driven data redistribution 

      Jovanovic, Petar; Romero Moral, Óscar; Calders, Toon; Abelló Gamazo, Alberto (2016)
      Conference report
      Open Access
      Today’s distributed data processing systems typically follow a query shipping approach and exploit data locality for reducing network traffic. In such systems the distribution of data over the cluster resources plays a ...
    • Improving Web API usage logging 

      Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Springer, 2021)
      Conference report
      Open Access
      A Web API (WAPI) is a type of API whose interaction with its consumers is done through the Internet. While being accessed through the Internet can be challenging, mostly when WAPIs evolve, it gives providers the possibility ...
    • Incremental consolidation of data-intensive multi-flows 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2016-05-01)
      Article
      Open Access
      Business intelligence (BI) systems depend on efficient integration of disparate and often heterogeneous data. The integration of data is governed by data-intensive flows and is driven by a set of information requirements. ...
    • Integrating ETL processes from information requirements 

      Romero Moral, Óscar; Jovanovic, Petar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
      Conference report
      Restricted access - publisher's policy
      Data warehouse (DW) design is based on a set of requirements expressed as service level agreements (SLAs) and business level objects (BLOs). Populating a DW system from a set of information sources is realized with ...
    • Integration of Multidimensional and ETL design 

      Jovanovic, Petar (Universitat Politècnica de Catalunya, 2011-06-23)
      Master thesis
      Open Access
      This project represents master thesis and the final project, on the Master in Computing program, at Technical University of Catalonia. Led by the motivations and goals previously expressed, this project consists of the ...
    • Intermediate results materialization selection and format for data-intensive flows 

      Munir, Rana Faisal; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Jovanovic, Petar; Thiele, Maik; Lehner, Wolfgang (2018-05-01)
      Article
      Restricted access - publisher's policy
      Data-intensive flows deploy a variety of complex data transformations to build information pipelines from data sources to different end users. As data are processed, these workflows generate large intermediate results, ...
    • Mapreduce performance model for Hadoop 2.x 

      Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (Elsevier, 2019-01)
      Article
      Open Access
      MapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ...
    • MapReduce performance models for Hadoop 2.x 

      Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (CEUR-WS.org, 2017)
      Conference report
      Open Access
      MapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ...
    • PatternLens: Inferring evolutive patterns from web API usage logs 

      Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Springer, 2021)
      Conference report
      Open Access
      The use of web Application Programming Interfaces (WAPIs) has experienced a boost in recent years. Developers (i.e., WAPI consumers) are continuously relying on third-party WAPIs to incorporate certain features into their ...
    • Quarry 

      Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
      Software
      Restricted access - confidentiality agreement
    • Quarry : digging up the gems of your data treasury 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Candón Arenas, Héctor; Nadal Francesch, Sergi (2015)
      Conference lecture
      Open Access
      The design lifecycle of a data warehousing (DW) system is primarily led by requirements of its end-users and the complexity of underlying data sources. The process of designing a multidimensional (MD) schema and back-end ...
    • Quarry: A user-centered big data integration platform 

      Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2021-02)
      Article
      Open Access
      Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data ...
    • Requirement-driven creation and deployment of multidimensional and ETL designs 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
      Conference report
      Open Access
      We present our tool for assisting designers in the error-prone and time-consuming tasks carried out at the early stages of a data warehousing project. Our tool semi-automatically produces multidimensional (MD) and ETL ...