Browsing by Author "Jovanovic, Petar"
Now showing items 1-20 of 26
-
A data-driven approach to measure the usability of Web APIs
Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Institute of Electrical and Electronics Engineers (IEEE), 2020)
Conference report
Open AccessApplication Programming Interfaces (APIs) are means of communication between applications, hence they can be seen as user interfaces, just with different kind of users, i.e., software or computers. However, the very first ... -
A requirement-driven approach to the design and evolution of data warehouses
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Mayorova, Daria (2014-08-01)
Article
Restricted access - publisher's policyDesigning data warehouse (DW) systems in highly dynamic enterprise environments is not an easy task. At each moment, the multidimensional (MD) schema needs to satisfy the set of information requirements posed by the business ... -
A unified view of data-intensive flows in business intelligence systems : a survey
Jovanovic, Petar; Romero Moral, Óscar; Abelló Gamazo, Alberto (2016-12)
Article
Open AccessData-intensive flows are central processes in today’s business intelligence (BI) systems, deploying different technologies to deliver data, from a multitude of data sources, in user-preferred and analysis-ready formats. ... -
BabbleFlow : a translator for analytic data flow programs
Jovanovic, Petar; Simitsis, Alkis; Wilkinson, Kevin (Association for Computing Machinery (ACM), 2014)
Conference lecture
Restricted access - publisher's policyA complex analytic data flow may perform multiple, inter-dependent tasks where each task uses a different processing engine. Such a multi-engine flow, termed a hybrid flow, may comprise subflows written in more than one ... -
Bijoux : data generator for evaluating ETL process quality
Nakuçi, Emona; Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto (2014)
Conference report
Restricted access - publisher's policyObtaining the right set of data for evaluating the fulfillment of different quality standards in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to ... -
Classification of changes in API evolution
Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Institute of Electrical and Electronics Engineers (IEEE), 2019)
Conference lecture
Open AccessApplications typically communicate with each other, accessing and exposing data and features by using Application Programming Interfaces (APIs). Even though API consumers expect APIs to be steady and well established, APIs ... -
Data generator for evaluating ETL process quality
Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto; Nakuçi, Emona (Elsevier, 2017-01-01)
Article
Open AccessObtaining the right set of data for evaluating the fulfillment of different quality factors in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to different ... -
H-word: Supporting job scheduling in Hadoop with workload-driven data redistribution
Jovanovic, Petar; Romero Moral, Óscar; Calders, Toon; Abelló Gamazo, Alberto (2016)
Conference report
Open AccessToday’s distributed data processing systems typically follow a query shipping approach and exploit data locality for reducing network traffic. In such systems the distribution of data over the cluster resources plays a ... -
Improving Web API usage logging
Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Springer, 2021)
Conference report
Open AccessA Web API (WAPI) is a type of API whose interaction with its consumers is done through the Internet. While being accessed through the Internet can be challenging, mostly when WAPIs evolve, it gives providers the possibility ... -
Incremental consolidation of data-intensive multi-flows
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2016-05-01)
Article
Open AccessBusiness intelligence (BI) systems depend on efficient integration of disparate and often heterogeneous data. The integration of data is governed by data-intensive flows and is driven by a set of information requirements. ... -
Integrating ETL processes from information requirements
Romero Moral, Óscar; Jovanovic, Petar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
Conference report
Restricted access - publisher's policyData warehouse (DW) design is based on a set of requirements expressed as service level agreements (SLAs) and business level objects (BLOs). Populating a DW system from a set of information sources is realized with ... -
Integration of Multidimensional and ETL design
Jovanovic, Petar (Universitat Politècnica de Catalunya, 2011-06-23)
Master thesis
Open AccessThis project represents master thesis and the final project, on the Master in Computing program, at Technical University of Catalonia. Led by the motivations and goals previously expressed, this project consists of the ... -
Intermediate results materialization selection and format for data-intensive flows
Munir, Rana Faisal; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Jovanovic, Petar; Thiele, Maik; Lehner, Wolfgang (2018-05-01)
Article
Restricted access - publisher's policyData-intensive flows deploy a variety of complex data transformations to build information pipelines from data sources to different end users. As data are processed, these workflows generate large intermediate results, ... -
Mapreduce performance model for Hadoop 2.x
Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (Elsevier, 2019-01)
Article
Open AccessMapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ... -
MapReduce performance models for Hadoop 2.x
Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (CEUR-WS.org, 2017)
Conference report
Open AccessMapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ... -
Operationalizing and automating data governance
Nadal Francesch, Sergi; Jovanovic, Petar; Bilalli, Besim; Romero Moral, Óscar (Springer Nature, 2022-12-10)
Article
Open AccessThe ability to cross data from multiple sources represents a competitive advantage for organizations. Yet, the governance of the data lifecycle, from the data sources into valuable insights, is largely performed in an ... -
PatternLens: Inferring evolutive patterns from web API usage logs
Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Springer, 2021)
Conference report
Open AccessThe use of web Application Programming Interfaces (WAPIs) has experienced a boost in recent years. Developers (i.e., WAPI consumers) are continuously relying on third-party WAPIs to incorporate certain features into their ... -
Quarry
Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
Software
Restricted access - confidentiality agreement -
Quarry : digging up the gems of your data treasury
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Candón Arenas, Héctor; Nadal Francesch, Sergi (2015)
Conference lecture
Open AccessThe design lifecycle of a data warehousing (DW) system is primarily led by requirements of its end-users and the complexity of underlying data sources. The process of designing a multidimensional (MD) schema and back-end ... -
Quarry: A user-centered big data integration platform
Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2021-02)
Article
Open AccessObtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data ...