Browsing by Author "Jovanovic, Petar"
Now showing items 1-20 of 21
A requirement-driven approach to the design and evolution of data warehouses
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Mayorova, Daria (2014-08-01)
Article
Restricted access - publisher's policy
Designing data warehouse (DW) systems in highly dynamic enterprise environments is not an easy task. At each moment, the multidimensional (MD) schema needs to satisfy the set of information requirements posed by the business ...

A unified view of data-intensive flows in business intelligence systems : a survey
Jovanovic, Petar; Romero Moral, Óscar; Abelló Gamazo, Alberto (2016-12)
Article
Open Access
Data-intensive flows are central processes in today’s business intelligence (BI) systems, deploying different technologies to deliver data, from a multitude of data sources, in user-preferred and analysis-ready formats. ...

BabbleFlow : a translator for analytic data flow programs
Jovanovic, Petar; Simitsis, Alkis; Wilkinson, Kevin (Association for Computing Machinery (ACM), 2014)
Conference lecture
Restricted access - publisher's policy
A complex analytic data flow may perform multiple, inter-dependent tasks where each task uses a different processing engine. Such a multi-engine flow, termed a hybrid flow, may comprise subflows written in more than one ...

Bijoux : data generator for evaluating ETL process quality
Nakuçi, Emona; Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto (2014)
Conference report
Restricted access - publisher's policy
Obtaining the right set of data for evaluating the fulfillment of different quality standards in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to ...

Classification of changes in API evolution
Koçi, Rediana; Franch Gutiérrez, Javier; Jovanovic, Petar; Abelló Gamazo, Alberto (Institute of Electrical and Electronics Engineers (IEEE), 2019)
Conference lecture
Open Access
Applications typically communicate with each other, accessing and exposing data and features by using Application Programming Interfaces (APIs). Even though API consumers expect APIs to be steady and well established, APIs ...

Data generator for evaluating ETL process quality
Theodorou, Vasileios; Jovanovic, Petar; Abelló Gamazo, Alberto; Nakuçi, Emona (Elsevier, 2017-01-01)
Article
Open Access
Obtaining the right set of data for evaluating the fulfillment of different quality factors in the extract-transform-load (ETL) process design is rather challenging. First, the real data might be out of reach due to different ...

H-word: Supporting job scheduling in Hadoop with workload-driven data redistribution
Jovanovic, Petar; Romero Moral, Óscar; Calders, Toon; Abelló Gamazo, Alberto (2016)
Conference report
Open Access
Today’s distributed data processing systems typically follow a query shipping approach and exploit data locality for reducing network traffic. In such systems the distribution of data over the cluster resources plays a ...

Incremental consolidation of data-intensive multi-flows
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2016-05-01)
Article
Open Access
Business intelligence (BI) systems depend on efficient integration of disparate and often heterogeneous data. The integration of data is governed by data-intensive flows and is driven by a set of information requirements. ...

Integrating ETL processes from information requirements
Romero Moral, Óscar; Jovanovic, Petar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
Conference report
Restricted access - publisher's policy
Data warehouse (DW) design is based on a set of requirements expressed as service level agreements (SLAs) and business level objects (BLOs). Populating a DW system from a set of information sources is realized with ...

Integration of Multidimensional and ETL design
Jovanovic, Petar (Universitat Politècnica de Catalunya, 2011-06-23)
Master thesis
Open Access
This project represents master thesis and the final project, on the Master in Computing program, at Technical University of Catalonia. Led by the motivations and goals previously expressed, this project consists of the ...

Intermediate results materialization selection and format for data-intensive flows
Munir, Rana Faisal; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Jovanovic, Petar; Thiele, Maik; Lehner, Wolfgang (2018-05-01)
Article
Restricted access - publisher's policy
Data-intensive flows deploy a variety of complex data transformations to build information pipelines from data sources to different end users. As data are processed, these workflows generate large intermediate results, ...

MapReduce performance model for Hadoop 2.x
Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (Elsevier, 2019-01)
Article
Open Access
MapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ...

MapReduce performance models for Hadoop 2.x
Glushkova, Daria; Jovanovic, Petar; Abelló Gamazo, Alberto (CEUR-WS.org, 2017)
Conference report
Open Access
MapReduce is a popular programming model for distributed processing of large data sets. Apache Hadoop is one of the most common open-source implementations of such paradigm. Performance analysis of concurrent job executions ...

Quarry
Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
Software
Restricted access - confidentiality agreement

Quarry : digging up the gems of your data treasury
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Candón Arenas, Héctor; Nadal Francesch, Sergi (2015)
Conference lecture
Open Access
The design lifecycle of a data warehousing (DW) system is primarily led by requirements of its end-users and the complexity of underlying data sources. The process of designing a multidimensional (MD) schema and back-end ...

Quarry: A user-centered big data integration platform
Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2020-04-18)
Article
Restricted access - publisher's policy
Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data ...

Requirement-driven creation and deployment of multidimensional and ETL designs
Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
Conference report
Open Access
We present our tool for assisting designers in the error-prone and time-consuming tasks carried out at the early stages of a data warehousing project. Our tool semi-automatically produces multidimensional (MD) and ETL ...

Requirement-driven design and optimization of data-intensive flows
Jovanovic, Petar (2016-09-26)
Doctoral thesis
Open Access
Covenantee: Université libre de Bruxelles
Data have become number one assets of today's business world. Thus, its exploitation and analysis attracted the attention of people from different fields and having different technical backgrounds. Data-intensive flows are ...

Supporting data integration tasks with semi-automatic ontology construction
Touma, Rizkallah; Romero Moral, Óscar; Jovanovic, Petar (Association for Computing Machinery (ACM), 2015)
Conference report
Restricted access - publisher's policy
Data integration aims to facilitate the exploitation of heterogeneous data by providing the user with a unified view of data residing in different sources. Currently, ontologies are commonly used to represent this unified ...

Towards automated data integration in software analytics
Martínez Fernández, Silverio Juan; Jovanovic, Petar; Franch Gutiérrez, Javier; Jedlitschka, Andreas (Association for Computing Machinery (ACM), 2018)
Conference lecture
Open Access
Software organizations want to be able to base their decisions on the latest set of available data and the real-time analytics derived from them. In order to support "real-time enterprise" for software organizations and ...