Now showing items 1-13 of 13

    • A requirement-driven approach to the design and evolution of data warehouses 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Mayorova, Daria (2014-08-01)
      Article
      Restricted access - publisher's policy
      Designing data warehouse (DW) systems in highly dynamic enterprise environments is not an easy task. At each moment, the multidimensional (MD) schema needs to satisfy the set of information requirements posed by the business ...
    • BabbleFlow : a translator for analytic data flow programs 

      Jovanovic, Petar; Simitsis, Alkis; Wilkinson, Kevin (Association for Computing Machinery (ACM), 2014)
      Conference lecture
      Restricted access - publisher's policy
      A complex analytic data flow may perform multiple, inter-dependent tasks where each task uses a different processing engine. Such a multi-engine flow, termed a hybrid flow, may comprise subflows written in more than one ...
    • GEM: requirement-driven generation of ETL and multidimensional conceptual designs 

      Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2010-10-21)
      Research report
      Open Access
      At the early stages of a data warehouse design project, the main objective is to collect the business requirements and needs, and translate them into an appropriate conceptual, multidimensional design. Typically, this ...
    • Hyppo: using equivalences to optimize pipelines in exploratory machine learning 

      Kontaxakis, Antonios; Sacharidis, Dimitris; Simitsis, Alkis; Abelló Gamazo, Alberto; Nadal Francesch, Sergi (Institute of Electrical and Electronics Engineers (IEEE), 2024)
      Conference report
      Open Access
      We present HYPPO, a novel system to optimize pipelines encountered in exploratory machine learning. HYPPO exploits alternative computational paths of artifacts from past executions to derive better execution plans while ...
    • Incremental consolidation of data-intensive multi-flows 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (2016-05-01)
      Article
      Open Access
      Business intelligence (BI) systems depend on efficient integration of disparate and often heterogeneous data. The integration of data is governed by data-intensive flows and is driven by a set of information requirements. ...
    • INforE: Interactive cross-platform analytics for everyone 

      Giatrakos, Nikos; Arnu, David; Bitsakis, Theodoros; Deligiannakis, Antonios; Garofalakis, Minos; Klinkenberg, Ralf; Konidaris, Aris; Kontaxakis, Antonis; Kotidis Kotidis, Yannis; Samoladas, Vasilis; Simitsis, Alkis; Stamatakis, George; Temme, Fabian; Torok, Mate; Yaqub, Edwin; Montagud, Arnau; Ponce De Leon, Miguel; Arndt, Holger; Burkard, Stefan (Association for Computing Machinery (ACM), 2020-10)
      Conference lecture
      Open Access
      We present INforE, a prototype supporting non-expert programmers in performing optimized, cross-platform, streaming analytics at scale. INforE offers: a) a new extension to the RapidMiner Studio for graphical design of Big ...
    • Integrating ETL processes from information requirements 

      Romero Moral, Óscar; Jovanovic, Petar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
      Conference report
      Restricted access - publisher's policy
      Data warehouse (DW) design is based on a set of requirements expressed as service level agreements (SLAs) and business level objects (BLOs). Populating a DW system from a set of information sources is realized with ...
    • Performance analysis of distributed GPU-accelerated task-based workflows 

      Nogueira Lobo de Carvalho, Marcos; Queralt Calafat, Anna; Romero Moral, Óscar; Simitsis, Alkis; Tatu, Cristian; Badia Sala, Rosa Maria (OpenProceedings, 2024)
      Conference report
      Open Access
      We present an empirical approach to identify the key factors affecting the execution performance of task-based workflows on a High Performance Computing (HPC) infrastructure composed of heterogeneous CPU-GPU clusters. Our ...
    • Quarry : digging up the gems of your data treasury 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto; Candón Arenas, Héctor; Nadal Francesch, Sergi (2015)
      Conference lecture
      Open Access
      The design lifecycle of a data warehousing (DW) system is primarily led by requirements of its end-users and the complexity of underlying data sources. The process of designing a multidimensional (MD) schema and back-end ...
    • Requirement-driven creation and deployment of multidimensional and ETL designs 

      Jovanovic, Petar; Romero Moral, Óscar; Simitsis, Alkis; Abelló Gamazo, Alberto (Springer, 2012)
      Conference report
      Open Access
      We present our tool for assisting designers in the error-prone and time-consuming tasks carried out at the early stages of a data warehousing project. Our tool semi-automatically produces multidimensional (MD) and ETL ...
    • Semantic web technologies for business intelligence 

      Berlanga, Rafael; Romero Moral, Óscar; Simitsis, Alkis; Nebot, Victoria; Pedersen, Torben; Abelló Gamazo, Alberto; Aramburu, María José (IGI Global, 2011)
      Part of book or chapter of book
      Restricted access - publisher's policy
      This chapter describes the convergence of two of the most influential technologies in the last decade, namely business intelligence (BI) and the Semantic Web (SW). Business intelligence is used by almost any enterprise to ...
    • Using semantic web technologies for exploratory OLAP: A survey 

      Abelló Gamazo, Alberto; Romero Moral, Óscar; Pedersen, Torben; Berlanga, Rafael; Nebot, Victoria; Aramburu, María José; Simitsis, Alkis (2015-02-01)
      Article
      Open Access
    • xPAD: A platform for analytic data flows 

      Simitsis, Alkis; Wilkinson, Kevin; Jovanovic, Petar (2013)
      Conference report
      Restricted access - publisher's policy
      As enterprises become more automated, real-time, and data-driven, they need to integrate new data sources and specialized processing engines. The traditional business intelligence architecture of Extract-Transform-Load ...