Now showing items 21-28 of 28

    • PRESISTANT: Learning based assistant for data pre-processing 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2019-09)
      Article
      Open Access
      Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task). A given data pre-processing operator can have positive, negative, or zero impact on the final ...
    • Quarry 

      Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
      Software
      Restricted access - confidentiality agreement
    • Quarry: A user-centered big data integration platform 

      Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2021-02)
      Article
      Open Access
      Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data ...
    • Reproducible experiments for generating pre-processing pipelines for AutoETL 

      Giovanelli, Joseph; Bilalli, Besim; Abelló Gamazo, Alberto; Silva Coira, Fernando; de Bernardo Roca, Guillermo (Elsevier, 2024-02)
      Article
      Restricted access - publisher's policy
      This work is a companion reproducibility paper of the experiments and results reported in Giovanelli et al. (2022), where data pre-processing pipelines are evaluated in order to find pipeline prototypes that reduce the ...
    • Resilient store: a heuristic-based data format selector for intermediate results 

      Munir, Rana Faisal; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim; Thiele, Maik; Lehner, Wolfgang (2016)
      Número de revista
      Open Access
      Large-scale data analysis is an important activity in many organizations that typically requires the deployment of data-intensive workflows. As data is processed these workflows generate large intermediate results, which ...
    • There is no data science without data governance: a proposal based on knowledge graphs 

      Bilalli, Besim; Jovanovic, Petar; Nadal Francesch, Sergi; Queralt Calafat, Anna; Romero Moral, Óscar (CEUR-WS.org, 2024)
      Conference lecture
      Open Access
      Data Science and data-driven Artificial Intelligence are here to stay and they are expected to further transform the current global economy. From a technical point of view, there is an overall agreement that disciplines ...
    • Towards intelligent data analysis : the metadata challenge 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
      Conference lecture
      Restricted access - publisher's policy
      Once analyzed correctly, data can yield substantial benefits. The process of analyzing the data and transforming it into knowledge is known as Knowledge Discovery in Databases (KDD). The plethora and subtleties of algorithms ...
    • Wrapper methods for multi-objective feature selection 

      Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca (OpenProceedings, 2023)
      Conference report
      Open Access
      The ongoing data boom has democratized the use of data for improved decision-making. Beyond gathering voluminous data, preprocessing the data is crucial to ensure that their most rele- vant aspects are considered during ...