Now showing items 1-9 of 9

  • Automated data pre-processing via meta-learning 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
    Conference report
    Open Access
    A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
  • Intelligent assistance for data pre-processing 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2017-06-03)
    Article
    Open Access
    A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
  • Learning the impact of data pre-processing in data analysis 

    Bilalli, Besim (Universitat Politècnica de Catalunya, 2018-06-28)
    Doctoral thesis
    Open Access
    Covenantee:  Politechnika Poznańska
    There is a clear correlation between data availability and data analytics, and hence with the increase of data availability --- unavoidable according to Moore's law, the need for data analytics increases too. This certainly ...
  • On the predictive power of meta-features in OpenML 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs (2017-12-20)
    Article
    Open Access
    The demand for performing data analysis is steadily rising. As a consequence, people of different profiles (i.e., non-experienced users) have started to analyze their data. However, this is challenging for them. A key step ...
  • PRESISTANT : data pre-processing assistant 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Munir, Rana Faisal; Wrembel, Robert (Springer, 2019)
    Conference lecture
    Open Access
    A concrete classification algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the ...
  • PRESISTANT: Learning based assistant for data pre-processing 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2019-08-17)
    Article
    Restricted access - publisher's policy
    Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task). A given data pre-processing operator can have positive, negative, or zero impact on the final ...
  • Quarry 

    Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
    Computer program
    Restricted access - confidentiality agreement
  • Resilient store: a heuristic-based data format selector for intermediate results 

    Munir, Rana Faisal; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim; Thiele, Maik; Lehner, Wolfgang (2016)
    Journal
    Open Access
    Large-scale data analysis is an important activity in many organizations that typically requires the deployment of data-intensive workflows. As data is processed these workflows generate large intermediate results, which ...
  • Towards intelligent data analysis : the metadata challenge 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
    Conference lecture
    Restricted access - publisher's policy
    Once analyzed correctly, data can yield substantial benefits. The process of analyzing the data and transforming it into knowledge is known as Knowledge Discovery in Databases (KDD). The plethora and subtleties of algorithms ...