    • A framework for assessing the peer review duration of journals: case study in computer science 

      Bilalli, Besim; Munir, Rana Faisal; Abelló Gamazo, Alberto (Springer Nature, 2021-01)
      Article
      Restricted access - publisher's policy
      In various fields, scientific article publication is a measure of productivity and in many occasions it is used as a critical factor for evaluating researchers. Therefore, a lot of time is dedicated to writing articles ...
    • Automated data pre-processing via meta-learning 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
      Conference report
      Open Access
      A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
    • Effective data pre-processing for AutoML 

      Giovanelli, Joseph; Bilalli, Besim; Abelló Gamazo, Alberto (CEUR-WS.org, 2021)
      Conference report
      Open Access
      Data pre-processing plays a key role in a data analytics process (e.g., supervised learning). It encompasses a broad range of activities that span from correcting errors to selecting the most relevant features for the ...
    • Intelligent assistance for data pre-processing 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2017-06-03)
      Article
      Open Access
      A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
    • Learning the impact of data pre-processing in data analysis 

      Bilalli, Besim (Universitat Politècnica de Catalunya, 2018-06-28)
      Doctoral thesis
      Open Access
      Covenantee: Politechnika Poznańska
      There is a clear correlation between data availability and data analytics, and hence with the increase of data availability --- unavoidable according to Moore's law, the need for data analytics increases too. This certainly ...
    • On the predictive power of meta-features in OpenML 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs (2017-12-20)
      Article
      Open Access
      The demand for performing data analysis is steadily rising. As a consequence, people of different profiles (i.e., non-experienced users) have started to analyze their data. However, this is challenging for them. A key step ...
    • PRESISTANT : data pre-processing assistant 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Munir, Rana Faisal; Wrembel, Robert (Springer, 2019)
      Conference lecture
      Open Access
      A concrete classification algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the ...
    • PRESISTANT: Learning based assistant for data pre-processing 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2019-09)
      Article
      Restricted access - publisher's policy
      Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task). A given data pre-processing operator can have positive, negative, or zero impact on the final ...
    • Quarry 

      Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
      Software
      Restricted access - confidentiality agreement
    • Quarry: A user-centered big data integration platform 

      Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2021-02)
      Article
      Open Access
      Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data ...
    • Resilient store: a heuristic-based data format selector for intermediate results 

      Munir, Rana Faisal; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim; Thiele, Maik; Lehner, Wolfgang (2016)
      Journal issue
      Open Access
      Large-scale data analysis is an important activity in many organizations that typically requires the deployment of data-intensive workflows. As data is processed these workflows generate large intermediate results, which ...
    • Towards intelligent data analysis : the metadata challenge 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
      Conference lecture
      Restricted access - publisher's policy
      Once analyzed correctly, data can yield substantial benefits. The process of analyzing the data and transforming it into knowledge is known as Knowledge Discovery in Databases (KDD). The plethora and subtleties of algorithms ...