    • A data-science pipeline to enable the interpretability of many-objective feature selection 

      Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca (CEUR-WS.org, 2024)
      Conference lecture
      Open Access
      Many-Objective Feature Selection (MOFS) approaches use four or more objectives to determine the relevance of a subset of features in a supervised learning task. As a consequence, MOFS typically returns a large set of ...
    • A framework for assessing the peer review duration of journals: case study in computer science 

      Bilalli, Besim; Munir, Rana Faisal; Abelló Gamazo, Alberto (Springer Nature, 2021-01)
      Article
      Open Access
      In various fields, scientific article publication is a measure of productivity, and on many occasions it is used as a critical factor for evaluating researchers. Therefore, a lot of time is dedicated to writing articles ...
    • Advances and challenges in automated malaria diagnosis using digital microscopy imaging with artificial intelligence tools: A review 

      Rubio Maturana, Carles; Oliveira, Allisson Dantas de; Nadal Francesch, Sergi; Bilalli, Besim; Abelló Gamazo, Alberto; López Codina, Daniel; Sayrol Clols, Elisa (Frontiers Media SA, 2022-11-15)
      Article
      Open Access
      Malaria is an infectious disease caused by parasites of the genus Plasmodium spp. It is transmitted to humans by the bite of an infected female Anopheles mosquito. It is the most common disease in resource-poor settings, ...
    • Automated data pre-processing via meta-learning 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
      Conference report
      Open Access
      A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
    • Data pre-processing pipeline generation for AutoETL 

      Giovanelli, Joseph; Bilalli, Besim; Abelló Gamazo, Alberto (Elsevier, 2022-09)
      Article
      Open Access
      Data pre-processing plays a key role in a data analytics process (e.g., applying a classification algorithm on a predictive task). It encompasses a broad range of activities that span from correcting errors to selecting ...
    • DOGO4ML: Development, operation and data governance for ML-based software systems 

      Ayala Martínez, Claudia Patricia; Bilalli, Besim; Gómez Seoane, Cristina; Martínez Fernández, Silverio Juan (CEUR-WS.org, 2022)
      Conference report
      Open Access
      Machine Learning based Software Systems (MLSS) are becoming increasingly pervasive in today’s society and can be found in virtually every domain. Building MLSS is challenging due to their interdisciplinary nature. MLSS ...
    • Effective data pre-processing for AutoML 

      Giovanelli, Joseph; Bilalli, Besim; Abelló Gamazo, Alberto (CEUR-WS.org, 2021)
      Conference report
      Open Access
      Data pre-processing plays a key role in a data analytics process (e.g., supervised learning). It encompasses a broad range of activities that span from correcting errors to selecting the most relevant features for the ...
    • Gestió de dades massives 

      Abelló Gamazo, Alberto; Bilalli, Besim (Institut d'Estudis Catalans (IEC), 2023-09-22)
      Article
      Open Access
      This article aims to give an overview of what big data management is, the problems it raises, and how solutions should be approached. The main difficulty is that no generic solution exists and one has to ...
    • iMAGING: a novel automated system for malaria diagnosis by using artificial intelligence tools and a universal low-cost robotized microscope 

      Rubio Maturana, Carles; Oliveira, Allisson Dantas de; Nadal Francesch, Sergi; Zarzuela Serrat, Francesc; Sulleiro Igual, Elena; Ruiz Marti, Edurne; Bilalli, Besim; Veiga Lluch, Anna; Espasa Soley, Mateu; Abelló Gamazo, Alberto; Pumarola Sunyer, Tomas; Segu Estruch, Marta; López Codina, Daniel; Sayrol Clols, Elisa; Joseph Munné, Joan (Frontiers Media SA, 2023-11-24)
      Article
      Open Access
      Malaria is one of the most prevalent infectious diseases in sub-Saharan Africa, with 247 million cases reported worldwide in 2021 according to the World Health Organization. Optical microscopy remains the gold standard ...
    • Impact of filter feature selection on classification: an empirical study 

      Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca (CEUR-WS.org, 2022)
      Conference lecture
      Open Access
      The high-dimensionality of Big Data poses challenges in data understanding and visualization. Furthermore, it leads to lengthy model building times in data analysis and poor generalization for machine learning models. ...
    • Intelligent assistance for data pre-processing 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2017-06-03)
      Article
      Open Access
      A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
    • Learning fishing information from AIS data 

      Pons Recasens, Gerard; Bilalli, Besim; Abelló Gamazo, Alberto; Blanco Sánchez, Santiago (Association for Computing Machinery (ACM), 2022)
      Conference report
      Open Access
      The Automatic Identification System (AIS) allows vessels to emit their position, speed and course while sailing. By international law, all large vessels (e.g., bigger than 15m in Europe) are required to provide such data. ...
    • Learning the impact of data pre-processing in data analysis 

      Bilalli, Besim (Universitat Politècnica de Catalunya, 2018-06-28)
      Doctoral thesis
      Open Access
      Covenantee:   Politechnika Poznańska
      There is a clear correlation between data availability and data analytics, and hence, with the increase of data availability (unavoidable according to Moore's law), the need for data analytics increases too. This certainly ...
    • On the predictive power of meta-features in OpenML 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs (2017-12-20)
      Article
      Open Access
      The demand for performing data analysis is steadily rising. As a consequence, people of different profiles (i.e., non-experienced users) have started to analyze their data. However, this is challenging for them. A key step ...
    • Operationalizing and automating data governance 

      Nadal Francesch, Sergi; Jovanovic, Petar; Bilalli, Besim; Romero Moral, Óscar (Springer Nature, 2022-12-10)
      Article
      Open Access
      The ability to cross data from multiple sources represents a competitive advantage for organizations. Yet, the governance of the data lifecycle, from the data sources into valuable insights, is largely performed in an ...
    • PRESISTANT: data pre-processing assistant 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Munir, Rana Faisal; Wrembel, Robert (Springer, 2019)
      Conference lecture
      Open Access
      A concrete classification algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the ...
    • PRESISTANT: Learning based assistant for data pre-processing 

      Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (Elsevier, 2019-09)
      Article
      Open Access
      Data pre-processing is one of the most time consuming and relevant steps in a data analysis process (e.g., classification task). A given data pre-processing operator can have positive, negative, or zero impact on the final ...
    • Quarry 

      Abelló Gamazo, Alberto; Romero Moral, Óscar; Jovanovic, Petar; Nadal Francesch, Sergi; Bilalli, Besim; Candón Arenas, Héctor; Mayorova, Daria; Thavornun, Varunya; Gil González, Daniel (2015-07-01)
      Software
      Restricted access - confidentiality agreement
    • Quarry: A user-centered big data integration platform 

      Jovanovic, Petar; Nadal Francesch, Sergi; Romero Moral, Óscar; Abelló Gamazo, Alberto; Bilalli, Besim (2021-02)
      Article
      Open Access
      Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so inevitably imposes burdensome processes of unifying different data ...
    • Reproducible experiments for generating pre-processing pipelines for AutoETL 

      Giovanelli, Joseph; Bilalli, Besim; Abelló Gamazo, Alberto; Silva Coira, Fernando; de Bernardo Roca, Guillermo (Elsevier, 2024-02)
      Article
      Restricted access - publisher's policy
      This work is a companion reproducibility paper of the experiments and results reported in Giovanelli et al. (2022), where data pre-processing pipelines are evaluated in order to find pipeline prototypes that reduce the ...