Recent Submissions

  • Finding relevant information in big datasets with ML 

    Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca (OpenProceedings, 2024)
    Conference lecture
    Open Access
    Due to the abundance of data, noisy, irrelevant, or redundant features often need to be identified and discarded. Feature selection is a collection of methods used to ensure that only relevant data are used for a data ...
  • GLiDE: Integrated Gamified Learning Dashboard Environment 

    Farré Tost, Carles; López Cuesta, Lidia; Oriol Hilari, Marc; Espinola Garcia, Adrià; Miñana Montecino, Albert; Franch Gutiérrez, Javier (CEUR-WS.org, 2024)
    Conference lecture
    Open Access
    The Integrated Gamified Learning Dashboard Environment (GLiDE) project aims at fostering student engagement, teamwork, and project performance in software engineering education. By integrating gamification elements and ...
  • RE-Miner: Mining mobile user reviews with feature extraction and emotion classification 

    Motger de la Encarnación, Joaquim; Tiessler Aguirre, Max; Oriol Hilari, Marc; Bertolín Rico, Irene (CEUR-WS.org, 2024)
    Conference report
    Open Access
    In the context of app stores, user reviews are pivotal on supporting multiple requirements engineering tasks. Among these, feature extraction and emotion classification play a crucial role in requirements prioritization, ...
  • There is no data science without data governance: a proposal based on knowledge graphs 

    Bilalli, Besim; Jovanovic, Petar; Nadal Francesch, Sergi; Queralt Calafat, Anna; Romero Moral, Óscar (CEUR-WS.org, 2024)
    Conference lecture
    Open Access
    Data Science and data-driven Artificial Intelligence are here to stay and they are expected to further transform the current global economy. From a technical point of view, there is an overall agreement that disciplines ...
  • A data-science pipeline to enable the interpretability of many-objective feature selection 

    Njoku, Uchechukwu Fortune; Abelló Gamazo, Alberto; Bilalli, Besim; Bontempi, Gianluca (CEUR-WS.org, 2024)
    Conference lecture
    Open Access
    Many-Objective Feature Selection (MOFS) approaches use four or more objectives to determine the relevance of a subset of features in a supervised learning task. As a consequence, MOFS typically returns a large set of ...
  • Discovery of semantic non-syntactic joins 

    Maynou Yelamos, Marc; Nadal Francesch, Sergi (CEUR-WS.org, 2024)
    Conference lecture
    Open Access
    Data discovery is an essential step in the data integration pipeline involving finding datasets whose combined information provides relevant insights. Discovering joinable attributes requires assessing the closeness of the ...
  • HealthMesh: An architectural framework for federated healthcare data management 

    Bisquert Parés, Aniol; Hmimou Ham Man, Achraf; Berral García, Josep Lluís; Gutiérrez Torre, Alberto; Romero Moral, Óscar (CEUR-WS.org, 2024)
    Conference report
    Open Access
    Recently, significant milestones have been achieved in the field of healthcare data analysis. However, alongside these accomplishments, substantial data-related challenges have emerged in the domain of big data management. ...
  • Unveiling competition dynamics in mobile app markets through user reviews 

    Motger de la Encarnación, Joaquim; Franch Gutiérrez, Javier; Gervasi, Vincenzo; Marco Gómez, Jordi (Springer, 2024)
    Conference report
    Restricted access - publisher's policy
    [Context and motivation] User reviews published in mobile app repositories are essential for understanding user satisfaction and engagement within a specific market segment. [Question/problem] Manual analysis of reviews ...
  • Performance analysis of distributed GPU-accelerated task-based workflows 

    Nogueira Lobo de Carvalho, Marcos; Queralt Calafat, Anna; Romero Moral, Óscar; Simitsis, Alkis; Tatu, Cristian; Badia Sala, Rosa Maria (OpenProceedings, 2024)
    Conference report
    Open Access
    We present an empirical approach to identify the key factors affecting the execution performance of task-based workflows on a High Performance Computing (HPC) infrastructure composed of heterogeneous CPU-GPU clusters. Our ...
  • Do DL models and training environments have an impact on energy consumption? 

    Rey Juárez, Santiago del; Martínez Fernández, Silverio Juan; Cruz, Luís; Franch Gutiérrez, Javier (Institute of Electrical and Electronics Engineers (IEEE), 2023)
    Conference report
    Open Access
    Current research in the computer vision field mainly focuses on improving Deep Learning (DL) correctness and inference time performance. However, there is still little work on the huge carbon footprint that has training ...
  • Adaptive task-oriented chatbots using feature-based knowledge bases 

    Campàs Gené, Carla; Motger de la Encarnación, Joaquim; Franch Gutiérrez, Javier; Marco Gómez, Jordi (Springer, 2023)
    Conference lecture
    Open Access
    Task-oriented chatbots relying on a knowledge base for domain-specific content exploitation have been largely addressed in research and industry applications. Despite this, multiple challenges remain to be fully conquered, ...
  • Comparision of models built using AutoML and data fusion 

    Haq, Anam; Wilk, Szymon; Abelló Gamazo, Alberto (Springer, 2022)
    Conference report
    Open Access
    Automated machine learning (AutoML) has made life easier for data analysts or scientists by providing quick insights into data by building machine learning (ML) models. AutoML techniques are applied to vast areas from image ...

View more