
Recent Submissions

  • Keeping the data lake in form: proximity mining for pre-filtering schema matching 

    Al-serafi, Ayman Mounir Mohamed; Abelló Gamazo, Alberto; Romero Moral, Óscar; Calders, Toon (2020-05)
    Article
    Open Access
    Data Lakes (DLs) are large repositories of raw datasets from disparate sources. As more datasets are ingested into a DL, there is an increasing need for efficient techniques to profile them and to detect the relationships ...
  • Multidimensional integration of RDF datasets 

    Behan, Jam Jahanzeb Khan; Romero Moral, Óscar; Zimányi, Esteban (Springer, 2019)
    Conference lecture
    Open Access
    Data providers have been uploading RDF datasets on the web to aid researchers and analysts in finding insights. These datasets, made available by different data providers, contain common characteristics that enable their ...
  • Automatically configuring parallelism for hybrid layouts 

    Munir, Rana Faisal; Abelló Gamazo, Alberto; Romero Moral, Óscar; Thiele, Maik; Lehner, Wolfgang (Springer, 2019)
    Conference lecture
    Open Access
    Distributed processing frameworks process data in parallel by dividing it into multiple partitions, each of which is processed in a separate task. The number of tasks is always determined by the total file size. ...
  • FAME: supporting continuous requirements elicitation by combining user feedback and monitoring 

    Oriol Hilari, Marc; Stade, Melanie; Fotrousi, Farnaz; Nadal Francesch, Sergi; Varga, Jovan; Seyff, Norbert; Abelló Gamazo, Alberto; Franch Gutiérrez, Javier; Marco Gómez, Jordi; Schmidt, Oleg (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Conference report
    Open Access
    Context: Software evolution ensures that software systems in use stay up to date and provide value for end-users. However, it is challenging for requirements engineers to continuously elicit needs for systems used by ...
  • ODIN: A dataspace management system 

    Nadal Francesch, Sergi; Rabbani, Kashif; Romero Moral, Óscar; Nigatu, Shumet Tadesse (CEUR-WS.org, 2019)
    Conference lecture
    Open Access
    ODIN is a system that supports the incremental pay-as-you-go integration of data sources into dataspaces and provides user-friendly querying mechanisms on top of them. We describe its main characteristics and underlying ...
