Recent Submissions

  • Keeping the data lake in form: proximity mining for pre-filtering schema matching 

    Al-serafi, Ayman Mounir Mohamed; Abelló Gamazo, Alberto; Romero Moral, Óscar; Calders, Toon (2020-05)
    Article
    Open Access
    Data Lakes (DLs) are large repositories of raw datasets from disparate sources. As more datasets are ingested into a DL, there is an increasing need for efficient techniques to profile them and to detect the relationships ...
