Now showing items 1-3 of 3

    • An integration data tool for joinable tables based on apache spark 

      Flores Herrera, Javier de Jesús (Universitat Politècnica de Catalunya, 2020-06-29)
      Master thesis
      Open Access
      Data analysts perform exploratory programming for several analytical tasks on notebooks. One is Data Discovery which consists in finding attributes that might join. This is timeconsuming and new techniques are needed to ...
    • Effective and scalable data discovery with NextiaJD 

      Flores Herrera, Javier de Jesús; Nadal Francesch, Sergi; Romero Moral, Óscar (OpenProceedings, 2021)
      Conference lecture
      Open Access
      We present NextiaJD, a data discovery system with high predictive performance and computational efficiency. NextiaJD aids data scientists in the discovery of datasets that can be crossed. To that end, it proposes a ranking ...
    • Towards scalable data discovery 

      Flores Herrera, Javier de Jesús; Nadal Francesch, Sergi; Romero Moral, Óscar (OpenProceedings, 2021)
      Conference lecture
      Open Access
      We study the problem of discovering joinable datasets at scale. We approach the problem from a learning perspective relying on profiles. These are succinct representations that capture the underlying characteristics of the ...