Enviaments recents

  • H-word: Supporting job scheduling in Hadoop with workload-driven data redistribution 

    Jovanovic, Petar; Romero Moral, Óscar; Calders, Toon; Abelló Gamazo, Alberto (2016)
    Text en actes de congrés
    Accés obert
    Today’s distributed data processing systems typically follow a query shipping approach and exploit data locality for reducing network traffic. In such systems the distribution of data over the cluster resources plays a ...
  • TINTIN : comprobación incremental de aserciones SQL 

    Oriol Hilari, Xavier; Teniente López, Ernest; Rull, Guillem (2016)
    Text en actes de congrés
    Accés obert
    Ninguno de los SGBD más populares del momento implementa aserciones SQL, obligando así a implementar manualmente su comprobación. Por ello, presentamos TINTIN: una aplicación que genera automáticamente el código SQL para ...
  • Automated data pre-processing via meta-learning 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
    Text en actes de congrés
    Accés obert
    A data mining algorithm may perform differently on datasets with different characteristics, e.g., it might perform better on a dataset with continuous attributes rather than with categorical attributes, or the other way ...
  • Towards intelligent data analysis : the metadata challenge 

    Bilalli, Besim; Abelló Gamazo, Alberto; Aluja Banet, Tomàs; Wrembel, Robert (2016)
    Comunicació de congrés
    Accés restringit per política de l'editorial
    Once analyzed correctly, data can yield substantial benefits. The process of analyzing the data and transforming it into knowledge is known as Knowledge Discovery in Databases (KDD). The plethora and subtleties of algorithms ...
  • POIESIS: A tool for quality-aware ETL process redesign 

    Theodorou, Vasileios; Abelló Gamazo, Alberto; Thiele, Maik; Lehner, Wolfgang (2015)
    Text en actes de congrés
    Accés obert
    We present a tool, called POIESIS, for automatic ETL process enhancement. ETL processes are essential data-centric activities in modern business intelligence environments and they need to be examined through a viewpoint ...
  • A simulation framework for real-time assessment of dynamic ride sharing demand responsive transportation models 

    Linares Herreros, Mª Paz; Montero Mercadé, Lídia; Barceló Bugeda, Jaime; Carmona, carlos (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Text en actes de congrés
    Accés obert
    Sustainable mobility is not only a technological question, automotive technology will be part of the solution combined with a paradigm shift from car ownership to vehicle usage, and the application of Information and ...
  • Towards information profiling: data lake content metadata management 

    Al-serafi, Ayman Mounir Mohamed; Abelló Gamazo, Alberto; Romero Moral, Óscar; Calders, Toon (Institute of Electrical and Electronics Engineers (IEEE), 2016)
    Text en actes de congrés
    Accés obert
    There is currently a burst of Big Data (BD) processed and stored in huge raw data repositories, commonly called Data Lakes (DL). These BD require new techniques of data integration and schema alignment in order to make the ...
  • A machine learning approach for layout inference in spreadsheets 

    Koci, Elvis; Thiele, Maik; Romero Moral, Óscar; Lehner, Wolfgang (SciTePress, 2016)
    Text en actes de congrés
    Accés obert
    Spreadsheet applications are one of the most used tools for content generation and presentation in industry and the Web. In spite of this success, there does not exist a comprehensive approach to automatically extract and ...
  • NOSQL design for analytical workloads: Variability matters 

    Herrero Otal, Víctor; Abelló Gamazo, Alberto; Romero Moral, Óscar (Springer, 2016)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    Big Data has recently gained popularity and has strongly questioned relational databases as universal storage systems, especially in the presence of analytical workloads. As result, co-relational alternatives, commonly ...
  • Towards exploratory OLAP on linked data 

    Abelló Gamazo, Alberto; Gallinucci, Enrico; Golfarelli, Matteo; Rizzi Bach, Stefano; Romero Moral, Óscar (2016)
    Text en actes de congrés
    Accés obert
    In the context of exploratory OLAP, coupling the information wealth of linked data with the precision and detail of corporate data can greatly improve the effectiveness of the decision-making process. In this paper we ...

Mostra'n més