Ir al contenido (pulsa Retorno)

Universitat Politècnica de Catalunya

    • Català
    • Castellano
    • English
    • LoginRegisterLog in (no UPC users)
  • mailContact Us
  • world English 
    • Català
    • Castellano
    • English
  • userLogin   
      LoginRegisterLog in (no UPC users)

UPCommons. Global access to UPC knowledge

60.175 UPC academic works
You are here:
View Item 
  •   DSpace Home
  • Treballs acadèmics
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels
  • Grau en Enginyeria Telemàtica (Pla 2009)
  • View Item
  •   DSpace Home
  • Treballs acadèmics
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels
  • Grau en Enginyeria Telemàtica (Pla 2009)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Utilització d'Open Source Software per l'analisi de big data

Thumbnail
View/Open
memoria.pdf (5,776Mb)
Share:
 
  View Usage Statistics
Cita com:
hdl:2117/91233

Show full item record
Iglesias Gimeno, Jordi
Martinez Otal, Alejandro
Author's e-mailamartinezotalarrobagmail.com, jordi.eetac@gmail.com
Tutor / directorMeseguer Pallarès, RocMés informacióMés informacióMés informació
Document typeBachelor thesis
Date2016-09-14
Rights accessOpen Access
Attribution-NonCommercial-ShareAlike 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-ShareAlike 3.0 Spain
Abstract
In this project we aim to identify, analyze and justify the contribution that the data can do in big businesses or schools by creating added value from the data we collect. Being a relatively new concept, Big data, differential attributes, its purpose and different traditional data mining methodologies will be defined. We also want to highlight as Big data can become a source of competitive advantage with technologies like Hadoop and Spark as an alternative storage and processing of high volumes of data, also via large tools preprocessing data such as programs Microsoft Excel or more specific pre tools data processing as WEKA. Also since we have the opportunity to work with a tool in our work experience, closely related to our work, we will exploit some functionality in order to learn as much as possible about the analysis monitoring and further processing of our great data volumes. This work aims to define the scenarios in which Hadoop and Spark can be used instead of the classic models of storage as well as the possible reuse infrastructure Business existing Intelligence to use Hadoop and Spark as a data source further than the existing EDW. With this research we pretend to defiance the scenarios in which the Open Software tools that work with Big Data can be user instead of the classic models for data storage, as much as the usage of the infrastructure that existing Machine Learning algorithms provide to use those tools. We will also focus on one of our cases is in the academic field by gathering information about the "logs" we have from the students during the semester, so that we can handle and manage that information in a more useful way and always thinking for the students to predict future outcomes by their trajectories and so to give a feedback to the teacher or tutor before the end of the course.
SubjectsBig data, Open source software, Macrodades, Programari lliure
DegreeGRAU EN ENGINYERIA TELEMÀTICA (Pla 2009)
URIhttp://hdl.handle.net/2117/91233
Collections
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels - Grau en Enginyeria Telemàtica (Pla 2009) [152]
Share:
 
  View Usage Statistics

Show full item record

FilesDescriptionSizeFormatView
memoria.pdf5,776MbPDFView/Open

Browse

This CollectionBy Issue DateAuthorsOther contributionsTitlesSubjectsThis repositoryCommunities & CollectionsBy Issue DateAuthorsOther contributionsTitlesSubjects

© UPC Obrir en finestra nova . Servei de Biblioteques, Publicacions i Arxius

info.biblioteques@upc.edu

  • About This Repository
  • Contact Us
  • Send Feedback
  • Inici de la pàgina