Ir al contenido (pulsa Retorno)

Universitat Politècnica de Catalunya

    • Català
    • Castellano
    • English
    • LoginRegisterLog in (no UPC users)
  • mailContact Us
  • world English 
    • Català
    • Castellano
    • English
  • userLogin   
      LoginRegisterLog in (no UPC users)

UPCommons. Global access to UPC knowledge

Banner header
64.019 UPC academic works
You are here:
View Item 
  •   DSpace Home
  • Treballs acadèmics
  • Màsters oficials
  • Màster universitari en Estadística i Investigació Operativa (UPC-UB)
  • View Item
  •   DSpace Home
  • Treballs acadèmics
  • Màsters oficials
  • Màster universitari en Estadística i Investigació Operativa (UPC-UB)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Tree Boosting Data Competitions with XGBoost

Thumbnail
View/Open
memoria.pdf (948,7Kb)
Share:
 
  View Usage Statistics
Cita com:
hdl:2117/100293

Show full item record
Bort Escabias, Carlos
Tutor / directorDelicado Useros, Pedro FranciscoMés informacióMés informacióMés informació
Document typeMaster thesis
Date2017-01
Rights accessOpen Access
Attribution-NonCommercial-NoDerivs 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-NoDerivs 3.0 Spain
Abstract
This Master's Degree Thesis objective is to provide understanding on how to approach a supervised learning predictive problem and illustrate it using a statistical/machine learning algorithm, Tree Boosting. A review of tree methodology is introduced in order to understand its evolution, since Classification and Regression Trees, followed by Bagging, Random Forest and, nowadays, Tree Boosting. The methodology is explained following the XGBoost implementation, which achieved state-of-the-art results in several data competitions. A framework for applied predictive modelling is explained with its proper concepts: objective function, regularization term, overfitting, hyperparameter tuning, k-fold cross validation and feature engineering. All these concepts are illustrated with a real dataset of videogame churn; used in a datathon competition.
SubjectsMathematical statistics, Estadística matemàtica
DegreeMÀSTER UNIVERSITARI EN ESTADÍSTICA I INVESTIGACIÓ OPERATIVA (Pla 2013)
URIhttp://hdl.handle.net/2117/100293
Collections
  • Màsters oficials - Màster universitari en Estadística i Investigació Operativa (UPC-UB) [392]
Share:
 
  View Usage Statistics

Show full item record

FilesDescriptionSizeFormatView
memoria.pdf948,7KbPDFView/Open

Browse

This CollectionBy Issue DateAuthorsOther contributionsTitlesSubjectsThis repositoryCommunities & CollectionsBy Issue DateAuthorsOther contributionsTitlesSubjects

© UPC Obrir en finestra nova . Servei de Biblioteques, Publicacions i Arxius

info.biblioteques@upc.edu

  • About This Repository
  • Contact Us
  • Send Feedback
  • Privacy Settings
  • Inici de la pàgina