A regional perspective on the accuracy of machine learning forecasts of tourism demand based on data characteristics
Document typeExternal research report
Rights accessOpen Access
In this work we assess the role of data characteristics in the accuracy of machine learning (ML) tourism forecasts from a spatial perspective. First, we apply a seasonal-trend decomposition procedure based on non-parametric regression to isolate the different components of the time series of international tourism demand to all Spanish regions. This approach allows us to compute a set of measures to describe the features of the data. Second, we analyse the performance of several ML models in a recursive multiple-step-ahead forecasting experiment. In a third step, we rank all seventeen regions according to their characteristics and the obtained forecasting performance, and use the rankings as the input for a multivariate analysis to evaluate the interactions between time series features and the accuracy of the predictions. By means of dimensionality reduction techniques we summarise all the information into two components and project all Spanish regions into perceptual maps. We find that entropy and dispersion show a negative relation with accuracy, while the effect of other data characteristics on forecast accuracy is heavily dependent on the forecast horizon.
CitationClaveria, O.; Monte, E.; Torra Porras, S. "A regional perspective on the accuracy of machine learning forecasts of tourism demand based on data characteristics". 2018.
URL other repositoryhttp://www.ub.edu/irea/working_papers/2018/201805.pdf