New insights into evaluation of regression models through a decomposition of the prediction errors: application to near-infrared spectral data
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2099/13769
Tipus de documentArticle
Data publicació2013
EditorInstitut d'Estadística de Catalunya
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
This paper analyzes the performance of linear regression models taking into account usual criteria such as the number of principal components or latent factors, the goodness of fit or the predictive capability. Other comparison criteria, more common in an economic context, are also considered: the degree of multicollinearity and a decomposition of the mean squared error of the prediction which determines the nature, systematic or random, of the prediction errors. The applications use real data of extra-virgin oil obtained by near-infrared spectroscopy. The high dimensionality of the data is reduced by applying principal component analysis and partial least squares analysis. A possible improvement of these methods by using cluster analysis or the information of the relative maxima of the spectrum is investigated. Finally, obtained results are generalized via cross-validation and bootstrapping.
CitacióSánchez Rodríguez, María Isabel [et al.]. New insights into evaluation of regression models through a decomposition of the prediction errors: application to near-infrared spectral data. "SORT", vol. 37, núm. 1, p. 57-78.
ISSN1696-2281
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
37.1.4.sanchez-etal.pdf | 3,512Mb | Visualitza/Obre |