DSpace DSpace UPC
 Català   Castellano   English  

E-prints UPC >
Altres >
Enviament des de DRAC >

Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/2117/14798

Ítem no disponible en accés obert per política de l'editorial

Arxiu Descripció MidaFormat
00949655.2011.pdf738,55 kBAdobe PDF Accés restringit

Citació: Font, M.; Puig, X.; Ginebra, J. Bayesian analysis of frequency count data. "Journal of statistical computation and simulation", 2011, p. 1-18.
Títol: Bayesian analysis of frequency count data
Autor: Font Valverde, Martí Veure Producció científica UPC; Puig Oriol, Xavier Veure Producció científica UPC; Ginebra Molins, Josep Veure Producció científica UPC
Data: 2011
Tipus de document: Article
Resum: The zero truncated inverse Gaussian–Poisson model, obtained by first mixing the Poisson model assuming its expected value has an inverse Gaussian distribution and then truncating the model at zero, is very useful when modelling frequency count data. A Bayesian analysis based on this statistical model is implemented on the word frequency counts of various texts, and its validity is checked by exploring the posterior distribution of the Pearson errors and by implementing posterior predictive consistency checks. The analysis based on this model is useful because it allows one to use the posterior distribution of the model mixing density as an approximation of the posterior distribution of the density of the word frequencies of the vocabulary of the author, which is useful to characterize the style of that author. The posterior distribution of the expectation and of measures of the variability of that mixing distribution can be used to assess the size and diversity of his vocabulary. An alternative analysis is proposed based on the inverse Gaussian-zero truncated Poisson mixture model, which is obtained by switching the order of the mixing and the truncation stages. Even though this second model fits some of the word frequency data sets more accurately than the first model, in practice the analysis based on it is not as useful because it does not allow one to estimate the word frequency distribution of the vocabulary.
ISSN: 0094-9655
URI: http://hdl.handle.net/2117/14798
DOI: 10.1080/00949655.2011.600311
Apareix a les col·leccions:Altres. Enviament des de DRAC
GRESA - Grup de recerca en estadística aplicada. Articles de revista
Departament d'Estadística i Investigació Operativa. Articles de revista
Comparteix:


Stats Mostra les estadístiques d'aquest ítem

SFX Query

Aquest ítem (excepte textos i imatges no creats per l'autor) està subjecte a una llicència de Creative Commons Llicència Creative Commons
Creative Commons

 

Valid XHTML 1.0! Programari DSpace Copyright © 2002-2004 MIT and Hewlett-Packard Comentaris
Universitat Politècnica de Catalunya. Servei de Biblioteques, Publicacions i Arxius