Mostra el registre d'ítem simple

dc.contributor.authorFuentes, Claudio
dc.contributor.authorCasella, George
dc.date.accessioned2010-04-27T15:35:10Z
dc.date.available2010-04-27T15:35:10Z
dc.date.issued2009
dc.identifier.citationFuentes, Claudio; Casella, George. Testing for the existence of clusters. "SORT", 2009, vol. 33, núm. 2, p. 115-158.
dc.identifier.issn1696-2281
dc.identifier.urihttp://hdl.handle.net/2099/8947
dc.description.abstractDetecting and determining clusters present in a certain sample has been an important concern, among researchers from different fields, for a long time. In particular, assessing whether the clusters are statistically significant, is a question that has been asked by a number of experimenters. Recently, this question arose again in a study in maize genetics, where determining the significance of clusters is crucial as a primary step in the identification of a genome-wide collection of mutants that may affect the kernel composition. Although several efforts have been made in this direction, not much has been done with the aim of developing an actual hypothesis test in order to assess the significance of clusters. In this paper, we propose a new methodology that allows the examination of the hypothesis test H0 : =1 vs. H1 : =k, where denotes the number of clusters present in a certain population. Our procedure, based on Bayesian tools, permits us to obtain closed form expressions for the posterior probabilities corresponding to the null hypothesis. From here, we calibrate our results by estimating the frequentist null distribution of the posterior probabilities in order to obtain the p-values associated with the observed posterior probabilities. In most cases, actual evaluation of the posterior probabilities is computationally intensive and several algorithms have been discussed in the literature. Here, we propose a simple estimation procedure, based on MCMC techniques, that permits an efficient and easily implementable evaluation of the test. Finally, we present simulation studies that support our conclusions, and we apply our method to the analysis of NIR spectroscopy data coming from the genetic study that motivated this work.
dc.format.extent44 p.
dc.language.isoeng
dc.publisherInstitut d'Estadística de Catalunya
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Spain
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subjectÀrees temàtiques de la UPC::Matemàtiques i estadística::Estadística matemàtica
dc.subject.lcshMathematical statistics
dc.subject.otherHierarchical models
dc.subject.otherBayesian inference
dc.subject.otherFrequentist calibration
dc.subject.otherMonte Carlo methods
dc.subject.otherP-values.
dc.titleTesting for the existence of clusters
dc.typeArticle
dc.subject.lemacEstadística matemàtica
dc.description.peerreviewedPeer Reviewed
dc.subject.amsClassificació AMS::62 Statistics::62F Parametric inference
dc.rights.accessOpen Access
local.citation.authorFuentes, Claudio; Casella, George
local.citation.publicationNameSORT
local.citation.volume33
local.citation.number2
local.citation.startingPage115
local.citation.endingPage158


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple