The Zipf–Poisson-stopped-sum distribution with an application for modeling the degree sequence of social networks
Visualitza/Obre
Cita com:
hdl:2117/345797
Tipus de documentArticle
Data publicació2020-03-01
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
Under the Zipf Distribution, the frequency of a value is a power function of its size. Thus, when plotting frequencies versus size in log–log scale of data following that distribution, one obtains a straight line. The Zipf has been assumed to be appropriate for modeling highly skewed data from many different areas. Nevertheless, for many real data sets, the linearity is observed only in the tail; thus, the Zipf is fitted only for values larger than a given threshold and, consequently, there is a loss of information. The Zipf–Poisson-stopped-sum distribution is introduced as a more flexible alternative. It is proven that in log–log scale allows for top-concavity, maintaining the linearity in the tail. Consequently, the distribution fits properly many data sets in their entire range. To prove the suitability of our model 16 network degree sequences describing the interaction between members of a given platform have been fitted. The results have been compared with the fits obtained through other bi-parametric distributions.
CitacióDuarte-López, A.; Perez-Casany, M.; Valero, J. The Zipf–Poisson-stopped-sum distribution with an application for modeling the degree sequence of social networks. "Computational statistics and data analysis", 1 Març 2020, vol. 143, p. 106838: 1-106838: 16.
ISSN0167-9473
Versió de l'editorhttps://www.sciencedirect.com/science/article/pii/S0167947319301938
Col·leccions
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
finalDraft.pdf | 906,6Kb | Visualitza/Obre |