Compression and the origins of Zipf's law for word frequencies

View/Open
Cita com:
hdl:2117/100379
Document typeArticle
Defense date2016-11-01
Rights accessOpen Access
Except where otherwise noted, content on this work
is licensed under a Creative Commons license
:
Attribution-NonCommercial-ShareAlike 3.0 Spain
Abstract
Here we sketch a new derivation of Zipf's law for word frequencies based on optimal coding. The structure of the derivation is reminiscent of Mandelbrot's random typing model but it has multiple advantages over random typing: (1) it starts from realistic cognitive pressures, (2) it does not require fine tuning of parameters, and (3) it sheds light on the origins of other statistical laws of language and thus can lead to a compact theory of linguistic laws. Our findings suggest that the recurrence of Zipf's law in human languages could originate from pressure for easy and fast communication.
CitationFerrer-i-Cancho, R. Compression and the origins of Zipf's law for word frequencies. "Complexity", 1 Novembre 2016, vol. 21, núm. S2, p. 409-411.
ISSN1076-2787
Publisher versionhttp://onlinelibrary.wiley.com/doi/10.1002/cplx.21820/abstract
Files | Description | Size | Format | View |
---|---|---|---|---|
1605.01326v2.pdf | 84,79Kb | View/Open |