dc.contributor.author: González Torre, Ivan
dc.contributor.author: Debowski, Lukasz
dc.contributor.author: Hernández Fernández, Antonio
dc.contributor.other: Universitat Politècnica de Catalunya. Institut de Ciències de l'Educació
dc.date.accessioned: 2021-10-07T07:27:34Z
dc.date.available: 2021-10-07T07:27:34Z
dc.date.issued: 2021-08-20
dc.identifier.citation: González, I.; Debowski, L.; Hernández-Fernández, A. Can Menzerath's law be a criterion of complexity in communication? "PloS one", 20 August 2021, vol. 16, no. 8, article e0256133, p. 1-21.
dc.identifier.issn: 1932-6203
dc.identifier.uri: http://hdl.handle.net/2117/353245
dc.description.abstract: Menzerath’s law is a quantitative linguistic law which states that, on average, the longer a linguistic construct is, the shorter its constituents are. In contrast, Menzerath-Altmann’s law (MAL) is a precise mathematical power-law-exponential formula which expresses the expected length of the linguistic construct conditioned on the number of its constituents. In this paper, we investigate the anatomy of MAL for constructs being word tokens and constituents being syllables, measuring length in graphemes. First, we derive the exact form of MAL for texts generated by the memoryless source with three emitted symbols, which can be interpreted as a "monkey typing" model or a null model. We show that this null model complies with Menzerath’s law, revealing that Menzerath’s law itself can hardly be a criterion of complexity in communication. This observation does not apply to the more precise Menzerath-Altmann’s law, which predicts an inverted regime for sufficiently long constructs, i.e., the longer a word is, the longer its syllables are. To support this claim, we analyze MAL on data from 21 languages, consisting of texts from the Standardized Project Gutenberg Corpus. We show the presence of the inverted regime, not exhibited by the null model, and we demonstrate the robustness of our results. We also report the complicated distribution of syllable sizes with respect to their position in the word, which might be related to the emerging MAL. Altogether, our results indicate that Menzerath’s law (in terms of correlations) is a spurious observation, while complex patterns and efficiency dynamics should rather be attributed to specific forms of Menzerath-Altmann’s law.
dc.description.sponsorship: This work has been funded by the project PRO2021-S03-HERNANDEZ (Institut d’Estudis Catalans), where AHF is the principal investigator (URL: https://futur.upc.edu/30546321). AHF is also funded by the grant TIN2017-89244-R from Ministerio de Economia, Industria y Competitividad (Gobierno de España) and supported by the recognition 2017SGR-856 (MACDA) from AGAUR (Generalitat de Catalunya) (URL: https://futur.upc.edu/2202438).
dc.format.extent: 21 p.
dc.language.iso: eng
dc.publisher: Public Library of Science (PLOS)
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: UPC subject areas::Computer science::Artificial intelligence::Natural language
dc.subject.lcsh: Computational linguistics
dc.subject.other: Menzerath's law
dc.subject.other: Menzerath-Altmann's law
dc.subject.other: Monkey typing
dc.subject.other: Linguistic laws
dc.subject.other: Memoryless source
dc.subject.other: Communication
dc.subject.other: Standardized Project Gutenberg Corpus
dc.subject.other: Hidden Markov Model
dc.title: Can Menzerath’s law be a criterion of complexity in communication?
dc.type: Article
dc.subject.lemac: Computational linguistics
dc.contributor.group: Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge
dc.identifier.doi: 10.1371/journal.pone.0256133
dc.description.peerreviewed: Peer Reviewed
dc.relation.publisherversion: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0256133
dc.rights.access: Open Access
local.identifier.drac: 31981580
dc.description.version: Postprint (published version)
dc.relation.projectid: info:eu-repo/grantAgreement/IEC/PRO2021-S03-HERNANDEZ
dc.relation.projectid: info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-89244-R/ES/GESTION Y ANALISIS DE DATOS COMPLEJOS/
dc.relation.projectid: info:eu-repo/grantAgreement/AGAUR/2017 SGR 856
local.citation.author: González, I.; Debowski, L.; Hernández-Fernández, A.
local.citation.publicationName: PloS one
local.citation.volume: 16
local.citation.number: 8, article e0256133
local.citation.startingPage: 1
local.citation.endingPage: 21
dc.description.sdg: Sustainable Development Goals::9 - Industry, Innovation and Infrastructure
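
The abstract's claim that a memoryless "monkey typing" source already complies with Menzerath's law can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes, purely for illustration, that the three emitted symbols are a vowel, a consonant and a space, that the number of syllables in a word equals its number of vowels, and that words without a vowel are discarded. The function name, the probabilities and the `min_words` threshold are all hypothetical choices.

```python
import random
from collections import defaultdict


def mean_syllable_length_by_count(n_symbols=300_000, p_vowel=0.3,
                                  p_consonant=0.5, min_words=100, seed=0):
    """Simulate a memoryless three-symbol source and return, for each
    syllable count n, the average of (word length / n) over words with
    exactly n syllables.

    Assumptions (illustrative, not from the paper): symbols are
    'V' (vowel), 'C' (consonant) and ' ' (space); a word's syllable
    count is its number of vowels; vowel-less words are skipped.
    """
    rng = random.Random(seed)
    p_space = 1.0 - p_vowel - p_consonant
    # Draw every symbol independently: the source has no memory.
    stream = rng.choices("VC ", weights=[p_vowel, p_consonant, p_space],
                         k=n_symbols)
    text = "".join(stream)

    totals = defaultdict(float)  # sum of mean syllable lengths per n
    counts = defaultdict(int)    # number of words per n
    for word in text.split():    # split() drops empty words
        n = word.count("V")
        if n == 0:
            continue
        totals[n] += len(word) / n
        counts[n] += 1

    # Keep only well-populated bins to limit sampling noise.
    return {n: totals[n] / counts[n]
            for n in sorted(counts) if counts[n] >= min_words}


if __name__ == "__main__":
    for n, mean_len in mean_syllable_length_by_count().items():
        print(f"{n} syllable(s): mean syllable length {mean_len:.3f}")
```

Under these assumptions the expected mean syllable length for a word with n syllables is 1 + q(n + 1)/n, where q = p_consonant / (1 - p_consonant), so it decreases monotonically towards 1 + q as n grows. The random text therefore shows the Menzerath trend without any linguistic structure, and it never produces the inverted regime that the abstract reports for natural-language data.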