Show simple item record

dc.contributorRodríguez Hontoria, Horacio
dc.contributorSallent Ribes, Sebastián
dc.contributor.authorMiquel Ribé, Marc
dc.description.abstract”Wikipedia is a free web-based, collaborative, multilingual encyclopedia project supported by the non-profit Wikimedia Foundation” this is the way the definition of Wikipedia in the article of the English language edition starts. This means it can be modified at any time, by anyone and at any place. These bases and their participation success make of Wikipedia an excellent social object of study which, at the same time, for being a technological construct, can be approached by techniques of natural language processing, information retrieval or data mining. However, in the current research there is a clear lack of software which can make an integral approach. Taking this into account, we make an in depth characterization of Wikipedia with the end goal of understanding which elements and structures compound its data and how they can be obtained with an analytical tool. We start with the existing API called wikAPIdia, which we develope until include new functionalities and have it ready to use in multiple scenarios and problematics of social sciences. Looking for a practical case to test it, we review the current state of art in motivation of editors and the topical coverage in the repository. This allows us to consider the aim of understanding Wikipedia from the perspective of having a different cultural configuration for each language. Phrasing it as a question: ”is there a national or self-representative motivation which is reflected in the content and thus disposes them differenciately?”. Autoreferentiality is the concept we present in order to analyse this hypothetical higher interest in local content. An identification and recollection is made on articles from heterogenous topics which can refer to the local history, sport teams or pop culture, but still maintain a semantic relation to the context of editors. Later, we propose a multidimensional analysis of them on features which can be significant indicators, to reach common conclusions and evaluate the language editions through an index of Autoreferentiality. Last, we point out which is the impact of this content and the risk of not considering its existance in the design of applications based on user generated content.
dc.publisherUniversitat Politècnica de Catalunya
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 Spain
dc.subjectÀrees temàtiques de la UPC::So, imatge i multimèdia::Creació multimèdia::Edició web
dc.subject.lcshComputer software--Development
dc.subject.otherSoftware Development
dc.subject.otherCollaborative Web
dc.subject.otherTopical Coverage
dc.subject.otherData Mining
dc.titleThe Self-focus category: motivation reflected on topical coverage in Wikipedia
dc.typeMaster thesis
dc.subject.lemacWikis (Informàtica)
dc.subject.lemacSoftware -- Desenvolupament
dc.rights.accessOpen Access
dc.audience.educationlevelEstudis de primer/segon cicle

Files in this item


This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-ShareAlike 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-ShareAlike 3.0 Spain