dc.contributor: Font Aragones, Xavier
dc.contributor.author: Fernández Pawlukojc, Alex
dc.description.abstract: Recently, there has been growing interest in the field of data mining, which involves, among other things, processing massive amounts of data to extract higher-level information. The results of such processing usually make it possible to assess variables of interest across very large populations, such as trending fads or the development of political currents. One such population is that of Reddit, a web content aggregation service that has been gaining popularity around the world over the last few years. A statistical analysis was performed on the titles that users gave to the most popular content published on Reddit within two specific time frames: August 2013 and August 2016. The results show interesting patterns in the most popular words and word combinations, which suggests that further investigation could yield promising results.
dc.publisher: Universitat Politècnica de Catalunya
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Spain
dc.subject: UPC subject areas::Computer science::Information systems
dc.subject.lcsh: Data mining
dc.subject.other: Text Mining
dc.title: Text Mining Reddit's Top Posts: The Potential Information In Internet-Based Communities
dc.type: Bachelor thesis
dc.subject.lemac: Data mining (Mineria de dades)
dc.rights.access: Open Access
dc.audience.mediator: Escola Universitària Politècnica de Mataró

Except where otherwise noted, content on this work is licensed under a Creative Commons license: Attribution-NonCommercial-NoDerivs 3.0 Spain