Text Mining Reddit's Top Posts: The Potential Information In Internet-Based Communities
Tutor / director / evaluatorFont Aragones, Xavier
Document typeBachelor thesis
Rights accessOpen Access
As of late, there has been a growing interest in the field of "data-mining" which involves, among other things, the processment of massive amounts of data for a higher purpose. The result of such processes usually allows for the assessment of interesting variables concerning huge pools of population, like trending fads or the development of political currents. One such pool is Reddit, a web content aggregator service which has been gaining popularity around the world over the last few years. A statistical analysis has been performed, focused on the titles that people gave to the most popular content published in Reddit within two specific timeframes: 2013 and 2016, both in August. The results show interesting patterns regarding the most popular words and combinations of words, which points towards promising results should further investigation be undertaken.