Using annotations on Mechanical Turk to perform supervised polarity classification of Spanish customer comments
Rights accessRestricted access - publisher's policy
One of the major bottlenecks in the development of data-driven AI Systems is the cost of reliable human annotations. The recent advent of several crowdsourcing platforms such as Amazon’s Mechanical Turk, allowing requesters the access to affordable and rapid results of a global workforce, greatly facilitates the creation of massive training data. Most of the available studies on the effectiveness of crowdsourcing report on English data. We use Mechanical Turk annotations to train an Opinion Mining System to classify Spanish consumer comments. We design three different Human Intelligence Task (HIT) strategies and report high inter-annotator agreement between non-experts and expert annotators. We evaluate the advantages/drawbacks of each HIT design and show that, in our case, the use of non-expert annotations is a viable and cost-effective alternative to expert annotations.
CitationCosta-jussà, M. R., Grivolla, J., Mellebeek, B., Benavent, F., Codina, J., Codina, J., Banchs, R. Using annotations on Mechanical Turk to perform supervised polarity classification of Spanish customer comments. "Information sciences", 01 Desembre 2014, vol. 275, p. 400-412.