Bias on the web
Cita com:
hdl:2117/330972
Document typeConference report
Defense date2020
PublisherBarcelona Supercomputing Center
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
The Web is the most powerful communication medium and the largest public data repository
that humankind has created. Its content ranges from great reference sources
such as Wikipedia to ugly fake news. Indeed, social (digital) media is just an amplifying
mirror of ourselves. Hence, the main challenge of search engines and other websites
that rely on web data is to assess the quality of such data. However, as all people has
their own biases, web content as well as our web interactions are tainted with many biases.
Data bias includes redundancy and spam, while interaction bias includes activity
and presentation bias. In addition, sometimes algorithms add bias, particularly in the
context of search and recommendation systems. As bias generates bias, we stress the
importance of debiasing data as well as using the context and other techniques such as
explore & exploit, to break the filter bubble. The main goal of this talk is to make people
aware of the different biases that affect all of us on the Web. Awareness is the first step
to be able to fight and reduce the vicious cycle of web bias. For more details see the
article of same title in Communications of ACM, June 2018
CitationBaeza, R. Bias on the web. A: . Barcelona Supercomputing Center, 2020, p. 63-64.
Files | Description | Size | Format | View |
---|---|---|---|---|
BSC_SORS_2019-20-29_Bias on the Web.pdf | 351,5Kb | View/Open | ||
license_rdf.rdf | 1,203Kb | application/rdf+xml; charset=utf-8 | View/Open |