UPCommons està en procés de migració del dia 10 fins al 14 Juliol. L’autentificació està deshabilitada per evitar canvis durant aquesta migració.
An analysis of factors used in search engine ranking

Cita com:
hdl:2117/20465
Document typeConference report
Defense date2005
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial
property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public
communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
This paper investigates the influence of different page features on the ranking of search engine results. We use Google (via its API) as our testbed and analyze the result rankings for several queries of different categories using statistical methods. We reformulate the problem of learning the underlying, hidden scores as a binary classification problem. To this problem we then apply both linear and non-linear methods. In all cases, we split the data into a training set and a test set to obtain a meaningful, unbiased estimator for the quality of our predictor. Although our results clearly show that the scoring function cannot be approximated well using
only the observed features, we do obtain many interesting insights along the way and discuss ways of obtaining a better estimate and main limitations in trying to do so.
CitationBifet, A.C. [et al.]. An analysis of factors used in search engine ranking. A: International World Wide Web Conference. "Proceedings of the 4th International World Wide Web Conference". Chiba: 2005, p. 48-57.
Collections
Files | Description | Size | Format | View |
---|---|---|---|---|
Bifet at al 2005.pdf | 69,63Kb | View/Open | ||
Bifet at al 2005.pptx | 284,6Kb | Microsoft PowerPoint 2007 | View/Open |