DSpace DSpace UPC
 English   Castellano   Català  

Treballs academics UPC >
Màsters Oficials >
Master in Artificial Intelligence - MAI (Pla 2006) >

Empreu aquest identificador per citar o enllaçar aquest ítem: http://hdl.handle.net/2099.1/11321

Arxiu Descripció MidaFormat
J.Norte.pdf888.65 kBAdobe PDFVeure/Obrir

Títol: Spam Classification Using Machine Learning Techniques - Sinespam
Autor: Norte Sosa, José
Tutor/director/avaluador: Alquézar Mancho, René Veure Producció científica UPC
Universitat: Universitat Politècnica de Catalunya
Matèries: Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial::Sistemes experts
Unsolicited electronic mail messages -- Classification
Correu brossa (Correu electrònic) -- Classificació
Data: ago-2010
Tipus de document: Master thesis
Resum: Most e-mail readers spend a non-trivial amount of time regularly deleting junk e-mail (spam) messages, even as an expanding volume of such e-mail occupies server storage space and consumes network bandwidth. An ongoing challenge, therefore, rests within the development and refinement of automatic classifiers that can distinguish legitimate e-mail from spam. Some published studies have examined spam detectors using Naïve Bayesian approaches and large feature sets of binary attributes that determine the existence of common keywords in spam, and many commercial applications also use Naïve Bayesian techniques. Spammers recognize these attempts to prevent their messages and have developed tactics to circumvent these filters, but these evasive tactics are themselves patterns that human readers can often identify quickly. This work had the objectives of developing an alternative approach using a neural network (NN) classifier brained on a corpus of e-mail messages from several users. The features selection used in this work is one of the major improvements, because the feature set uses descriptive characteristics of words and messages similar to those that a human reader would use to identify spam, and the model to select the best feature set, was based on forward feature selection. Another objective in this work was to improve the spam detection near 95% of accuracy using Artificial Neural Networks; actually nobody has reached more than 89% of accuracy using ANN.
URI: http://hdl.handle.net/2099.1/11321
Condicions d'accés: Open Access
Apareix a les col·leccions:Master in Artificial Intelligence - MAI (Pla 2006)

SFX Query

Aquest ítem (excepte textos i imatges no creats per l'autor) està subjecte a una llicència de Creative Commons Llicència Creative Commons
Creative Commons


Valid XHTML 1.0! Programari DSpace Copyright © 2002-2004 MIT and Hewlett-Packard Comentaris
Universitat Politècnica de Catalunya. Servei de Biblioteques, Publicacions i Arxius