Spam Classification Using Machine Learning Techniques - Sinespam
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2099.1/11321
Tipus de documentProjecte Final de Màster Oficial
Data2010-08
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
Most e-mail readers spend a non-trivial amount of time regularly deleting junk e-mail (spam)
messages, even as an expanding volume of such e-mail occupies server storage space and
consumes network bandwidth. An ongoing challenge, therefore, rests within the development
and refinement of automatic classifiers that can distinguish legitimate e-mail from spam. Some
published studies have examined spam detectors using Naïve Bayesian approaches and large
feature sets of binary attributes that determine the existence of common keywords in spam,
and many commercial applications also use Naïve Bayesian techniques.
Spammers recognize these attempts to prevent their messages and have developed tactics to
circumvent these filters, but these evasive tactics are themselves patterns that human readers
can often identify quickly. This work had the objectives of developing an alternative approach
using a neural network (NN) classifier brained on a corpus of e-mail messages from several
users. The features selection used in this work is one of the major improvements, because the
feature set uses descriptive characteristics of words and messages similar to those that a
human reader would use to identify spam, and the model to select the best feature set, was
based on forward feature selection. Another objective in this work was to improve the spam
detection near 95% of accuracy using Artificial Neural Networks; actually nobody has reached
more than 89% of accuracy using ANN.
MatèriesUnsolicited electronic mail messages -- Classification, Correu brossa (Correu electrònic) -- Classificació
TitulacióMÀSTER UNIVERSITARI EN INTEL·LIGÈNCIA ARTIFICIAL (Pla 2009)
Col·leccions
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
J.Norte.pdf | 888,6Kb | Visualitza/Obre |