
dc.contributor.author: Wu, Hao
dc.date.accessioned: 2010-11-30T12:15:52Z
dc.date.available: 2010-11-30T12:15:52Z
dc.date.issued: 2010
dc.identifier.uri: http://hdl.handle.net/11250/137488
dc.description: Master's thesis in Information and Communication Technology, 2010, University of Agder, Grimstad (en_US)
dc.description.abstract: Traditionally, spam filtering systems have been built on content-based analysis techniques developed from experience with e-mail spam. Recently, a new style of information platform, social media, has appeared on the Internet, which also expands the space available to Internet abusers. In this thesis, we not only evaluated traditional content-based approaches to classifying spam messages, but also investigated the possibility of integrating context-based technology with content-based approaches. We built spam classifiers using a novelty detection approach combined with Naïve Bayes, k-nearest neighbour, and a self-organizing map, respectively, and tested each of them on a large amount of experimental data. We also went a step beyond previous research by integrating the self-organizing map with Naïve Bayes to carry out spam classification. The results show that judiciously combining context-based approaches with a content-based spam classifier can improve the classifier's performance in several respects. In addition, the results from the self-organizing map classifier with Naïve Bayes show a promising future for data clustering methods in spam filtering. We therefore believe this thesis offers new insight into natural language processing, and that the methods and techniques it proposes give researchers in the spam filtering field a good tool for analyzing context-based spam messages. (en_US)
dc.language.iso: eng (en_US)
dc.publisher: University of Agder (en_US)
dc.title: Spam classification for online discussions (en_US)
dc.type: Master thesis (en_US)
dc.source.pagenumber: 52 (en_US)

