Incremental Naïve Bayesian Spam Mail Filtering and Variant Incremental Training

dc.contributor.authorPhimphaka Taninpongen_US
dc.contributor.authorSudsanguan Ngamsuriyarojen_US
dc.contributor.authorสุดสงวน งามสุริยโรจน์en_US
dc.contributor.otherMahidol University. Faculty of Science. Department of Computer Scienceen_US
dc.contributor.otherMahidol University. Faculty of Information and Communication Technology
dc.date.accessioned2018-04-02T09:57:50Z
dc.date.available2018-04-02T09:57:50Z
dc.date.created2018-04-02
dc.date.issued2009
dc.descriptionThe 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2009). Pine City Hotel, Shanghai, China, page 383-387
dc.description.abstractThis paper proposes an incremental spam mail filtering using Naïve Bayesian classification which gives simplicity and adaptability. To keep the training set to a limited size and small, the sliding window is applied and the training set is updated when new emails are received. In effect, features in the training set are incrementally updated, and the model would be adaptive to a new spam pattern. In addition, we present three incremental training schemes: a window containing only the most recent emails, a window containing the previous batch of emails, and a window containing all already seen emails. The proposed model is evaluated using two spam corpora: Trec05p-1 [12] and Trec06p [13]. In our experiments, the window size is varied, the processing time per message, and the ham and spam misclassification rates are measured. The results show that the third incremental training scheme gives the best outcomes, and the window size significantly affects the misclassification rates and the processing time.en_US
dc.identifier.isbn978-0-7695-3641-5
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/10453
dc.language.isoengen_US
dc.rightsMahidol Universityen_US
dc.rights.holderIEEEXPLOREen_US
dc.subjectBayesian methodsen_US
dc.subjectUnsolicited electronic mailen_US
dc.subjectPostal servicesen_US
dc.subjectFilteringen_US
dc.subjectPeer to peer computingen_US
dc.subjectAvailabilityen_US
dc.subjectSpace technologyen_US
dc.subjectComputer scienceen_US
dc.subjectComputer network reliabilityen_US
dc.subjectComputer networksen_US
dc.titleIncremental Naïve Bayesian Spam Mail Filtering and Variant Incremental Trainingen_US
dc.typeProceeding Articleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: