Incremental Naïve Bayesian Spam Mail Filtering and Variant Incremental Training
dc.contributor.author | Phimphaka Taninpong | en_US |
dc.contributor.author | Sudsanguan Ngamsuriyaroj | en_US |
dc.contributor.author | สุดสงวน งามสุริยโรจน์ | en_US |
dc.contributor.other | Mahidol University. Faculty of Science. Department of Computer Science | en_US |
dc.contributor.other | Mahidol University. Faculty of Information and Communication Technology | |
dc.date.accessioned | 2018-04-02T09:57:50Z | |
dc.date.available | 2018-04-02T09:57:50Z | |
dc.date.created | 2018-04-02 | |
dc.date.issued | 2009 | |
dc.description | The 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS 2009). Pine City Hotel, Shanghai, China, page 383-387 | |
dc.description.abstract | This paper proposes an incremental spam mail filtering using Naïve Bayesian classification which gives simplicity and adaptability. To keep the training set to a limited size and small, the sliding window is applied and the training set is updated when new emails are received. In effect, features in the training set are incrementally updated, and the model would be adaptive to a new spam pattern. In addition, we present three incremental training schemes: a window containing only the most recent emails, a window containing the previous batch of emails, and a window containing all already seen emails. The proposed model is evaluated using two spam corpora: Trec05p-1 [12] and Trec06p [13]. In our experiments, the window size is varied, the processing time per message, and the ham and spam misclassification rates are measured. The results show that the third incremental training scheme gives the best outcomes, and the window size significantly affects the misclassification rates and the processing time. | en_US |
dc.identifier.isbn | 978-0-7695-3641-5 | |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/10453 | |
dc.language.iso | eng | en_US |
dc.rights | Mahidol University | en_US |
dc.rights.holder | IEEEXPLORE | en_US |
dc.subject | Bayesian methods | en_US |
dc.subject | Unsolicited electronic mail | en_US |
dc.subject | Postal services | en_US |
dc.subject | Filtering | en_US |
dc.subject | Peer to peer computing | en_US |
dc.subject | Availability | en_US |
dc.subject | Space technology | en_US |
dc.subject | Computer science | en_US |
dc.subject | Computer network reliability | en_US |
dc.subject | Computer networks | en_US |
dc.title | Incremental Naïve Bayesian Spam Mail Filtering and Variant Incremental Training | en_US |
dc.type | Proceeding Article | en_US |
Files
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: