Publication: Combining over-sampling and under-sampling techniques for imbalance dataset
dc.contributor.author | Nutthaporn Junsomboon | en_US |
dc.contributor.author | Tanasanee Phienthrakul | en_US |
dc.contributor.other | Mahidol University | en_US |
dc.date.accessioned | 2018-12-21T07:21:49Z | |
dc.date.accessioned | 2019-03-14T08:03:24Z | |
dc.date.available | 2018-12-21T07:21:49Z | |
dc.date.available | 2019-03-14T08:03:24Z | |
dc.date.issued | 2017-02-24 | en_US |
dc.description.abstract | © 2017 ACM. Imbalanced datasets are an important problem in medical data analysis because they can cause diagnostic mistakes, and diagnostic results affect patients' lives. If a doctor fails to diagnose a patient who has a disease, the patient cannot be treated in time. However, the problem can be easily addressed by adding or removing data so that the classes are closer to balanced, which improves diagnostic performance. This paper proposes a way to adjust imbalanced datasets by combining the Neighbor Cleaning Rule (NCL) and the Synthetic Minority Over-Sampling Technique (SMOTE). NCL is used to remove outlier samples from the majority class, and SMOTE is used to add samples to the minority class, so that the dataset becomes closer to balanced. The balanced medical dataset is then classified with the Naïve Bayes, SMO, and KNN algorithms. The experimental results show that models built from the balanced dataset achieve an improved recall rate. | en_US |
dc.identifier.citation | ACM International Conference Proceeding Series. Vol.Part F128357, (2017), 243-247 | en_US |
dc.identifier.doi | 10.1145/3055635.3056643 | en_US |
dc.identifier.other | 2-s2.0-85024401679 | en_US |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/42358 | |
dc.rights | Mahidol University | en_US |
dc.rights.holder | SCOPUS | en_US |
dc.source.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85024401679&origin=inward | en_US |
dc.subject | Computer Science | en_US |
dc.title | Combining over-sampling and under-sampling techniques for imbalance dataset | en_US |
dc.type | Conference Paper | en_US |
dspace.entity.type | Publication | |
mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85024401679&origin=inward | en_US |
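
The resampling pipeline described in the abstract (clean the majority class, then over-sample the minority class) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the cleaning step is a simplified stand-in for NCL that only drops majority samples outvoted by their nearest neighbours, the SMOTE step is a basic interpolation variant that needs at least two minority samples, and the function names (k_nearest, clean_majority, smote), the value k=3, and the toy data are all hypothetical.

```python
import math
import random


def k_nearest(point, pool, k):
    """Return indices of the k points in pool closest to point (Euclidean)."""
    order = sorted(range(len(pool)), key=lambda i: math.dist(point, pool[i]))
    return order[:k]


def clean_majority(X, y, majority, k=3):
    """Simplified cleaning step (assumption, not full NCL): drop each
    majority sample whose k nearest neighbours mostly belong to another class."""
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi == majority:
            others = [j for j in range(len(X)) if j != i]
            nn = k_nearest(xi, [X[j] for j in others], k)
            votes = sum(1 for n in nn if y[others[n]] == majority)
            if votes * 2 < len(nn):  # outvoted by its neighbours
                continue
        keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


def smote(X, y, minority, n_new, k=3, seed=0):
    """Basic SMOTE-style over-sampling: interpolate between a minority
    sample and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    minority_pts = [x for x, label in zip(X, y) if label == minority]
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority_pts)
        rest = [p for p in minority_pts if p is not base]
        neighbours = k_nearest(base, rest, min(k, len(rest)))
        other = rest[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return X + synthetic, y + [minority] * n_new


# Toy data: class 0 is the majority, class 1 the minority.
# The last majority point sits inside the minority cluster.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [0.5, 0.5], [0.2, 0.8], [5.1, 5.0],
     [5, 5], [5, 6], [6, 5]]
y = [0] * 7 + [1] * 3

# Step 1: remove majority samples that look like outliers.
X_clean, y_clean = clean_majority(X, y, majority=0)

# Step 2: add synthetic minority samples until the classes match in size.
n_new = y_clean.count(0) - y_clean.count(1)
X_bal, y_bal = smote(X_clean, y_clean, minority=1, n_new=n_new)

print("before:", y.count(0), "majority /", y.count(1), "minority")
print("after: ", y_bal.count(0), "majority /", y_bal.count(1), "minority")
```

Any classifier, such as the Naïve Bayes, SMO, or KNN models named in the abstract, would then be trained on X_bal and y_bal. In practice a maintained library implementation of both steps is likely preferable to this hand-rolled sketch.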