Publication:
Combining over-sampling and under-sampling techniques for imbalance dataset

dc.contributor.authorNutthaporn Junsomboonen_US
dc.contributor.authorTanasanee Phienthrakulen_US
dc.contributor.otherMahidol Universityen_US
dc.date.accessioned2018-12-21T07:21:49Z
dc.date.accessioned2019-03-14T08:03:24Z
dc.date.available2018-12-21T07:21:49Z
dc.date.available2019-03-14T08:03:24Z
dc.date.issued2017-02-24en_US
dc.description.abstract© 2017 ACM. An important problem in medical data analysis is imbalance dataset. This problem is a cause of diagnostic mistake. The results of diagnostic affect to life of patients. If a doctor fails in diagnostic of patient who have disease that means he cannot treat patient in timely. However, the problem can be easily solved by adding or removing the data to closely balance for performance of diagnostic in medically. This paper proposed a solution to adjust imbalance dataset by combining Neighbor Cleaning Rule (NCL) and Synthetic Minority Over-Sampling Technique (SMOTE) techniques. The process of work is using NCL technique for removing sample data that are outliers in majority class and SMOTE technique is used for increasing sample data in minority class to closely balance dataset. After that, the balanced medical dataset is classified by Naïve Bayes, SMO and KNN algorithm. The experimental results show that the recall rate can be improved from the models that were created from balanced dataset.en_US
dc.identifier.citationACM International Conference Proceeding Series. Vol.Part F128357, (2017), 243-247en_US
dc.identifier.doi10.1145/3055635.3056643en_US
dc.identifier.other2-s2.0-85024401679en_US
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/42358
dc.rightsMahidol Universityen_US
dc.rights.holderSCOPUSen_US
dc.source.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85024401679&origin=inwarden_US
dc.subjectComputer Scienceen_US
dc.titleCombining over-sampling and under-sampling techniques for imbalance dataseten_US
dc.typeConference Paperen_US
dspace.entity.typePublication
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85024401679&origin=inwarden_US

Files

Collections