Feature Comparison for Automatic Bug Report Classification

Bancha Luaphol; Boonchoo Srikudkao; Tontrakant Kachai; Natthakit Srikanjanapert; Jantima Polpinij; Poramin Bheganan

Publication:
Feature Comparison for Automatic Bug Report Classification

dc.contributor.author	Bancha Luaphol	en_US
dc.contributor.author	Boonchoo Srikudkao	en_US
dc.contributor.author	Tontrakant Kachai	en_US
dc.contributor.author	Natthakit Srikanjanapert	en_US
dc.contributor.author	Jantima Polpinij	en_US
dc.contributor.author	Poramin Bheganan	en_US
dc.contributor.other	Mahidol University	en_US
dc.contributor.other	Mahasarakham University	en_US
dc.date.accessioned	2020-01-27T03:32:14Z
dc.date.available	2020-01-27T03:32:14Z
dc.date.issued	2020-01-01	en_US
dc.description.abstract	© 2020, Springer Nature Switzerland AG. Nowadays, various bug tracking systems (BTS) such as Jira, Trace, and Bugzilla have been developed and proposed to gather the issues from users worldwide. This is because those issues, called bug reports, contain a significant information for software quality maintenance and improvement. However, many bug reports with poor quality might have been submitted to the BTS. In general, the reported bugs in the BTS are firstly analyzed and filtered out by bug triagers. However, with the increasing amount of bug reports in the BTS, manually classifying bug reports is a time-consuming task. To address this problem, automatically distinguishing of bugs and non-bugs is necessary. To the best of our knowledge, this task is never easy for bug reports classification because the problem of bug reports misclassification still occurs to date. The background of this problem may be arise from using inappropriate or confusing features. Therefore, this work aims to study and discover the most proper features for binary bug report classification. This study compares seven features such as unigram, bigram, camel case, unigram+bigram, unigram+camel case, bigram+ camel case, and all features together. The experimental results show that the unigram+camel case should be the most proper features for binary bug report classification, especially when using with the logistic regression algorithm. Consequently, the unigram+camel case should be the proper feature to distinguish bug reports from the non-bugs ones.	en_US
dc.identifier.citation	Advances in Intelligent Systems and Computing. Vol.936, (2020), 69-78	en_US
dc.identifier.doi	10.1007/978-3-030-19861-9_7	en_US
dc.identifier.issn	21945365	en_US
dc.identifier.issn	21945357	en_US
dc.identifier.other	2-s2.0-85065902706	en_US
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/20.500.14594/49588
dc.rights	Mahidol University	en_US
dc.rights.holder	SCOPUS	en_US
dc.source.uri	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85065902706&origin=inward	en_US
dc.subject	Computer Science	en_US
dc.subject	Engineering	en_US
dc.title	Feature Comparison for Automatic Bug Report Classification	en_US
dc.type	Conference Paper	en_US
dspace.entity.type	Publication
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85065902706&origin=inward	en_US

Collections

Scopus 2020

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Publication: Feature Comparison for Automatic Bug Report Classification

Files

Collections

Publication:
Feature Comparison for Automatic Bug Report Classification