Publication: Recovering "lack of words" in text categorization for item banks
Issued Date
2005-12-01
ISSN
07303157
Other identifier(s)
2-s2.0-34248536520
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Proceedings - International Computer Software and Applications Conference. Vol.2, (2005), 31-32
Suggested Citation
Atorn Nuntiyagul, Nick Cercone, Kanlaya Naruedomkul. Recovering "lack of words" in text categorization for item banks. Proceedings - International Computer Software and Applications Conference. Vol.2, (2005), 31-32. doi:10.1109/COMPSAC.2005.128 Retrieved from: https://repository.li.mahidol.ac.th/handle/123456789/16464
Abstract
PKIP (Patterned Keywords in Phrase) is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item consists of short sentences and contains only a few words relevant to categorization. Traditional TC techniques cannot provide sufficiently accurate results on such data because of this "lack of words" problem. PKIP improves categorization accuracy and recovers from the "lack of words" problem. Our sample item bank is a collection of Thai primary mathematics problems, and we use an SVM as our classifier. Classification results show that PKIP produces acceptable classification performance. © 2005 IEEE.
