Publication: Keyword extraction strategy for item banks text categorization
Issued Date
2007-02-01
Resource Type
ISSN
14678640
08247935
08247935
Other identifier(s)
2-s2.0-34247627069
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Computational Intelligence. Vol.23, No.1 (2007), 28-44
Suggested Citation
Atorn Nuntiyagul, Kanlaya Naruedomkul, Nick Cercone, Damras Wongsawang Keyword extraction strategy for item banks text categorization. Computational Intelligence. Vol.23, No.1 (2007), 28-44. doi:10.1111/j.1467-8640.2007.00293.x Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/24404
Research Projects
Organizational Units
Authors
Journal Issue
Thesis
Title
Keyword extraction strategy for item banks text categorization
Other Contributor(s)
Abstract
We proposed a feature selection approach, Patterned Keyword in Phrase (PKIP), to text categorization for item banks. The item bank is a collection of textual question items that are short sentences. Each sentence does not contain enough relevant words for directly categorizing by the traditional approaches such as "bag-of-words." Therefore, PKIP was designed to categorize such question item using only available keywords and their patterns. PKIP identifies the appropriate keywords by computing the weight of all words. In this paper, two keyword selection strategies are suggested to ensure the categorization accuracy of PKIP. PKIP was implemented and tested with the item bank of Thai high primary mathematics questions. The test results have proved that PKIP is able to categorize the question items correctly and the two keyword selection strategies can extract the very informative keywords. © 2007 Blackwell Publishing, Inc.