Khmer WordNet construction
Issued Date
2024
Copyright Date
2020
Resource Type
Language
eng
File Type
application/pdf
No. of Pages/File Size
x, 82 leaves: ill.
Access Rights
open access
Rights
ผลงานนี้เป็นลิขสิทธิ์ของมหาวิทยาลัยมหิดล ขอสงวนไว้สำหรับเพื่อการศึกษาเท่านั้น ต้องอ้างอิงแหล่งที่มา ห้ามดัดแปลงเนื้อหา และห้ามนำไปใช้เพื่อการค้า
Rights Holder(s)
Mahidol University
Bibliographic Citation
Thesis (M.Sc. (Computer Science))--Mahidol University, 2020
Suggested Citation
Udorm, Phon, 1996- Khmer WordNet construction. Thesis (M.Sc. (Computer Science))--Mahidol University, 2020. Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/99478
Title
Khmer WordNet construction
Author(s)
Abstract
This thesis described a semi-automatic translation-based expansion approach for Khmer WordNet construction. This approach expanded the Princeton English WordNet's synsets and took into account the existing machine-readable Cambodian-English dictionary. The approach started with the cleansing and extracting the translation links and the semantic links from the Cambodian-English dictionary and the Princeton WordNet, respectively. Concerning the translation links and the semantic links, a large number of candidate links between Khmer words and English synsets can be derived. This approach applied a statistical approach sampling these candidate links and constructed a statistical model based on human verification of the sample of the candidate links. Finally, the statistical model is applied to construct the Khmer WordNet with promising coverage and accuracy. This Khmer WordNet covered 3,402 Khmer words, 3,571 synsets, 6,137 links, and 13.86% core concept coverage. The Khmer WordNet Web-based application has been developed.
Description
Computer Science (Mahidol University 2020)
Degree Name
Master of Science
Degree Level
Master's degree
Degree Department
Faculty of Information and Communication Technology
Degree Discipline
Computer Science
Degree Grantor(s)
Mahidol University