Publication: OCR with word prediction technique for bilingual documents
Issued Date
2012-07-25
Resource Type
Other identifier(s)
2-s2.0-84864049029
Rights
Mahidol University
Rights Holder(s)
SCOPUS
Bibliographic Citation
Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012. (2012), 343-347
Suggested Citation
Supachai Tangwongsan, Buntida Suvacharakulton. OCR with word prediction technique for bilingual documents. Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012. (2012), 343-347. doi:10.1109/ICIS.2012.77 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/14044
Title
OCR with word prediction technique for bilingual documents
Author(s)
Other Contributor(s)
Abstract
This paper proposes a working model of a bilingual OCR system for printed Thai and English text that uses a word prediction technique. Instead of recognizing individual characters from an image block, as in the conventional approach, the system attempts to match the whole word against a list of predicted words based on n-gram trees. Matching is done in the word verification stage, in which both positive and negative matching are performed. If there is a match, the system skips ahead to the word boundary and proceeds to the next word. The longer the matched word, the greater the performance gain. A series of experiments shows an average speed improvement of 21% while maintaining the expected recognition accuracy. © 2012 IEEE.
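The idea of verifying a predicted whole word instead of classifying every character can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not define the n-gram trees or the negative matching step, so the sketch assumes a simple prefix index in their place and performs only a positive check. The "image" is simulated by a plain string, and the names build_prefix_index, recognize, and prefix_len are hypothetical.

```python
from collections import defaultdict


def build_prefix_index(words, prefix_len=2):
    """Map each prefix of length prefix_len to candidate words, longest first.

    Assumption: a flat prefix index stands in for the paper's n-gram trees.
    """
    index = defaultdict(list)
    for word in set(words):
        if len(word) > prefix_len:
            index[word[:prefix_len]].append(word)
    for candidates in index.values():
        candidates.sort(key=lambda w: (-len(w), w))
    return dict(index)


def recognize(glyphs, index, prefix_len=2):
    """Recognize a simulated line of text.

    glyphs is a string standing in for segmented character images.
    Returns the recognized text and counters for per-character
    classifications and whole-word verifications.
    """
    out = []
    stats = {"char_calls": 0, "word_checks": 0}
    n = len(glyphs)
    i = 0
    while i < n:
        if glyphs[i] == " ":  # word gap, treated as free to detect
            out.append(" ")
            i += 1
            continue
        start = i
        prefix = ""
        # Classify the first few characters of the word individually.
        while i < n and glyphs[i] != " " and len(prefix) < prefix_len:
            prefix += glyphs[i]
            stats["char_calls"] += 1
            i += 1
        matched = None
        if len(prefix) == prefix_len:
            for cand in index.get(prefix, []):
                stats["word_checks"] += 1
                end = start + len(cand)
                # Positive check: candidate fits the block and ends on a boundary.
                if glyphs[start:end] == cand and (end == n or glyphs[end] == " "):
                    matched = cand
                    break
        if matched is not None:
            out.append(matched)
            i = start + len(matched)  # skip ahead to the word boundary
        else:
            out.append(prefix)
            # Fall back to character-by-character recognition.
            while i < n and glyphs[i] != " ":
                out.append(glyphs[i])
                stats["char_calls"] += 1
                i += 1
    return "".join(out), stats


vocab = ["word", "prediction", "recognition"]
index = build_prefix_index(vocab)
text, stats = recognize("word prediction for recognition", index)
```

In this toy run, only the word missing from the vocabulary is classified character by character. Each matched word costs a short prefix plus one verification, so the saving grows with word length, which is consistent with the abstract's remark about longer matched words.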