Research on the Method of Tibetan Recognition Based on Component Location Information
The recognition of Tibetan is of great significance to the study of Tibetan culture, yet progress in Tibetan character recognition lags behind. Recognition is especially difficult when a large number of training samples is not available. We therefore propose a recognition method for Tibetan characters, based on component location information, that does not require a large number of training samples. The proposed method has three main parts: (1) segmentation of the character and extraction of its components, which carry location information within the character; (2) feature extraction and classifier design; (3) superposition of the components after recognition and retrieval of the character. In testing, the recognition rate is 98.4% for single components and 97.2% for multilevel components. These results indicate that the method is effective for Tibetan character recognition and is helpful for the recognition of Tibetan documents.
Tibetan recognition; Character segment; Component combination; Classifier design
Yuehui Han, Weilan Wang, Yiqun Wang, Xiaojuan Wang
Key Laboratory of China's Ethnic Languages and Information Technology of Ministry of Education, North
International conference
Guangzhou
English
63-73
2018-11-23 (date first posted online on the Wanfang platform; not the paper's publication date)