Conference Topic

FARSI/ARABIC DOCUMENT IMAGE RETRIEVAL THROUGH SUB-LETTER SHAPE CODING

In this paper, a novel method for recognition-free Farsi document retrieval is proposed. In this method, retrieval is performed by recognizing sub-letters and other letter elements, such as dots and signs like Sarkesh. First, in the preprocessing phase, lines and words are extracted using the blank space between them. In the next phase, each word is divided into its sub-words, where a sub-word is a sequence of joined letters. For each sub-word, the connectors between sub-letters are removed from its main body, and the remaining parts are recognized as sub-letters using their extracted features. The recognized sub-letters are encoded using a dictionary defined in this system. Finally, the document content is encoded, and this code can be used to retrieve words that occur in the document. Experimental results show the advantages of this method for retrieving printed Persian documents.
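The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the sub-letter labels, and the code letters in `SHAPE_CODES` are all hypothetical, since the abstract does not give the actual dictionary or features. The sketch assumes a projection profile (ink-pixel counts per row or column) is already available, splits it on blank gaps, and then encodes already-recognized sub-letter labels into a shape code that is matched exactly against a query.

```python
from typing import Dict, List, Sequence, Tuple


def split_on_blank(profile: Sequence[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Split a projection profile into ink runs separated by blank space.

    Returns half-open (start, end) index ranges. A run is closed only after
    at least `min_gap` consecutive blank positions (an assumed parameter).
    """
    segments: List[Tuple[int, int]] = []
    start = None  # index where the current ink run began
    gap = 0       # number of consecutive blank positions seen so far
    for i, ink in enumerate(profile):
        if ink > 0:
            if start is None:
                start = i
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                segments.append((start, i - gap + 1))
                start, gap = None, 0
    if start is not None:
        # Close a run that reaches the end, excluding trailing blanks.
        segments.append((start, len(profile) - gap))
    return segments


# Hypothetical shape-code dictionary; the real codes are not in the abstract.
SHAPE_CODES: Dict[str, str] = {
    "bowl": "A",
    "vertical_stroke": "B",
    "loop": "C",
    "dot_above": "d",
    "dot_below": "e",
    "sarkesh": "s",
}


def encode_word(sub_words: List[List[str]]) -> str:
    """Encode a word given the recognized sub-letter labels of each sub-word."""
    return "-".join("".join(SHAPE_CODES[label] for label in sw) for sw in sub_words)


def find_word(document_codes: List[str], query_code: str) -> List[int]:
    """Return the positions of words whose shape code equals the query code."""
    return [i for i, code in enumerate(document_codes) if code == query_code]


if __name__ == "__main__":
    print(split_on_blank([0, 3, 4, 0, 0, 2, 2, 2, 0]))
    doc = [
        encode_word([["vertical_stroke"], ["bowl", "dot_below"]]),
        encode_word([["loop", "dot_above"]]),
    ]
    print(doc, find_word(doc, "Cd"))
```

Because matching is done on code strings rather than recognized characters, retrieval in this sketch reduces to a string comparison; a practical system would likely need some tolerance for sub-letter recognition errors.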

Retrieval; shape code; sub-word; sub-letter; baseline

ZAHRA BAHMANI, REZA AZMI

Computer Department, Alzahra University, Tehran, Iran

International conference

2011 3rd International Conference on Computer Technology and Development (ICCTD 2011)

Chengdu

English

2279-2283

2011-11-25 (date first made available online on the Wanfang platform; not the paper's publication date)