Using Genetic Algorithm for Persian Grammar Induction

Most of efficient computational approaches in NLP tasks are supervised methods which need annotated corpora. But the lack of supervised data in Persian encourages researchers to increase their interests and efforts on unsupervised and semi-supervised approaches. This paper presents a novel semi-supervised approach which called Genetic-based insideoutside (GIO), for Persian grammar inference for inducing a grammar model in a PCFG formalism. GIO is an extension of the insideoutside algorithm enriched by some notions of genetic algorithm. In pure genetic algorithm for grammar induction, randomly generated initial population make it computationally expensive, so we used inside-outside algorithm to generate initial population. Our experiments show that our approach’s result is better than other applied methods for Persian grammar induction.
Grammar induction genetic algorithm inside-outside algorithm Persian grammar
Mohsen Arabsorkhi Hesham Faili Mansoor Zolghadri Jahroumi
Computer Engineering Dept., Islamic Azad University of Saveh Saveh, Iran Computer Engineering Dept., Tehran University Tehran, Iran Computer Science and Engineering Dept., Shiraz University Shiraz, Iran
国际会议
大连
英文
1-6
2009-09-24(万方平台首次上网日期,不代表论文的发表时间)