OPTICAL FORMULA EXTRACTION BASED ON IRREGULARITY DEGREE
Optical formula extraction is considered as an important step of mathematical formula recognition, which can convert scientific papers into their corresponding electronic format. So far little research has been done in this area. This paper proposes an approach of extracting embedded formulas that first invokes a searching algorithm to find the connected components of the input document, calculates the layout feature of every component based on irregularity degree, and then locates the formula symbols according to the features. Finally, several measurements including linking grammar are used to locate the formula areas. The experimental results indicate that the proposed method can obtain favorable results.
Optical formula recognition formula extraction connected components irregularity degree linking grammar
XUE-DONG TIAN DA-ZENG TIAN MING-HU HA
College of Physics Science and Technology, Hebei University, Baoding 071002, China
国际会议
2006 International Conference on Machine Learning and Cybernetics(IEEE第五届机器学习与控制论坛)
大连
英文
3365-3369
2006-08-13(万方平台首次上网日期,不代表论文的发表时间)