A NEW APPROACH FOR UNDERSTANDING OF STRUCTURE OF PRINTED MATHEMATICAL EXPRESSION
This paper introduces a new approach to automatically understanding the structure of printed mathematical expressions (MEs). The method consists of three stages: matrix analysis, sub-expression analysis, and script expression analysis. In matrix analysis, an ME is decomposed into several basic matrices and some sub-expressions by reconstructing the global structure of the ME, and each basic matrix is then analyzed bottom-up. Sub-expression analysis proceeds in the same way, decomposing a sub-expression into several basic sub-expressions and some script expressions. In script analysis, a graph rewriting algorithm is used to build script relation trees among the symbols within a script expression. To calculate the confidence of the spatial relation between two symbols, a spatial relation model is built on a Gaussian Mixture Model (GMM). Experiments were carried out on a database of 3268 images, and the results show that the proposed method works well: the top-1 perfect analysis accuracy reaches 92.3%.
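As a rough illustration of how a GMM-based spatial relation model could assign confidences to the relation between two symbols, the following minimal sketch scores a pair of bounding boxes against one small mixture per relation. Everything specific here is an assumption made for illustration and is not taken from the paper: the two features (`dy`, `s`), the relation names, the function names, and the hand-picked mixture parameters in `RELATION_GMMS`. In practice such parameters would be estimated from labelled symbol pairs.

```python
import math

def pair_features(parent, child):
    """Hypothetical features for a symbol pair.

    Boxes are (x_min, y_min, x_max, y_max) with y growing downward.
    Returns (dy, s): the child's vertical centre offset relative to the
    parent (positive = child is higher) and the child/parent height
    ratio, both normalised by the parent's height.
    """
    parent_h = parent[3] - parent[1]
    child_h = child[3] - child[1]
    parent_cy = (parent[1] + parent[3]) / 2
    child_cy = (child[1] + child[3]) / 2
    return ((parent_cy - child_cy) / parent_h, child_h / parent_h)

def gaussian_diag(x, mean, var):
    """Density of a Gaussian with diagonal covariance."""
    p = 1.0
    for xi, mi, vi in zip(x, mean, var):
        p *= math.exp(-(xi - mi) ** 2 / (2 * vi)) / math.sqrt(2 * math.pi * vi)
    return p

def gmm_density(x, components):
    """Weighted sum of component densities; components are (weight, mean, var)."""
    return sum(w * gaussian_diag(x, m, v) for w, m, v in components)

# Illustrative, hand-picked parameters (NOT values from the paper).
RELATION_GMMS = {
    "superscript": [(0.7, (0.45, 0.6), (0.02, 0.02)),
                    (0.3, (0.60, 0.5), (0.03, 0.02))],
    "subscript":   [(0.7, (-0.45, 0.6), (0.02, 0.02)),
                    (0.3, (-0.60, 0.5), (0.03, 0.02))],
    "horizontal":  [(1.0, (0.0, 1.0), (0.01, 0.03))],
}

def relation_confidences(parent, child):
    """Normalise per-relation GMM likelihoods into confidences summing to 1."""
    x = pair_features(parent, child)
    lik = {r: gmm_density(x, comps) for r, comps in RELATION_GMMS.items()}
    total = sum(lik.values())
    if total == 0.0:
        # All likelihoods underflowed: fall back to a uniform distribution.
        return {r: 1.0 / len(lik) for r in lik}
    return {r: l / total for r, l in lik.items()}

if __name__ == "__main__":
    base = (0, 20, 20, 60)        # parent symbol box
    raised = (22, 10, 34, 34)     # smaller box, up and to the right
    print(relation_confidences(base, raised))
```

Keeping a confidence for every relation, rather than only the best one, is what would allow a later stage to consider several candidate interpretations, in line with the multi-candidate keyword of the record.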
Multi-candidate; Printed mathematical expression; Gaussian Mixture Model; Spatial relation model
YU-SHENG GUO; LEI HUANG; CHANG-PING LIU
Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China; Graduate School of Chine Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
International conference
2007 International Conference on Machine Learning and Cybernetics (IEEE 6th International Conference on Machine Learning and Cybernetics)
Hong Kong
English
2633-2638
2007-08-19 (date first posted online on the Wanfang platform; not the paper's publication date)