Automatic Construction of Biomedical Abbreviations Dictionary from Text
The number of biomedical abbreviations is growing rapidly. Automatic construction of a biomedical abbreviation dictionary from text helps readers understand the biomedical literature and helps keep existing databases, ontologies, and dictionaries up to date. This paper proposes a new method for automatically constructing a biomedical abbreviation dictionary from text by combining a string matching algorithm with a searching algorithm. The string matching algorithm extracts abbreviations and their long forms. The searching algorithm corrects the false long forms produced by the string matching algorithm; it is based on the idea that readers often look up related articles to judge whether the long form of an abbreviation is correct. Our experiments show that the method achieves high precision (97.5%) and recall (82.2%). Because no tagged corpus is required, the method is also efficient.
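The abstract does not give the details of the string matching step, so the following is only a minimal sketch of one common heuristic, not the authors' algorithm. It assumes that an abbreviation appears in parentheses directly after its long form and that each character of the abbreviation is the initial of one preceding word. The names `ABBR_IN_PARENS` and `extract_pairs` are hypothetical and chosen for illustration.

```python
import re

# Assumed pattern: a parenthesized token of 2 to 10 letters or digits that starts with a letter.
ABBR_IN_PARENS = re.compile(r"\(([A-Za-z][A-Za-z0-9]{1,9})\)")


def extract_pairs(text):
    """Return (abbreviation, long form) pairs found by initial-letter matching."""
    pairs = []
    for match in ABBR_IN_PARENS.finditer(text):
        abbr = match.group(1)
        # Require an uppercase letter so ordinary words in parentheses are skipped.
        if not any(ch.isupper() for ch in abbr):
            continue
        # Candidate long form: as many preceding words as the abbreviation has characters.
        words = re.findall(r"[A-Za-z0-9]+", text[:match.start()])
        candidate = words[-len(abbr):]
        if len(candidate) < len(abbr):
            continue
        # Accept the candidate only if its word initials spell the abbreviation.
        initials = "".join(word[0] for word in candidate)
        if initials.lower() == abbr.lower():
            pairs.append((abbr, " ".join(candidate)))
    return pairs


sample = (
    "Mutations in the epidermal growth factor receptor (EGFR) are common. "
    "Levels of tumor necrosis factor alpha (TNF) were measured (left)."
)
print(extract_pairs(sample))
```

On this sample the sketch pairs EGFR with "epidermal growth factor receptor" but finds no long form for TNF, because the three words before it do not spell the abbreviation. Misses and mismatches of this kind are what a second, correcting step such as the searching algorithm described in the abstract would need to handle.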
Text mining; biomedical abbreviations
Changqin QUAN Fuji REN Tingting HE Po HU
Dept. of Computer Science, Huazhong Normal University, Wuhan, China; Dept. of Info. Science & Intelligent S; Dept. of Info. Science & Intelligent Systems, Faculty of Engineering, The University of Tokushima, Tokush; Dept. of Computer Science, Huazhong Normal University, Wuhan, China
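International conference
Beijing
English
2008-10-19 (date first made available online on the Wanfang platform; not the paper's publication date)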