DISCOVERING CHINESE CONCEPT-IN-CORPUS
Concepts are the basis of knowledge. A concept consists of a connotation and an extension. This paper introduces the notion of a Concept-In-Corpus, a special kind of formal concept, and presents a discovery algorithm called FCWFT (Filtering Concept-word Based on Feature-tree), which automatically mines the connotation and the extension of a Chinese Concept-In-Corpus from a Chinese corpus. Our work is the first attempt to mine formal concepts from free text in the area of Natural Language Processing. We test the algorithm on a large-scale corpus, and the results are encouraging.
Natural Language Processing; Knowledge; Concept; Concept-word; Concept-IC; Feature-tree
JIAN-CHAO CHEN; QI-LUN ZHENG; ZHAO LI
School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510640, China
International conference
2008 International Conference on Machine Learning and Cybernetics
Kunming
English
2534-2539
2008-07-12 (date first posted online on the Wanfang platform; not necessarily the paper's publication date)