A MANUAL EXPERIMENT ON COMMONSENSE KNOWLEDGE ACQUISITION FROM WEB CORPORA
Acquiring commonsense knowledge from text is an important but challenging problem. In this paper, we describe a three-subject experiment on commonsense knowledge acquisition from Chinese sentences extracted from a web corpus, aiming to investigate how people acquire commonsensical assertions from given sentences. We analyze the experimental results using an agreement test, a concordance test, and a divergence test. An important conclusion of our experiment is that sentences differ in their suitability for commonsense knowledge acquisition, i.e., in their difficulty grade. This difficulty grade also affects the number of commonsensical assertions acquired from a sentence, as well as the differences in acquisition performance among human subjects. We also discuss the problem of characterizing the difficulty grade by the co-occurrence frequency of words and by basic level category words.
Commonsense Knowledge Acquisition; Manual Experiment; Web Corpora; Agreement Test; Concordance Test; Divergence Test; Co-occurrence Frequency; Basic Level Category
YAO ZHU; LIANG-JUN ZANG; YA-NAN CAO; DONG-SHENG WANG; CUN-GEN CAO
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Aca
International conference
2008 International Conference on Machine Learning and Cybernetics
Kunming
English
1564-1569
2008-07-12 (date first posted online on the Wanfang platform; not the paper's publication date)