Exploiting Syntactic and Semantic Information in Coarse Chinese Question Classification
Recent years have seen great process in studying English question classification. In our research, we learn Chinese question classification by exploiting the result of lexical, syntactic and semantic parsing on question sentences. Support Vector Machines are adopted to train a classifier on 6 coarse categories using single and combination of different parsing results as features. We find that even the surface information such as words and Parts of Speech could lead to a satisfying result, while augmenting the classifier with syntactic and semantic features could give even higher precision. However, the lack of words and incomplete syntactic structures among most questions cause combination of features even sparser than single features in the feature space, with much side effect brought to the performance of Chinese question classification.
Xin Kang Xiaojie Wang Fuji Ren
Beijing University of Posts and Telecommunications The University of Tokushima Beijing University of Posts and Telecommunications
国际会议
北京
英文
2008-10-19(万方平台首次上网日期,不代表论文的发表时间)