会议专题

Building A Dictionary on Constituent Structure of Chinese Compounds

This paper presents an approach of building a contemporary Chinese dictionary on constituent structure of compounds. Manual tagging and three automatic tagging methods, including bi-direction parallel analogy, paired parallel analogy and inferring based on the consistency between form and meaning, are used. More than 40,000 words of all the 54000 bi-syllabic words in Hownet are fully or half semantically tagged by these three automatic methods. In the process of manual tagging, the difficulties and corresponding solving schemes are also presented in this paper. After manual and automatic tagging, two methods are used for checking their results. First, several heuristic rules are used on manual tagging results to mine abnormal tagging. Second, the manual and automatic tagging results are comparing with each other and the inconsistent tagging results are also considered as abnormal tagging. All abnormal tagging results are reserved for further checking.

Semantic Category Grammatical Category Grammatical Structure Bi-direction Parallel Analogy Paired Parallel Analogy

Likun Qiu Xiaoqiao Zhang Ling Mao

Institute of Artificial Intelligence, Beijing City University, Beijing 100083Department of Chinese L Institute of Artificial Intelligence, Beijing City University, Beijing 100083

国际会议

International Conference on Natural Language Processing and Knowledge Engineering(IEEE自然语言处理与知识工程国际会议 IEEE NLP-KE 2009)

大连

英文

1-8

2009-09-24(万方平台首次上网日期,不代表论文的发表时间)