Introducing More Features to Improve Chinese Shift-Reduce Parsing

摘要：

Recent years, shift-reduce parsing has gained popularity for its efficiency, but its performance still has a gap with the state of art parsers. In this paper, we construct a baseline Chinese shift-reduce parser using the common features. According to the error patterns in the baseline system, we explore the use of constituent label features and Lexical Relation Pair (LRP) information. The parser is trained and evaluated on the Tsinghua Chinese Treebank. After using the new features the boundary F-score arises from 85.19％ to 86.25％. Additionally, we investigate the use of LRP extracted from raw text automatically, and the parser can also get a competitive result.

作者: Hongxian Wang Qiang Zhou Liou Chen

作者单位: Center for Speech and Language Technologies, Research Institute of Information Technology, Tsinghua Center for Speech and Language Technologies,Research Institute of Information Technology, Tsinghua U

会议类型: 国际会议

会议名称: 2011亚太信号与信息处理协会年度峰会(APSIPAASC 2011)

会议地点: 西安

会议语种:英文

页码: 1-7

在线出版日期: 2011-10-18（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Introducing More Features to Improve Chinese Shift-Reduce Parsing