Automatic Identification of Non-Anaphoric Anaphora in Spoken Dialog
Identification of non-anaphoric anaphora is an important step towards a full anaphora resolution. In this paper, we present an automatic identification approach for this task. In our work, some novel features are proposed, which are based on dependency grammars, surrounding words and their POS tags. All the features are automatically extracted using a part-of-speech (POS) tagger and a dependency parser. Our experiments are on a commonly available dialogue corpus, Trains-93. Several machine learning algorithms are used in the experiments, including CME, CRF and SVM. Results show that compared to the approaches used in the previous work, our algorithm is simpler and achieves a higher accuracy.
Spoken dialog anaphora resolution non-anaphoric anaphora identification
Zhongchao FEI Xuanjing HUANG Fuliang WENG
Department of Computer Science,Fudan University Shanghai,China Research and Technology Center,Robert Bosch LLC Palo Alto,CA,United States
国际会议
北京
英文
2008-10-19(万方平台首次上网日期,不代表论文的发表时间)