Conference Topic

Improving First Order Temporal Fact Extraction with Unreliable Data

  In this paper, we deal with the task of extracting first order temporal facts from free text. This task is a subtask of relation extraction and aims at extracting relations between entities and time. Currently, the field of relation extraction mainly focuses on extracting relations between entities. However, we observe that the multi-granular nature of time expressions can help us divide the dataset constructed by distant supervision into reliable and less reliable subsets, which can help improve the extraction results on relations between entities and time. We accordingly contribute the first dataset focusing on the first order temporal fact extraction task using distant supervision. To fully utilize both the reliable and the less reliable data, we propose to use curriculum learning to rearrange the training procedure, label dropout to make the model more conservative about less reliable data, and instance attention to help the model distinguish important instances from unimportant ones. Experiments show that these methods help the model outperform both the model trained purely on the reliable dataset and the model trained on the dataset where all subsets are mixed together.

temporal fact extraction; distant supervision; knowledge base

Bingfeng Luo, Yansong Feng, Zheng Wang, Dongyan Zhao

Institute of Computer Science and Technology, Peking University, P.R. China; School of Computing and Communications, Lancaster University, UK

International conference

The 5th Conference on Natural Language Processing and Chinese Computing (NLPCC-ICCPOL 2016)

Kunming

English

1-12

2016-12-02 (date first posted online on the Wanfang platform; does not represent the paper's publication date)