会议专题

Reinforcement Learning for Named Entity Recognition from Noisy Data

  Named entity recognition(NER)is an important task in nat-ural language processing,and is often formalized as a sequence labeling problem.Deep learning becomes the state-of-the-art approach for NER,but the lack of high-quality labeled data remains the bottleneck for model performance.To solve the problem,we employ the distant supervision technique to obtain noisy labeled data,and propose a novel model based on reinforcement learning to revise the wrong labels and distill high-quality data for learning.Specifically,our model consists of two mod-ules,a Tag Modifier and a Tag Predictor.The Tag Modifier corrects the wrong tags with reinforcement learning and feeds the corrected tags into the Tag Predictor.The Tag Predictor makes the sentence-level predic-tion and returns rewards to the Tag Modifier.Two modules are trained jointly to optimize tag correction and prediction processes.Experiment results show that our model can effectively deal with noises with a small number of correctly labeled data and thus outperform state-of-the-art baselines.

Named entity recognition Noisy data Reinforcement learning

Jing Wan Haoming Li Lei Hou Juaizi Li

Beijing University of Chemical Technology,Beijing,China Department of Computer Science and Technology,BNRist,Beijing,China;KIRC,Institute for Artificial Int

国际会议

9th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2020)

郑州

英文

333-345

2020-10-14(万方平台首次上网日期,不代表论文的发表时间)