会议专题

Entity Disambiguation Algorithm for Literature in Biomedical Field

  Based on the requirements of knowledge learning and application in the domain of biomedical,a kind of entity disambiguation algorithm is proposed to solve the problem of entity ambiguity.Entity disambiguation is usually divided into two parts: candidate generation and entity disambiguation.In this paper,candidates of name mention are generated based on the knowledge base method and candidate entities are filtered based on the rule in the candidate generation stage,which ensures the recall rate of the candidate entity set and reduces the computational complexity and noise of the disambiguation stage effectively.In the stage of entity disambiguation,we propose a kind of entity disambiguation method based on probability model,estimating the probability that an entity becomes the target entity through the language model and selecting the entity with the highest probability as the target entity.The result of the method proposed in this paper shows the accuracy rate is 83%,higher than that of other algorithms.The method of entity disambiguation proposed in this paper is the best in the field of biomedical.

Domain literature Entity disambiguation Contextual characteristics Probability model Language model

Jing Wang Jianzhuo Yan Ruying Lv

Faculty of Information Technology,Beijing University of Technology

国际会议

2017 6th International Conference on Advanced Materials and Computer Science (ICAMCS 2017) 2017年第六届先进材料与计算机科学国际会议(ICAMCS 2017)

郑州

英文

1-10

2017-04-29(万方平台首次上网日期,不代表论文的发表时间)