会议专题

Research on Summary Sentences Extraction Oriented to Live Sports Text

  In order to enable automatic generation of sports news,in this paper,we propose an extraction method to extract summary sentences from live sports text.After analyzing the characteristics of live sports text,we regard extraction of summary sentence as the sequence tagging problem,and decide to use Con-ditional Random Fields(CRFs)as the extraction model.Firstly,we expend the correlated words of keywords using word2vec.Then,we select positive correlated words,negative correlated words,time and the window of score changes as features to train the model and extract summary sentences.This method get good results on the evaluation indicators of ROUGE-1,GOUGE-2 and ROUGE-SU4.And it shows that this method has a meaningful influence on automatic summarization and automatic generation of sports news.

Sports News Live Sports Text Conditional Random Fields Word2vec.

Liya Zhu Wenchao Wang Yujing Chen Xueqiang Lv Jianshe Zhou

Beijing Key Laboratory of Internet Culture and Digital Dissemination Research,Beijing Information Sc Beijing Advanced Innovation Center for Imaging Technology,Beijing,China

国际会议

第五届自然语言处理与中文计算会议(NLPCC-ICCPOL2016)

昆明

英文

1-10

2016-12-02(万方平台首次上网日期,不代表论文的发表时间)