Finding the Storyteller: Automatic Spoiler Tagging using Linguistic Cues
Given a movie comment, does it contain a spoiler? A spoiler is a comment that, when disclosed, would ruin a surprise or reveal an important plot detail. We study automatic methods to detect comments and reviews that contain spoilers and apply them to reviews from the IMDB (Internet Movie Database) website. We develop topic models, based on Latent Dirichlet Allocation (LDA), but using linguistic dependency information in place of simple features from bag of words (BOW) representations. Experimental results demonstrate the effectiveness of our technique over four movie-comment datasets of different scales.
Sheng Guo Naren Ramakrishnan
Department of Computer ScienceVirginia Tech Department of Computer Science Virginia Tech
国际会议
The 23rd International Conference on Computational Linguistics(第23届国际计算语言学大会)
北京
英文
412-420
2010-08-01(万方平台首次上网日期,不代表论文的发表时间)