A New Probabilistic Model for Bayes Document Classification
In this paper,we propose a new probabilistic model of naive Bayes method which can be used in text classification.This method not only takes account of the frequency of feature words,but also considers those important words do not appear in the test document,which overcomes the shortcoming of the Multi-variate Bernoulli event Model(MBM) and Multinomial event Model(MM).Experiments show that the method proposed in this paper has better classification result than those traditional methods.
na(i)ve Bayes text classification Multi-variate Bernoulli event Model Multinomial event Model
Ya-Shu Liu Han-Bing Yan
Department of Computer Science Beijing University of Civil Engineering and Architecture Beijing,P.R. National Institute of Network and Information Security National Computer network Emergency Response
国际会议
杭州
英文
1240-1243
2013-03-22(万方平台首次上网日期,不代表论文的发表时间)