会议专题

Collaborative Filtering on Skewed Datasets

Many real life datasets have skewed distributions of events when the probability of observing few events far exceeds the others. This paper, we observed that in skewed datasets the state of the art collaborative filtering methods perform worse than a simple probabilistic model. Our test bench includes a real ad click stream dataset which is naturally skewed. The same conclusion obtained even from the popular movie rating dataset when we pose a binary prediction problem of whether a user will give maximum rating to a movie or not.

Collaborative filtering skewed dataset pLSA.

Somnath Banerjee Krishnan Ramanathan

Hewlett-Packard Labs Bangalore, India

国际会议

第十七届国际万维网大会(the 17th International World Wide Web Conference)(WWW08)

北京

英文

2008-04-21(万方平台首次上网日期,不代表论文的发表时间)