Multimodal Joint Representation for User Interest Analysis on Content Curation Social Networks

Abstract:

Content curation social networks (CCSNs), where users share interests through images and their text descriptions, are rapidly growing social networks. To fully utilize user-generated content for analyzing user interests on CCSNs, we propose a framework for learning multimodal joint representations of pins for user interest analysis. First, images are automatically annotated with category distributions, which exploit the characteristics of the network and represent the interests of users. Next, image representations are extracted from an intermediate layer of a fine-tuned multilabel convolutional neural network (CNN), and text representations are obtained with a trained Word2Vec model. Finally, a multimodal deep Boltzmann machine (DBM) is trained to fuse the two modalities. Experiments on a dataset from Huaban demonstrate that using category distributions instead of single categories as labels to fine-tune the CNN significantly improves the performance of the image representation, and that multimodal joint representations perform better than either unimodal representation.

Keywords: Multimodal; Content curation social networks; User modeling; Recommender systems

Authors: Lifang Wu, Dai Zhang, Meng Jian, Bowen Yang, Haiying Liu

Affiliation: Faculty of Information Technology, Beijing University of Technology, Beijing, China

Conference type: International conference

Conference name: Chinese Conference on Pattern Recognition and Computer Vision (PRCV2018)

Conference location: Guangzhou

Conference language: English

Pages: 363-374

Online publication date: 2018-11-23 (the date the paper first appeared on the Wanfang platform, not the paper's publication date)
