EXTRACTING MULTIMEDIA SEMANTICS BASED ON INDEPENDENT MODALITY DISCOVERING AND FUSION

摘要：

Learning semantics from low-level features of multimedia learning resources enables high-level access to multimedia content. Considerable amount of researches have been focused on multi-modal analysis to detect multimedia semantics.However, two fundamental issues have not been adequately addressed. First, given a set of raw features extracted from multimedia sources, what are the best independent modalities?Second, once a set of modalities has been identified, how are they optimally fused to map to the high-level semantics? In this paper, we apply statistical and machine learning techniques to answer the two questions. ISOMAP combining with support vector clustering are used to discover independent modalities from raw features. Then Maximum Entropy method is applied to optimally fuse the individual modalities. Experiments show that the proposed method can learn multimedia semantics more efficiently than traditional methods.

关键词： Multimedia semantics ISOMAP Support vector clustering Multi-modal fusion

作者: RUO-GUI XIAO YUE-TING ZHUANG FEI WU

作者单位: Institute of Artificial Intelligence of Zhejiang University Zhejiang University, Hangzhou, China, 310027

会议类型: 国际会议

会议名称: 2006 International Conference on Machine Learning and Cybernetics(IEEE第五届机器学习与控制论坛)

会议地点: 大连

会议语种:英文

页码: 2754-2758

在线出版日期: 2006-08-13（万方平台首次上网日期，不代表论文的发表时间）

会议专题

EXTRACTING MULTIMEDIA SEMANTICS BASED ON INDEPENDENT MODALITY DISCOVERING AND FUSION