Video Semantic Concept Detection Based on MultiModality Fusion

Abstract:

Multiple kernel learning methods are widely applied in visual concept learning, and the BoVW (bag-of-visual-words) method has been widely used due to its excellent categorization performance. However, most canonical multiple kernel learning methods employ a stationary kernel combination format, which assigns uniform kernel weights over the input space. In addition, the time efficiency of the BoVW method decreases as the visual data scales up. As is true of human perception, learning from multiple modalities has become an effective scheme for various information retrieval problems. In this paper, we propose a novel multi-modality fusion approach for video search, in which the search modalities are derived from a diverse set of knowledge sources. The proposed approach explores a large set of predefined semantic concepts to compute multi-modality fusion weights with a new method. Experimental results validate the effectiveness of our approach, which outperforms existing multi-modality fusion methods.

Keywords: component Visual Semantic Concept multi-modality clustering Inter-Class Correlation

Authors: Zhao Jianxun, Wu Bo

Affiliation: Zhongzhou University, Zhengzhou, China, 450044

Conference type: International conference

Conference name: 2012 International Conference on Computer Science and Electronic Engineering (2012 IEEE International Conference on Computer Science and Electronic Engineering, ICCSEE 2012)

Conference location: Hangzhou

Conference language: English

Pages: 334-338

Online publication date: 2012-03-23 (date first made available on the Wanfang platform; not the paper's publication date)
