Creating Visual Vocabulary Based on SIFT Descriptor In Compressed Domain

摘要：

Recently bag-of-words (BoW) model having been widely used in textual information processing has been extended into many tasks in visual domain such as image classification, scene analysis, image annotation and image retrieval, namely bag-of-visual-words (BoVW) model. Therefore, it is essential to create an effective visual vocabulary. Most of existing approaches create visual vocabularies from image in pixel domain, which requires extra processing time for decompressed images, since most images are stored in compressed format. In this paper we propose to create a visual vocabulary based on Scale Invariant Feature Transform (SIFT) descriptor in compressed domain with the following three steps, (1) constructing low-resolution images in compressed domain; (2) extracting SIFT descriptor from lowresolution images; and (3) creating a visual vocabulary based on extracted SIFT descriptors. In order to evaluate the performance of the visual words, experiments have been conducted on identifying pornographic images. Experimental results indicate that the proposed method can recognize pornographic images accurately with much reduced computational time.

关键词： bag-of-words visual words SIFT descriptor compressed domian image recognition

作者: Lei Sui Jing Zhang Li Zhuo Yuncong Yang

作者单位: Signal & Information Processing Lab Beijing University of Technology Beijing, China Signal & Information Processing Lab Beijing University of TechnologyBeijing, China

会议类型: 国际会议

会议名称: 2011年无线通信与信号处理国际会议(WCSP 2011)

会议地点: 南京

会议语种:英文

页码: 1-5

在线出版日期: 2011-11-09（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Creating Visual Vocabulary Based on SIFT Descriptor In Compressed Domain