会议专题

A WORD-BASED APPROACH FOR DUPLICATE PICTURE IN PICTURE SEQUENCE DETECTION

A novel word-based algorithm is presented to detect duplicate Picture in Picture (PiP) video sequences in this study. The conventional edgebased methods used to extract the PiP regions are not robust in the noise and blurring images. Bag of Words (BOW) model emphasizes words ambiguity and ignores spatial information. Without detecting the PiP regions and unlike the traditional word based approach, the algorithm grasps the information of visual spatial transformation via exploring the attributes of local matching keypoint pairs. The pairs are generated by directly comparing the visual words. Finally, the impact of the words representations is discussed thoroughly, such as words size, diversity and weighting. The experiment is conducted in the TRECVID 2010 content-based copy detection developing database and F-measure is up to 94%. From the results, the algorithm is effective and efficient for the PiP video sequence detection.

duplicate picture in picture video copy detection key frame retrieval spatial verification visual word vocabulary SIFT

Lezi Wang Yuan Dong Hongliang Bai Wei Liu Kun Tao

Beijing University of Posts and Telecommunications, Beijing 100876, China France Telecom Research & Development – Beijing 100080, China

国际会议

2011 4th IEEE International Conference on Broadband Network & Multimedia Technology(第四届IEEE宽带网络与多媒体国际会议 4th IEEE IC-BNMT2011)

深圳

英文

286-290

2011-10-28(万方平台首次上网日期,不代表论文的发表时间)