会议专题

Violence Detection Based on Spatio-Temporal Feature and Fisher Vector

  A novel framework based on local spatio-temporal features and a Bag-of-Words(BoW)model is proposed for violence detection.The framework utilizes Dense Trajectories(DT)and MPEG flow video descriptor(MF)as feature descriptors and employs Fisher Vector(FV)in feature coding.DT and MF algorithms are more descriptive and robust,because they are combinations of various feature descriptors,which describe trajectory shape,appearance,motion and motion boundary,respectively.FV is applied to transform low level features to high level features.FV method preserves much information,because not only the affiliations of descriptors are found in the codebook,but also the first and second order statistics are used to represent videos.Some tricks,that PCA,K-means++and codebook size,are used to improve the final performance of video classification.In comprehensive consideration of accuracy,speed and application scenarios,the proposed method for violence detection is analysed.Experimental results show that the proposed approach outperforms the state-of-the-art approaches for violence detection in both crowd scenes and non-crowd scenes.

Violence detection Dense Trajectories MPEG flow video descriptor Fisher Vector Linear support vector machine

Huangkai Cai He Jiang Xiaolin Huang Jie Yang Xiangjian He

Institution of Image Processing and Pattern Recognition,Shanghai Jiao Tong University,Shanghai,China School of Electrical and Data Engineering,University of Technology Sydney,Ultimo,Australia

国际会议

中国模式识别与计算机视觉大会(PRCV2018)

广州

英文

180-190

2018-11-23(万方平台首次上网日期,不代表论文的发表时间)