Research and Implementation of File Format Identification Method Based on Pattern Matching
According to the characteristic of file format signature and the file recognition process, this paper improved BM algorithm and apply it to achieve file format identification. BM algorithm has higher efficiency in practical applications, but number of comparisons would be too much in application of file format identification, lead to reducing efficiency. We proposed to improve by adding two index tables. The improved algorithm mainly for the following cases in pattern matching: Find one pattern string in a text string;the number of the strings in pattern strings set more, and most pattern strings at the beginning of the text string. Compared with the BM algorithm, improved algorithm is more effective due to fewer comparisons between text string and strings of pattern strings set, and satisfied the requirement of file format identification.
pattern matching BM algorithm file format signature file format identification
Tang Wenzhong Sun Qiong
Beijing Key Laboratory of Network Technology, School of Computer Science and Engineering, Beihang University, Beijing, China
国际会议
2010 International Conference on Future Information Technology(2010年未来信息技术国际会议 ICFIT 2010)
长沙
英文
1095-1099
2010-12-14(万方平台首次上网日期,不代表论文的发表时间)