会议专题

Research of Error Data Detection Algorithm Based on Rules

Data entry errors, improper integration, data environment changes, etc., will affect the quality of the data. Among them, the error data is the most serious data quality problems. To clean up the error data, to play the role of information systems and improve the quality of the data, the detection method of error data based on rules is studied, the detection process is analyzed, a common set of detection rules is established, how the SQL statements into the rules is discussed, the detection algorithm is achieved and carried out a series of optimization. This method is easy, its rules are simple, and the efficiency and the false discovery rate are high after optimization. Therefore, this approach may well be a good method of data cleaning.

data cleaning data Detection error data

ZHONG-BIN ZHANG YU-HUA ZHOU YONG-ZHI LIU

Department of Equipment Command and Management Academy of the Armored Force Engineer Beijing, China Department of Petty Officer Academy of Equipment Command and Technology Beijing, China 96542 Troop Luoyang, Henan, China

国际会议

2011 2nd International Conference on Data Storage and Data Engineering(DSDE 2011)(2011年第二届数据存储与数据工程国际会议)

西安

英文

159-163

2011-05-13(万方平台首次上网日期,不代表论文的发表时间)