Research of Error Data Detection Algorithm Based on Rules
Data entry errors, improper integration, changes in the data environment, and similar factors all degrade data quality. Of the resulting problems, error data is the most serious. To clean up error data, let information systems play their intended role, and improve data quality, this paper studies a rule-based method for detecting error data: it analyzes the detection process, establishes a common set of detection rules, discusses how SQL statements are incorporated into the rules, implements the detection algorithm, and applies a series of optimizations. The method is easy to apply, its rules are simple, and after optimization both its efficiency and its error discovery rate are high. The approach may therefore be a good method for data cleaning.
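The abstract gives no implementation details, so the following is only a minimal sketch of what rule-based detection with SQL-backed rules might look like, not the authors' algorithm. The table `person`, its columns, the two rules, and the helper names `detect_errors` and `demo` are all hypothetical and chosen purely for illustration. Each rule is assumed to be a SQL condition that matches the rows violating it.

```python
import sqlite3

# Hypothetical rule set: (rule name, SQL condition matching violating rows).
# Table and column names are illustrative, not taken from the paper.
RULES = [
    ("age_out_of_range", "age < 0 OR age > 150"),
    ("missing_name", "name IS NULL OR TRIM(name) = ''"),
]

def detect_errors(conn, table, key, rules):
    """Return {rule_name: [key values of rows that violate the rule]}."""
    found = {}
    for name, condition in rules:
        # Conditions are assumed to come from a trusted rule base, so they
        # are interpolated directly; do not do this with untrusted input.
        sql = f"SELECT {key} FROM {table} WHERE {condition} ORDER BY {key}"
        found[name] = [row[0] for row in conn.execute(sql)]
    return found

def demo():
    # Build a small in-memory table containing a few deliberate errors.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT, age INTEGER)"
    )
    conn.executemany(
        "INSERT INTO person VALUES (?, ?, ?)",
        [(1, "Li", 34), (2, "", 27), (3, "Wang", -5), (4, None, 200)],
    )
    return detect_errors(conn, "person", "id", RULES)
```

Calling `demo()` runs every rule against the sample table and groups the offending row ids by rule. Keeping each rule as data rather than as code is one plausible reading of a "common set of detection rules": new checks can be added without changing the detection loop.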
data cleaning; data detection; error data
ZHONG-BIN ZHANG; YU-HUA ZHOU; YONG-ZHI LIU
Department of Equipment Command and Management, Academy of the Armored Force Engineer, Beijing, China; Department of Petty Officer, Academy of Equipment Command and Technology, Beijing, China; 96542 Troop, Luoyang, Henan, China
International conference
Xi'an
English
159-163
2011-05-13 (date first posted online on the Wanfang platform; does not represent the paper's publication date)