ADAPTIVE-CLUSTERING BASED METHOD TO ESTIMATE NULL VALUES IN RELATIONAL DATABASES

被引：0

作者：

Cheng, Ching-Hsue ^{[2
]}

Chang, Jing-Rong ^{[1
]}

Wei, Liang-Ying ^{[3
]}

机构：

[1] Chaoyang Univ Technol, Dept Informat Management, Wufong Township 41349, Taichung County, Taiwan

[2] Natl Yunlin Univ Sci & Technol, Dept Informat Management, Touliu 640, Yunlin, Taiwan

[3] Yuanpei Univ, Dept Informat Management, Hsinchu 30015, Taiwan

来源：

INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2011年 / 7卷 / 01期

关键词：

Relational database systems; Null value; Degree of influential; K-means; Adaptive learning; FUZZY RULES; SYSTEMS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data preprocessing is an essential step of knowledge discovery. Data preprocessing comprises data cleaning, data integration, data transformation, data reduction and data discretization. Estimating null values is a task of data cleaning. Null values in a database are significant sources of poor data quality. Therefore, the appropriate handling of null values is an important task of data preprocessing in relational databases. We propose a new method that uses adaptive learning techniques, based on clustering, to resolve the issue of null values in relational database systems. This study uses clustering algorithms to group data and calculates the degree of influence between independent attributes (variables) and the dependent attribute through an adaptive learning method (the best adaptive parameter can be obtained by the minimum average error rate). Three databases (a human resource database, Waugh's database and a government salary study database) were selected as the experimental data to compare the mean absolute error rate (MAER) of the proposed algorithm with the other methods. The results demonstrate that the proposed method outperforms other methods.

引用

页码：223 / 235

页数：13

共 50 条

[1] A new method to estimate null values in relational database systems based on automatic clustering techniques
Chen, SM
Hsiao, HR
INFORMATION SCIENCES, 2005, 169 (1-2) : 47 - 69
[2] An efficient method for estimating null values in relational databases
Jia-Wen Wang
Ching-Hsue Cheng
Knowledge and Information Systems, 2007, 12 : 379 - 394
[3] An efficient method for estimating null values in relational databases
Wang, Jia-Wen
Cheng, Ching-Hsue
KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 12 (03) : 379 - 394
[4] NULL VALUES IN NESTED RELATIONAL DATABASES
ROTH, MA
KORTH, HF
SILBERSCHATZ, A
ACTA INFORMATICA, 1989, 26 (07) : 615 - 642
[5] NULL VALUES IN NESTED RELATIONAL DATABASES - CORRECTION
LEVENE, M
LOIZOU, G
ACTA INFORMATICA, 1991, 28 (06) : 603 - 605
[6] NULL VALUES IN NESTED RELATIONAL DATABASES - ADDENDUM
ROTH, MA
KORTH, HF
SILBERSCHATZ, A
ACTA INFORMATICA, 1991, 28 (06) : 607 - 610
[7] Estimating null values in the distributed relational databases environment
Chen, SM
Chen, HH
CYBERNETICS AND SYSTEMS, 2000, 31 (08) : 851 - 871
[8] Null values in relational databases and sure information answers
Klein, HJ
SEMANTICS IN DATABASES, 2003, 2582 : 119 - 138
[9] Estimating Null Values in Relational Databases Using Analogical Proportions
Beltran, William Correa
Jaudoin, Helene
Pivert, Olivier
INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, PT III, 2014, 444 : 110 - 119
[10] A SOUND AND SOMETIMES COMPLETE QUERY EVALUATION ALGORITHM FOR RELATIONAL DATABASES WITH NULL VALUES
REITER, R
JOURNAL OF THE ACM, 1986, 33 (02) : 349 - 370

← 1 2 3 4 5 →