ADAPTIVE-CLUSTERING BASED METHOD TO ESTIMATE NULL VALUES IN RELATIONAL DATABASES

被引:0
|
作者
Cheng, Ching-Hsue [2 ]
Chang, Jing-Rong [1 ]
Wei, Liang-Ying [3 ]
机构
[1] Chaoyang Univ Technol, Dept Informat Management, Wufong Township 41349, Taichung County, Taiwan
[2] Natl Yunlin Univ Sci & Technol, Dept Informat Management, Touliu 640, Yunlin, Taiwan
[3] Yuanpei Univ, Dept Informat Management, Hsinchu 30015, Taiwan
关键词
Relational database systems; Null value; Degree of influential; K-means; Adaptive learning; FUZZY RULES; SYSTEMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data preprocessing is an essential step of knowledge discovery. Data preprocessing comprises data cleaning, data integration, data transformation, data reduction and data discretization. Estimating null values is a task of data cleaning. Null values in a database are significant sources of poor data quality. Therefore, the appropriate handling of null values is an important task of data preprocessing in relational databases. We propose a new method that uses adaptive learning techniques, based on clustering, to resolve the issue of null values in relational database systems. This study uses clustering algorithms to group data and calculates the degree of influence between independent attributes (variables) and the dependent attribute through an adaptive learning method (the best adaptive parameter can be obtained by the minimum average error rate). Three databases (a human resource database, Waugh's database and a government salary study database) were selected as the experimental data to compare the mean absolute error rate (MAER) of the proposed algorithm with the other methods. The results demonstrate that the proposed method outperforms other methods.
引用
收藏
页码:223 / 235
页数:13
相关论文
共 50 条
  • [41] A clustering-based feature selection method for automatically generated relational attributes
    Rezaei, Mostafa
    Cribben, Ivor
    Samorani, Michele
    ANNALS OF OPERATIONS RESEARCH, 2021, 303 (1-2) : 233 - 263
  • [42] A clustering-based feature selection method for automatically generated relational attributes
    Mostafa Rezaei
    Ivor Cribben
    Michele Samorani
    Annals of Operations Research, 2021, 303 : 233 - 263
  • [43] An Adaptive Page Clustering Based weighting Method for Information Retrieval
    Lin, Yi-Xian
    Kao, Hung-Yu
    2013 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2013, : 199 - 204
  • [44] Adaptive Clustering Method for Femtocells Based on Soft Frequency Reuse
    Kwon, Young Min
    Choi, Bum-Gon
    Bae, Sueng Jae
    Chung, Min Young
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2011, PT V, 2011, 6786 : 11 - 21
  • [46] Modelling method with missing values based on clustering and support vector regression
    Wang, Ling
    Fu, Dongmei
    Li, Qing
    Mu, Zhichun
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2010, 21 (01) : 142 - 147
  • [47] Energy Based Clustering Method to Estimate Channel Occupation of LTE in Unlicensed Spectrum
    Malafaia, Daniel
    Vieira, Jose
    Tome, Ana
    BIOMEDICAL APPLICATIONS BASED ON NATURAL AND ARTIFICIAL COMPUTING, PT II, 2017, 10338 : 325 - 332
  • [48] Robust Adaptive Null Broadening Method Based on FDA-MIMO Radar
    Ding, Zihang
    Xie, Junwei
    Wang, Bo
    Zhang, Haowei
    IEEE ACCESS, 2020, 8 : 177976 - 177983
  • [49] Adaptive design method based on sum of p-values
    Chang, Mark
    STATISTICS IN MEDICINE, 2007, 26 (14) : 2772 - 2784
  • [50] A method to estimate missing AERONET AOD values based on artificial neural networks
    Olcese, Luis E.
    Palancar, Gustavo G.
    Toselli, Beatriz M.
    ATMOSPHERIC ENVIRONMENT, 2015, 113 : 140 - 150