Imputation of incomplete data using adaptive ellipsoids with linear regression

被引:3
|
作者
Yao, Leehter [1 ]
Weng, Kuei-Sung [1 ]
机构
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
关键词
Incomplete data; fuzzy clustering; particle swarm optimization; Gustafson-Kessel algorithm; linear regression; MISSING VALUE ESTIMATION; CLUSTER SUBSTRUCTURE; VALUES; CLASSIFICATION; ALGORITHM;
D O I
10.3233/IFS-151592
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An efficient scheme for imputing the features missing of incomplete data is proposed in this paper. The missing features are imputed based on a group of nearest complete data in the space of residual features of the incomplete data to be recovered. In order to find the complete data points in the space of residual features, an algorithm called the evolutionary Gustafson-Kessel algorithm (EGKA) is proposed that learns the ellipsoid to adaptively cluster the complete data points with the recovered incomplete data points. A linear regression model is utilized to impute the missing features based on the complete data clustered by the ellipsoid learned by the EGKA.
引用
收藏
页码:253 / 265
页数:13
相关论文
共 50 条
  • [1] On robust linear regression with incomplete data
    Atkinson, AC
    Cheng, TC
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 33 (04) : 361 - 380
  • [2] Handling Incomplete Data Using Evolution of Imputation Methods
    Zawistowski, Pawel
    Grzenda, Maciej
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 22 - +
  • [3] On One Modification of Linear Regression Estimation Algorithm Using Ellipsoids
    Salnikov, N. N.
    [J]. JOURNAL OF AUTOMATION AND INFORMATION SCIENCES, 2012, 44 (03) : 15 - 32
  • [4] MICROARRAY MISSING DATA IMPUTATION USING REGRESSION
    Bayrak, Tuncay
    Ogul, Hasan
    [J]. 2017 13TH IASTED INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING (BIOMED), 2017, : 68 - 73
  • [5] Fuzzy Clustering and Nonlinear Regression Imputation for Incomplete Data of Tunnel Boring Machine
    Wang, Yitang
    Pang, Yong
    Zhang, Liyong
    Shi, Yanjun
    Sun, Wei
    Song, Xueguan
    [J]. Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2023, 59 (12): : 28 - 37
  • [6] Linear regression for bivariate censored data via multiple imputation
    Pan, W
    Kooperberg, C
    [J]. STATISTICS IN MEDICINE, 1999, 18 (22) : 3111 - 3121
  • [7] Imputation Methods for Incomplete Data
    Umathe, Vaishali H.
    Chaudhary, Gauri
    [J]. 2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [8] A multiple imputation approach to linear regression with clustered censored data
    Pan, W
    Connett, JE
    [J]. LIFETIME DATA ANALYSIS, 2001, 7 (02) : 111 - 123
  • [9] A Multiple Imputation Approach to Linear Regression with Clustered Censored Data
    Wei Pan
    John E. Connett
    [J]. Lifetime Data Analysis, 2001, 7 : 111 - 123