Semi-parametric optimization for missing data imputation

被引:85
|
作者
Qin, Yongsong [1 ]
Zhang, Shichao [1 ]
Zhu, Xiaofeng [1 ]
Zhang, Jilian [1 ]
Zhang, Chengqi [1 ]
机构
[1] Beijing Univ, Sch Automat, Beijing, Peoples R China
基金
澳大利亚研究理事会;
关键词
missing data; missing data imputation; semi-parametric data;
D O I
10.1007/s10489-006-0032-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing data imputation is an important issue in machine learning and data mining. In this paper, we propose a new and efficient imputation method for a kind of missing data: semi-parametric data. Our imputation method aims at making an optimal evaluation about Root Mean Square Error (RMSE), distribution function and quantile after missing-data are imputed. We evaluate our approaches using both simulated data and real data experimentally, and demonstrate that our stochastic semi-parametric regression imputation is much better than existing deterministic semi-parametric regression imputation in efficiency and effectiveness.
引用
收藏
页码:79 / 88
页数:10
相关论文
共 50 条
  • [21] Bayesian modeling and optimization based on semi-parametric hierarchy
    Chen X.
    Wang J.
    Yang S.
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (05): : 1580 - 1588
  • [22] Semi-parametric regression when some (expensive) covariates are missing by design
    Göran Kauermann
    Mehboob Ali
    [J]. Statistical Papers, 2021, 62 : 1675 - 1696
  • [23] Semi-parametric regression when some (expensive) covariates are missing by design
    Kauermann, Goeran
    Ali, Mehboob
    [J]. STATISTICAL PAPERS, 2021, 62 (04) : 1675 - 1696
  • [24] Semi-parametric methods of handling missing data in mortal cohorts under non-ignorable missingness
    Wen, Lan
    Seaman, Shaun R.
    [J]. BIOMETRICS, 2018, 74 (04) : 1427 - 1437
  • [25] ESTIMATED NON-PARAMETRIC AND SEMI-PARAMETRIC MODEL FOR LONGITUDINAL DATA
    AL-Adilee, Reem Tallal Kamil
    Aboudi, Emad Hazim
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2021, 17 : 1963 - 1972
  • [26] Semi-parametric and Parametric Inference of Extreme Value Models for Rainfall Data
    AghaKouchak, Amir
    Nasrollahi, Nasrin
    [J]. WATER RESOURCES MANAGEMENT, 2010, 24 (06) : 1229 - 1249
  • [27] Semi-parametric and Parametric Inference of Extreme Value Models for Rainfall Data
    Amir AghaKouchak
    Nasrin Nasrollahi
    [J]. Water Resources Management, 2010, 24 : 1229 - 1249
  • [28] Learning from Biased Data: A Semi-Parametric Approach
    Bertail, Patrice
    Clemencon, Stephan
    Guyonvarch, Yannick
    Noiry, Nathan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [29] SEMI-PARAMETRIC INFERENCE FOR COPULA MODELS FOR TRUNCATED DATA
    Emura, Takeshi
    Wang, Weijing
    Hung, Hui-Nien
    [J]. STATISTICA SINICA, 2011, 21 (01) : 349 - 367
  • [30] Semi-Parametric Models for Negative Binomial Panel Data
    Sutradhar B.C.
    Jowaheer V.
    Rao R.P.
    [J]. Sankhya A, 2016, 78 (2): : 269 - 303