Dealing with Missing Data using a Selection Algorithm on Rough Sets

被引:0
|
作者
Jonathan Prieto-Cubides
Camilo Argoty
机构
[1] Universidad EAFIT,
[2] Grupo de Investigación Pensamiento,undefined
[3] Universidad Sergio Arboleda,undefined
[4] Universidad Militar Nueva Granada,undefined
关键词
Categorical; Imputation; Missing Values; Rough Sets;
D O I
暂无
中图分类号
学科分类号
摘要
This paper discusses the so-called missing data problem, i.e. the problem of imputing missing values in information systems. A new algorithm, called the ARSI algorithm, is proposed to address the imputation problem of missing values on categorical databases using the framework of rough set theory. This algorithm can be seen as a refinement of the ROUSTIDA algorithm and combines the approach of a generalized non-symmetric similarity relation with a generalized discernibility matrix to predict the missing values on incomplete information systems. Computational experiments show that the proposed algorithm is as efficient and competitive as other imputation algorithms.
引用
收藏
页码:1307 / 1321
页数:14
相关论文
共 50 条
  • [31] A Novel Approach for Feature Selection using Rough Sets
    Yadav, Nidhika
    Chatterjee, Niladri
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND ELECTRONICS (COMPTELIX), 2017, : 195 - 199
  • [32] Online streaming feature selection using rough sets
    Eskandari, S.
    Javidi, M. M.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2016, 69 : 35 - 57
  • [33] Feature selection for imbalanced data based on neighborhood rough sets
    Chen, Hongmei
    Li, Tianrui
    Fan, Xin
    Luo, Chuan
    [J]. INFORMATION SCIENCES, 2019, 483 : 1 - 20
  • [34] Innovations in dealing with missing data or missing reports
    Meng, Xiao-Li
    [J]. STATISTICA SINICA, 2006, 16 (04) : 1061 - 1070
  • [35] Estimation of Missing Values in Incomplete Industrial Process Data Sets Using ECM Algorithm
    Pirehgalin, Mina Fahimi
    Vogel-Heuser, Birgit
    [J]. 2018 IEEE 16TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2018, : 245 - 251
  • [36] An improved runner-root algorithm for solving feature selection problems based on rough sets and neighborhood rough sets
    Ibrahim, Rehab Ali
    Abd Elaziz, Mohamed
    Oliva, Diego
    Lu, Songfeng
    [J]. APPLIED SOFT COMPUTING, 2020, 97
  • [37] Dealing with deficient and missing data
    Dohoo, Ian R.
    [J]. PREVENTIVE VETERINARY MEDICINE, 2015, 122 (1-2) : 221 - 228
  • [38] Efficient feature selection and classification algorithm based on PSO and rough sets
    Huda, Ramesh Kumar
    Banka, Haider
    [J]. NEURAL COMPUTING & APPLICATIONS, 2019, 31 (08): : 4287 - 4303
  • [39] Efficient feature selection and classification algorithm based on PSO and rough sets
    Ramesh Kumar Huda
    Haider Banka
    [J]. Neural Computing and Applications, 2019, 31 : 4287 - 4303
  • [40] Model Selection Criteria for Missing-Data Problems Using the EM Algorithm
    Ibrahim, Joseph G.
    Zhu, Hongtu
    Tang, Niansheng
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2008, 103 (484) : 1648 - 1658