Dealing with Missing Data using a Selection Algorithm on Rough Sets

被引:0
|
作者
Jonathan Prieto-Cubides
Camilo Argoty
机构
[1] Universidad EAFIT,
[2] Grupo de Investigación Pensamiento,undefined
[3] Universidad Sergio Arboleda,undefined
[4] Universidad Militar Nueva Granada,undefined
关键词
Categorical; Imputation; Missing Values; Rough Sets;
D O I
暂无
中图分类号
学科分类号
摘要
This paper discusses the so-called missing data problem, i.e. the problem of imputing missing values in information systems. A new algorithm, called the ARSI algorithm, is proposed to address the imputation problem of missing values on categorical databases using the framework of rough set theory. This algorithm can be seen as a refinement of the ROUSTIDA algorithm and combines the approach of a generalized non-symmetric similarity relation with a generalized discernibility matrix to predict the missing values on incomplete information systems. Computational experiments show that the proposed algorithm is as efficient and competitive as other imputation algorithms.
引用
收藏
页码:1307 / 1321
页数:14
相关论文
共 50 条
  • [41] A simple reduction analysis and algorithm using rough sets
    Xu, Ning
    Zhang, Yun
    Yu, Yongquan
    [J]. ROUGH SETS AND INTELLIGENT SYSTEMS PARADIGMS, PROCEEDINGS, 2007, 4585 : 332 - +
  • [42] A Data Preprocessing Algorithm for Classification Model Based On Rough Sets
    Li Xiang-wei
    Qi Yian-fang
    [J]. INTERNATIONAL CONFERENCE ON SOLID STATE DEVICES AND MATERIALS SCIENCE, 2012, 25 : 2025 - 2029
  • [43] An Improved Data Discretization Algorithm based on Rough Sets Theory
    Liu, Han
    Jiang, Chunyu
    Wang, Miaoqiong
    Wei, Kai
    Yan, Shu
    [J]. 2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 1432 - 1437
  • [44] Granule-specific feature selection for continuous data classification using neighborhood rough sets
    Sewwandi, Mahawaga Arachchige Nayomi Dulanjala
    Li, Yuefeng
    Zhang, Jinglan
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [45] A Conservative Feature Subset Selection Algorithm with Missing Data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 725 - 730
  • [46] A conservative feature subset selection algorithm with missing data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    [J]. NEUROCOMPUTING, 2010, 73 (4-6) : 585 - 590
  • [47] The Method of Dealing with Uncertainty Knowledge Based on the Rough Sets
    Wang, Quan-Rui
    Zhu, Yan-Li
    Yan, Shi-Tao
    [J]. 2010 INTERNATIONAL CONFERENCE ON THE DEVELOPMENT OF EDUCATIONAL SCIENCE AND COMPUTER TECHNOLOGY, 2010, : 358 - 361
  • [48] DEALING WITH LARGE DATA SETS
    GRAEFE, JF
    WOOD, RW
    [J]. NEUROTOXICOLOGY AND TERATOLOGY, 1990, 12 (05) : 449 - 454
  • [49] ATTRIBUTE SELECTION USING ROUGH SETS IN SOFTWARE QUALITY CLASSIFICATION
    Khoshgoftaar, Taghi M.
    Bullard, Lofton A.
    Gao, Kehan
    [J]. INTERNATIONAL JOURNAL OF RELIABILITY QUALITY AND SAFETY ENGINEERING, 2009, 16 (01) : 73 - 89
  • [50] Attribute Selection Using Rough Sets in Software Quality Classification
    Khoshgoftaar, Taghi M.
    Bullard, Lofton A.
    Gao, Kehan
    [J]. 14TH ISSAT INTERNATIONAL CONFERENCE ON RELIABILITY AND QUALITY IN DESIGN, PROCEEDINGS, 2008, : 146 - +