Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set

被引:34
|
作者
Xu, Jiucheng [1 ,2 ]
Yuan, Meng [1 ,2 ]
Ma, Yuanyuan [1 ,2 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, 46 Jianshe East Rd, Xinxiang 453007, Henan, Peoples R China
[2] Engn Technol Res Ctr Comp Intelligence & Data Min, Xinxiang 453007, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Fuzzy neighborhood rough set; Feature selection; Self-information; Fuzzy neighborhood joint entropy; Uncertainty measure; KOLMOGOROV-SMIRNOV TEST; ATTRIBUTE REDUCTION; GRANULATION; DECISIONS; KNOWLEDGE; ALGORITHM; SYSTEMS;
D O I
10.1007/s40747-021-00356-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection based on the fuzzy neighborhood rough set model (FNRS) is highly popular in data mining. However, the dependent function of FNRS only considers the information present in the lower approximation of the decision while ignoring the information present in the upper approximation of the decision. This construction method may lead to the loss of some information. To solve this problem, this paper proposes a fuzzy neighborhood joint entropy model based on fuzzy neighborhood self-information measure (FNSIJE) and applies it to feature selection. First, to construct four uncertain fuzzy neighborhood self-information measures of decision variables, the concept of self-information is introduced into the upper and lower approximations of FNRS from the algebra view. The relationships between these measures and their properties are discussed in detail. It is found that the fourth measure, named tolerance fuzzy neighborhood self-information, has better classification performance. Second, an uncertainty measure based on the fuzzy neighborhood joint entropy has been proposed from the information view. Inspired by both algebra and information views, the FNSIJE is proposed. Third, the K-S test is used to delete features with weak distinguishing performance, which reduces the dimensionality of high-dimensional gene datasets, thereby reducing the complexity of high-dimensional gene datasets, and then, a forward feature selection algorithm is provided. Experimental results show that compared with related methods, the presented model can select less important features and have a higher classification accuracy.
引用
收藏
页码:287 / 305
页数:19
相关论文
共 50 条
  • [1] Feature selection using self-information and entropy-based uncertainty measure for fuzzy neighborhood rough set
    Jiucheng Xu
    Meng Yuan
    Yuanyuan Ma
    Complex & Intelligent Systems, 2022, 8 : 287 - 305
  • [2] Feature Selection Using Fuzzy Neighborhood Entropy-Based Uncertainty Measures for Fuzzy Neighborhood Multigranulation Rough Sets
    Sun, Lin
    Wang, Lanying
    Ding, Weiping
    Qian, Yuhua
    Xu, Jiucheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (01) : 19 - 33
  • [3] Feature selection using self-information uncertainty measures in neighborhood information systems
    Jiucheng Xu
    Kanglin Qu
    Yuanhao Sun
    Jie Yang
    Applied Intelligence, 2023, 53 : 4524 - 4540
  • [4] Online group streaming feature selection using entropy-based uncertainty measures for fuzzy neighborhood rough sets
    Jiucheng Xu
    Yuanhao Sun
    Kanglin Qu
    Xiangru Meng
    Qinchen Hou
    Complex & Intelligent Systems, 2022, 8 : 5309 - 5328
  • [5] Online group streaming feature selection using entropy-based uncertainty measures for fuzzy neighborhood rough sets
    Xu, Jiucheng
    Sun, Yuanhao
    Qu, Kanglin
    Meng, Xiangru
    Hou, Qinchen
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 5309 - 5328
  • [6] Feature selection using self-information uncertainty measures in neighborhood information systems
    Xu, Jiucheng
    Qu, Kanglin
    Sun, Yuanhao
    Yang, Jie
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4524 - 4540
  • [7] Feature Selection Based on Neighborhood Self-Information
    Wang, Changzhong
    Huang, Yang
    Shao, Mingwen
    Hu, Qinghua
    Chen, Degang
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (09) : 4031 - 4042
  • [8] A hybrid feature gene selection method based on fuzzy neighborhood rough set with information entropy
    Chen, Tao
    Hong, Zenglin
    Deng, Fang-An
    Cui, Man
    International Journal of Signal Processing, Image Processing and Pattern Recognition, 2014, 7 (06) : 95 - 110
  • [9] Uncertainty measures and feature selection based on composite entropy for generalized multigranulation fuzzy neighborhood rough set
    Zhang, Xiaoyan
    Zhao, Weicheng
    FUZZY SETS AND SYSTEMS, 2024, 486
  • [10] Granularity self-information based uncertainty measure for feature selection and robust classification
    An, Shuang
    Xiao, Qijin
    Wang, Changzhong
    Zhao, Suyun
    FUZZY SETS AND SYSTEMS, 2023, 470