Feature selection using Information Gain and decision information in neighborhood decision system

Cited by: 20
Authors
Qu, Kanglin [1, 2]
Xu, Jiucheng [1, 2]
Hou, Qincheng [1, 2]
Qu, Kangjian [3]
Sun, Yuanhao [1, 2]
Affiliations
[1] Henan Normal Univ, Engn Technol Res Ctr Comp Intelligence & Data Min, Xinxiang 453007, Peoples R China
[2] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Peoples R China
[3] Nanjing Inst Technol, Coll Comp Engn, Nanjing 210000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Neighborhood rough set; Entropy measures; Information Gain; Nonmonotonic algorithm; Feature selection; GENE SELECTION; REDUCTION; CLASSIFIER; ALGORITHM; ENTROPY;
DOI
10.1016/j.asoc.2023.110100
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature selection is an important preprocessing technique in data mining: by eliminating redundant features, it shrinks the feature space and can improve classification accuracy. Because traditional feature selection algorithms suffer from high time complexity and low classification accuracy, an effective algorithm using Information Gain and decision information is designed. The algorithm first uses Information Gain to perform preliminary dimensionality reduction on high-dimensional datasets, and then uses the decision information as a feature evaluation function to select informative features. First, the concept of a joint information granule is defined, and neighborhood information entropy measures are proposed on the basis of it. The relationships among these measures are also studied, which helps in analyzing the uncertainty in data. Second, a nonmonotonic algorithm using the decision information from the neighborhood information entropy measures is proposed to overcome the shortcomings of algorithms based on monotonic evaluation functions, thereby improving classification accuracy. Third, to reduce the time cost of the designed algorithm on high-dimensional datasets, Information Gain is introduced to preliminarily eliminate irrelevant features. Finally, ablation and comparison experiments on twelve public datasets demonstrate the low time cost and high classification accuracy of the algorithm, respectively. (c) 2023 Elsevier B.V. All rights reserved.
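The Information Gain pre-filtering step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes features have already been discretized, uses the textbook definition IG(D; A) = H(D) - H(D | A), and simply keeps the k highest-ranked features. The function names, the toy data, and the choice of k are hypothetical, and the neighborhood-based decision information stage is not shown because the abstract does not define it.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of discrete labels."""
    n = len(labels)
    counts = Counter(labels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def information_gain(feature, labels):
    """IG(D; A) = H(D) - H(D | A) for one discrete feature column."""
    n = len(labels)
    groups = defaultdict(list)
    for value, label in zip(feature, labels):
        groups[value].append(label)
    # Conditional entropy: weighted entropy of labels within each feature value.
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def prefilter_by_information_gain(columns, labels, k):
    """Return the indices of the k columns with the highest gain, plus all gains.

    Assumption: a fixed top-k cutoff; the paper may use a different rule.
    """
    gains = [information_gain(col, labels) for col in columns]
    ranked = sorted(range(len(columns)), key=lambda i: gains[i], reverse=True)
    return ranked[:k], gains


# Hypothetical toy data: three discretized feature columns, four samples.
labels = [0, 0, 1, 1]
columns = [
    [0, 0, 1, 1],  # determines the label exactly
    [0, 1, 0, 1],  # independent of the label
    [0, 0, 0, 1],  # partially informative
]
selected, gains = prefilter_by_information_gain(columns, labels, k=2)
print(selected, [round(g, 4) for g in gains])
```

On this toy data the uninformative second column is discarded, so a more expensive evaluation function would only need to examine the remaining two features.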
Pages: 15