Three-way selection random forest algorithm based on decision boundary entropy

Cited by: 10
Authors
Zhang, Chunying [1, 2]
Ren, Jing [1]
Liu, Fengchun [3]
Li, Xiaoqi [1]
Liu, Shouyue [1]
Affiliations
[1] North China Univ Sci & Technol, Coll Sci, Tangshan 063210, Hebei, Peoples R China
[2] Key Lab Data Sci & Applicat Hebei Prov, Tangshan 063210, Hebei, Peoples R China
[3] North China Univ Sci & Technol, Coll Qianan, Tangshan 063210, Hebei, Peoples R China
Keywords
Random Forest; Attribute Selection; Decision Boundary Entropy; Significance of Attribute; Three-way Decision; ATTRIBUTES
DOI
10.1007/s10489-021-03033-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Redundant attributes are likely to degrade the results of random forest algorithms. To address this, a Three-way Selection Random Forest algorithm based on decision boundary entropy (TSRF) is proposed, which reduces the influence of redundant attributes on decision results without sacrificing randomness. First, the concept of decision boundary entropy is defined according to the characteristics of the attributes, and an attribute importance measure based on it is proposed and used as the evaluation standard. A three-way decision is then constructed that divides the attributes into three candidate domains: a positive domain, a negative domain, and a boundary domain. To preserve randomness in attribute selection, three-way random selection rules based on attribute randomness are established, and a certain number of attributes are randomly drawn from the three candidate domains. The samples drawn by bootstrap sampling are combined with the attribute sets chosen by the three-way decision to form the training sets, on which the decision trees are trained and the forest is generated. Experiments are conducted on six datasets. Two parameters, the attribute randomness and the three-way decision thresholds, are analyzed to verify the theoretical conclusions. The results show that TSRF can meet the requirements of different datasets by adjusting these parameters. On binary-class data its classification performance is essentially the same as that of the comparison algorithms, while on multi-class data it shows a significant improvement over them. TSRF broadens the approach to measuring attribute significance, introduces a three-way selection ensemble method for random forests, and provides a better model framework for multi-class classification problems.
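The pipeline outlined in the abstract (score the attributes, split them into three domains, draw attributes at random from each domain, pair them with a bootstrap sample) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formula for decision boundary entropy, so the `importance` scores are treated as an already computed input, and the thresholds `alpha` and `beta`, the per-domain `quotas`, and all function names are hypothetical.

```python
import random


def three_way_partition(importance, alpha, beta):
    """Split attribute indices into positive, boundary and negative domains.

    `importance` stands in for the paper's entropy-based significance
    measure (assumed precomputed; higher means more significant).
    `alpha` and `beta` are hypothetical thresholds with alpha >= beta.
    """
    if alpha < beta:
        raise ValueError("alpha must be >= beta")
    domains = {"positive": [], "boundary": [], "negative": []}
    for index, score in enumerate(importance):
        if score >= alpha:
            domains["positive"].append(index)
        elif score <= beta:
            domains["negative"].append(index)
        else:
            domains["boundary"].append(index)
    return domains


def select_attributes(domains, quotas, rng):
    """Randomly draw up to quotas[name] attributes from each domain."""
    chosen = []
    for name in ("positive", "boundary", "negative"):
        pool = domains[name]
        k = min(quotas.get(name, 0), len(pool))
        chosen.extend(rng.sample(pool, k))
    return sorted(chosen)


def bootstrap_indices(n_samples, rng):
    """Sample row indices with replacement (standard bootstrap)."""
    return [rng.randrange(n_samples) for _ in range(n_samples)]


def build_training_sets(importance, n_samples, n_trees, alpha, beta, quotas, seed=0):
    """Return one (row_indices, attribute_indices) pair per tree."""
    rng = random.Random(seed)
    domains = three_way_partition(importance, alpha, beta)
    return [
        (bootstrap_indices(n_samples, rng), select_attributes(domains, quotas, rng))
        for _ in range(n_trees)
    ]


if __name__ == "__main__":
    scores = [0.9, 0.1, 0.5, 0.8, 0.05, 0.4]  # made-up importance values
    quotas = {"positive": 2, "boundary": 1, "negative": 0}
    for rows, attrs in build_training_sets(scores, 10, 3, 0.7, 0.2, quotas):
        print(attrs, rows)
```

Each `(row_indices, attribute_indices)` pair would then be passed to any decision-tree learner to train one tree of the forest. Changing the quotas shifts how much weight the boundary and negative domains receive, which is one plausible reading of the "attribute randomness" parameter mentioned in the abstract.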
Pages: 13384-13397
Page count: 14
Related papers
50 in total
  • [1] Retraction Note: Three-way selection random forest algorithm based on decision boundary entropy
    Chunying Zhang
    Jing Ren
    Fengchun Liu
    Xiaoqi Li
    Shouyue Liu
    [J]. Applied Intelligence, 2025, 55 (2)
  • [2] Strategy selection under entropy measures in movement-based three-way decision
    Jiang, Chunmao
    Guo, Doudou
    Duan, Ying
    Liu, Yue
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2020, 119 (119) : 280 - 291
  • [3] ACTIVE LEARNING OF THREE-WAY DECISION BASED ON NEIGHBORHOOD ENTROPY
    Lv, Qiuyue
    Dong, Minggang
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (02) : 377 - 393
  • [4] Three-Way Selection Random Forest Optimization Model for Anomaly Traffic Detection
    Zhang, Chunying
    Zhang, Meng
    Yang, Guanghui
    Xue, Tao
    Zhang, Zichi
    Liu, Lu
    Wang, Liya
    Hou, Wei
    Chen, Zhihai
    [J]. ELECTRONICS, 2023, 12 (08)
  • [5] A Three-Way Clustering Method Based on Ensemble Strategy and Three-Way Decision
    Wang, Pingxin
    Liu, Qiang
    Xu, Gang
    Wang, Kangkang
    [J]. INFORMATION, 2019, 10 (02)
  • [6] Three-way decision-based tri-training with entropy minimization
    Pan, Linchao
    Gao, Can
    Zhou, Jie
    [J]. INFORMATION SCIENCES, 2022, 610 : 33 - 51
  • [7] Many-objective evolutionary algorithm based on three-way decision
    Cui, Zhihua
    Li, Bingting
    Lan, Zhuoxuan
    Xu, Yubin
    [J]. EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (03)
  • [8] Three-Way Decision Collaborative Recommendation Algorithm Based on User Reputation
    Qian, Fulan
    Min, Qianqian
    Zhao, Shu
    Chen, Jie
    Wang, Xiangyang
    Zhang, Yanping
    [J]. ROUGH SETS, IJCRS 2019, 2019, 11499 : 424 - 438
  • [9] KNN Ensemble Learning Integration Algorithm Based on Three-Way Decision
    Jia, Xinyuan
    Li, Yating
    Wang, Pengling
    [J]. ROUGH SETS, IJCRS 2022, 2022, 13633 : 346 - 360
  • [10] Adaptive K-means Algorithm Based on Three-Way Decision
    Peng, Yihang
    Zhang, Qinghua
    Ai, Zhihua
    Zhi, Xuechao
    [J]. ROUGH SETS, IJCRS 2022, 2022, 13633 : 390 - 404