Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引:1
|
作者
Xu, Weihua [1 ]
Yang, Yifei [1 ]
Ding, Yi [1 ]
Chen, Xiyang [2 ]
Lv, Xiaofang [3 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China
[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;
D O I
10.1007/s10489-024-05411-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.
引用
收藏
页码:4890 / 4910
页数:21
相关论文
共 50 条
  • [21] An entropy-based approach for generating multi-dimensional sequential patterns
    Lee, CH
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 585 - 592
  • [22] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
    Pan, Yanzhou
    Xu, Weihua
    Ran, Qinwen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (04) : 1217 - 1233
  • [23] An incremental approach to feature selection using the weighted dominance-based neighborhood rough sets
    Yanzhou Pan
    Weihua Xu
    Qinwen Ran
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1217 - 1233
  • [24] Feature selection in a discrete feature space based on fuzzy conditional information entropy iterative model and matrix operation
    Li, Zhaowen
    Chen, Yiying
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (05) : 597 - 635
  • [25] A set optimization approach to zero-sum matrix games with multi-dimensional payoffs
    Andreas H. Hamel
    Andreas Löhne
    Mathematical Methods of Operations Research, 2018, 88 : 369 - 397
  • [26] A set optimization approach to zero-sum matrix games with multi-dimensional payoffs
    Hamel, Andreas H.
    Loehne, Andreas
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2018, 88 (03) : 369 - 397
  • [27] The Approach of SWIM Data Sharing Based on Multi-Dimensional Data Encryption
    Wu, Zhijun
    Liu, Liang
    Yan, Changcan
    Xu, Jia
    Lei, Jin
    2017 25TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2017, : 421 - 426
  • [28] Feature selection for multiset-valued data based on fuzzy conditional information entropy using iterative model and matrix operation
    Huang, Dan
    Chen, Yiying
    Liu, Fang
    Li, Zhaowen
    APPLIED SOFT COMPUTING, 2023, 142
  • [29] Incremental updating approximations in dominance-based rough sets approach under the variation of the attribute set
    Li, Shaoyong
    Li, Tianrui
    Liu, Dun
    KNOWLEDGE-BASED SYSTEMS, 2013, 40 : 17 - 26
  • [30] Multi-Dimensional Selection Method of Port Logistics Location Based on Entropy Weight Method
    Guo, Ruiwei
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (04): : 407 - 416