Incremental feature selection approach to multi-dimensional variation based on matrix dominance conditional entropy for ordered data set

被引:1
|
作者
Xu, Weihua [1 ]
Yang, Yifei [1 ]
Ding, Yi [1 ]
Chen, Xiyang [2 ]
Lv, Xiaofang [3 ]
机构
[1] Southwest Univ, Coll Artificial Intelligence, Chongqing 400715, Peoples R China
[2] Xian Univ Sci & Technol, Coll Comp Sci & Technol, Xian 710600, Peoples R China
[3] Southwest Univ, Coll Life Sci, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Conditional entropy; Dominance matrix; Feature selection; Ordered data set; Rough set; ATTRIBUTE REDUCTION; DYNAMIC DATA; LEARNING ALGORITHM; ROUGH SETS;
D O I
10.1007/s10489-024-05411-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rough set theory is a mathematical tool widely employed in various fields to handle uncertainty. Feature selection, as an essential and independent research area within rough set theory, aims to identify a small subset of important features by eliminating irrelevant, redundant, or noisy ones. In human life, data characteristics constantly change over time and other factors, resulting in ordered datasets with varying features. However, existing feature extraction methods are not suitable for handling such datasets since they do not consider previous reduction results when features change and need to be recomputed, leading to significant time consumption. To address this issue, the incremental attribute reduction algorithm utilizes prior reduction results effectively reducing computation time. Motivated by this approach, this paper investigates incremental feature selection algorithms for ordered datasets with changing features. Firstly, we discuss the dominant matrix and the dominance conditional entropy while introducing update principles for the new dominant matrix and dominance diagonal matrix when features change. Subsequently, we propose two incremental feature selection algorithms for adding (IFS-A) or deleting (IFS-D) features in ordered data set. Additionally, nine UCI datasets are utilized to evaluate the performance of our proposed algorithm. The experimental results validate that the average classification accuracy of IFS-A and IFS-D under four classifiers on twelve datasets is 82.05% and 80.75%, which increases by 5.48% and 3.68% respectively compared with the original data.
引用
收藏
页码:4890 / 4910
页数:21
相关论文
共 50 条
  • [11] Incremental feature selection based on rough set in dynamic incomplete data
    Shu, Wenhao
    Shen, Hong
    PATTERN RECOGNITION, 2014, 47 (12) : 3890 - 3906
  • [12] Comparison Analysis of Feature Subset Selection Algorithms for Multi-dimensional Data
    Ruan, Jing
    Zhang, Chang-Sheng
    Guo, Jun-Fang
    2015 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND INFORMATION SYSTEM (SEIS 2015), 2015, : 681 - 686
  • [13] A Rough Set Approach to Feature Selection Based on Relative Decision Entropy
    Zhou, Lin
    Jiang, Feng
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2011, 6954 : 110 - 119
  • [14] Rough set Theory-Based group incremental approach to feature selection
    Zhao, Jie
    Wu, Dai-yang
    Zhou, Yong-xin
    Liang, Jia-ming
    Wei, WenHong
    Li, Yun
    INFORMATION SCIENCES, 2024, 675
  • [15] A Dynamic Dominance-Based Rough Set Approach for Processing Ordered Data
    Li, Shaoyong
    Hong, Zhiyong
    ROUGH SETS, IJCRS 2019, 2019, 11499 : 312 - 320
  • [16] Interval Dominance-Based Feature Selection for Interval-Valued Ordered Data
    Li, Wentao
    Zhou, Haoxiang
    Xu, Weihua
    Wang, Xi-Zhao
    Pedrycz, Witold
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (10) : 6898 - 6912
  • [17] Active Incremental Feature Selection Using a Fuzzy-Rough-Set-Based Information Entropy
    Zhang, Xiao
    Mei, Changlin
    Chen, Degang
    Yang, Yanyan
    Li, Jinhai
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 28 (05) : 901 - 915
  • [18] Granular-ball-matrix-based incremental semi-supervised feature selection approach to high-dimensional variation using neighbourhood discernibility degree for ordered partially labelled dataset
    Xu, Weihua
    Li, Jinlong
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [19] Feature Subset Selection Approach Based on Fuzzy Rough Set for High-dimensional Data
    Guo, Changyou
    Zheng, Xuefeng
    2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 72 - 75
  • [20] Parallel Rough Set: Dimensionality Reduction and Feature Discovery of Multi-dimensional Data in Visualization
    Huang, Tze-Haw
    Huang, Mao Lin
    Jin, Jesse S.
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 99 - +