Neighborhood rough set based heterogeneous feature subset selection

Cited by: 944
Authors
Hu, Qinghua [1]
Yu, Daren [1]
Liu, Jinfu [1]
Wu, Congxin [1]
Affiliations
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
categorical feature; numerical feature; heterogeneous feature; feature selection; neighborhood; rough sets;
DOI
10.1016/j.ins.2008.05.024
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Feature subset selection is an important preprocessing step in pattern recognition, machine learning and data mining. Most research focuses on homogeneous feature selection, that is, on data with only numerical or only categorical features. In this paper, we introduce a neighborhood rough set model to deal with the problem of heterogeneous feature subset selection. Because the classical rough set model can only evaluate categorical features, we generalize it with neighborhood relations. The proposed model reduces to the classical one if the neighborhood size is set to zero. The neighborhood model is used to reduce numerical and categorical features by assigning different thresholds to different kinds of attributes. In this model, the sizes of the neighborhood lower and upper approximations of the decisions reflect the discriminating capability of feature subsets. The size of the lower approximation is used to compute the dependency between decision and condition attributes. We use the neighborhood dependency to evaluate the significance of a subset of heterogeneous features and construct forward feature subset selection algorithms. The proposed algorithms are compared with some classical techniques. Experimental results show that the neighborhood-model-based method is more flexible in dealing with heterogeneous data. (C) 2008 Elsevier Inc. All rights reserved.
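The idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation; it only assumes what the abstract states (a sample belongs to the lower approximation when every sample in its neighborhood shares its decision, the dependency is the fraction of such samples, and features are added greedily). The function names `distance`, `dependency` and `forward_select`, the max-norm distance, the 0/1 mismatch for categorical features, the single threshold `delta`, and the toy data are all assumptions made for illustration.

```python
from typing import List, Sequence, Set, Tuple


def distance(x: Sequence, z: Sequence, features: Sequence[int],
             numeric: Set[int]) -> float:
    """Max-norm distance over the chosen features (an assumed choice).

    Numerical features contribute |x - z|; categorical features
    contribute 0 when equal and 1 otherwise.
    """
    d = 0.0
    for f in features:
        if f in numeric:
            diff = abs(x[f] - z[f])
        else:
            diff = 0.0 if x[f] == z[f] else 1.0
        d = max(d, diff)
    return d


def dependency(X: Sequence[Sequence], y: Sequence, features: Sequence[int],
               numeric: Set[int], delta: float) -> float:
    """Fraction of samples whose delta-neighborhood contains one class only."""
    consistent = 0
    for i, xi in enumerate(X):
        pure = all(
            y[j] == y[i]
            for j, xj in enumerate(X)
            if distance(xi, xj, features, numeric) <= delta
        )
        consistent += pure
    return consistent / len(X)


def forward_select(X: Sequence[Sequence], y: Sequence, numeric: Set[int],
                   delta: float, eps: float = 1e-9) -> Tuple[List[int], float]:
    """Greedy forward search: add the feature that raises dependency most."""
    selected: List[int] = []
    best = 0.0
    remaining = list(range(len(X[0])))
    while remaining:
        scored = [(dependency(X, y, selected + [f], numeric, delta), f)
                  for f in remaining]
        score, f = max(scored, key=lambda t: t[0])
        if score - best <= eps:  # no feature improves the dependency
            break
        selected.append(f)
        remaining.remove(f)
        best = score
    return selected, best


# Toy data (hypothetical): columns 0 and 2 are numerical, column 1 categorical.
X = [
    [0.10, "a", 0.5],
    [0.15, "a", 0.9],
    [0.80, "b", 0.5],
    [0.85, "b", 0.1],
]
y = [0, 0, 1, 1]
numeric = {0, 2}

print(dependency(X, y, [2], numeric, delta=0.2))   # 0.5
print(forward_select(X, y, numeric, delta=0.2))    # ([0], 1.0)
```

With `delta` below 1, a categorical feature splits the samples into equivalence classes, which mirrors the abstract's remark that the model reduces to the classical rough set model when the neighborhood size is zero. The pairwise loop is quadratic in the number of samples and is meant only to make the definition concrete.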
Pages: 3577 - 3594
Page count: 18
Related papers
50 items in total
  • [31] Fast feature selection algorithm for neighborhood rough set model based on Bucket and Trie structures
    Benouini, Rachid
    Batioua, Imad
    Ezghari, Soufiane
    Zenkouar, Khalid
    Zahi, Azeddine
    [J]. GRANULAR COMPUTING, 2020, 5 (03) : 329 - 347
  • [32] Information-theoretic partially labeled heterogeneous feature selection based on neighborhood rough sets
    Zhang, Hongying
    Sun, Qianqian
    Dong, Kezhen
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2023, 154 : 200 - 217
  • [33] Noise-resistant multilabel fuzzy neighborhood rough sets for feature subset selection
    Yin, Tengyu
    Chen, Hongmei
    Yuan, Zhong
    Li, Tianrui
    Liu, Keyu
    [J]. INFORMATION SCIENCES, 2023, 621 : 200 - 226
  • [34] Feature selection by ordered rough set based feature weighting
    Al-Radaideh, QA
    Sulaiman, MN
    Selamat, MH
    Ibrahim, HT
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 105 - 112
  • [35] Incremental feature selection for dynamic hybrid data using neighborhood rough set
    Shu, Wenhao
    Qian, Wenbin
    Xie, Yonghong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 194
  • [36] Heterogeneous Feature Selection Based on Neighborhood Combination Entropy
    Zhang, Pengfei
    Li, Tianrui
    Yuan, Zhong
    Luo, Chuan
    Liu, Keyu
    Yang, Xiaoling
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3514 - 3527
  • [37] Feature selection based on neighborhood rough sets and Gini index
    Zhang, Yuchao
    Nie, Bin
    Du, Jianqiang
    Chen, Jiandong
    Du, Yuwen
    Jin, Haike
    Zheng, Xuepeng
    Chen, Xingxin
    Miao, Zhen
    [J]. PeerJ Computer Science, 2023, 9
  • [38] Feature selection based on neighborhood rough sets and Gini index
    Zhang, Yuchao
    Nie, Bin
    Du, Jianqiang
    Chen, Jiandong
    Du, Yuwen
    Jin, Haike
    Zheng, Xuepeng
    Chen, Xingxin
    Miao, Zhen
    [J]. PEERJ, 2023, 11
  • [39] Feature selection for imbalanced data based on neighborhood rough sets
    Chen, Hongmei
    Li, Tianrui
    Fan, Xin
    Luo, Chuan
    [J]. INFORMATION SCIENCES, 2019, 483 : 1 - 20
  • [40] Feature selection based on rough set and information entropy
    Han, JC
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 153 - 158