Neighborhood rough set based heterogeneous feature subset selection

被引:944
|
作者
Hu, Qinghua [1 ]
Yu, Daren [1 ]
Liu, Jinfu [1 ]
Wu, Congxin [1 ]
机构
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
categorical feature; numerical feature; heterogeneous feature; feature selection; neighborhood; rough sets;
D O I
10.1016/j.ins.2008.05.024
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature subset selection is viewed as an important preprocessing step for pattern recognition, machine learning and data mining. Most of researches are focused on dealing with homogeneous feature selection, namely, numerical or categorical features. In this paper, we introduce a neighborhood rough set model to deal with the problem of heterogeneous feature subset selection. As the classical rough set model can just be used to evaluate categorical features, we generalize this model with neighborhood relations and introduce a neighborhood rough set model. The proposed model will degrade to the classical one if we specify the size of neighborhood zero. The neighborhood model is used to reduce numerical and categorical features by assigning different thresholds for different kinds of attributes. In this model the sizes of the neighborhood lower and upper approximations of decisions reflect the discriminating capability of feature subsets. The size of lower approximation is computed as the dependency between decision and condition attributes. We use the neighborhood dependency to evaluate the significance of a subset of heterogeneous features and construct forward feature subset selection algorithms. The proposed algorithms are compared with some classical techniques. Experimental results show that the neighborhood model based method is more flexible to deal with heterogeneous data. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:3577 / 3594
页数:18
相关论文
共 50 条
  • [1] Feature subset selection based on fuzzy neighborhood rough sets
    Wang, Changzhong
    Shao, Mingwen
    He, Qiang
    Qian, Yuhua
    Qi, Yali
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 111 : 173 - 179
  • [2] Fault feature subset selection based on rough set theory
    Zhao, Yueling
    Xu, Lin
    Wang, Jianhui
    Gu, Shusheng
    [J]. Complexity Analysis and Control for Social, Economical and Biological Systems, 2006, 1 : 162 - 171
  • [3] Feature Subset Selection Based on Variable Precision Neighborhood Rough Sets
    Chen, Yingyue
    Chen, Yumin
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 572 - 581
  • [4] Feature Selection Based on Neighborhood Systems and Rough Set Theory
    He, Ming
    [J]. WKDD: 2009 SECOND INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, : 3 - 5
  • [5] Online streaming feature selection based on neighborhood rough set
    Li, Shuangjie
    Zhang, Kaixiang
    Li, Yali
    Wang, Shuqin
    Zhang, Shaoqiang
    [J]. APPLIED SOFT COMPUTING, 2021, 113
  • [6] Label distribution feature selection based on neighborhood rough set
    Wu, Yilin
    Guo, Wenzhong
    Lin, Yaojin
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (23):
  • [7] Neighborhood rough set with neighborhood equivalence relation for feature selection
    Shangzhi Wu
    Litai Wang
    Shuyue Ge
    Zhengwei Hao
    Yulin Liu
    [J]. Knowledge and Information Systems, 2024, 66 : 1833 - 1859
  • [8] Neighborhood rough set with neighborhood equivalence relation for feature selection
    Wu, Shangzhi
    Wang, Litai
    Ge, Shuyue
    Hao, Zhengwei
    Liu, Yulin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (03) : 1833 - 1859
  • [9] Supervised spectral feature selection with neighborhood rough set
    Liu, Qiong
    Cai, Mingjie
    Li, Qingguo
    [J]. APPLIED SOFT COMPUTING, 2024, 165
  • [10] Feature subset selection based on mahalanobis distance: a statistical rough set method
    孙亮
    韩崇昭
    [J]. Journal of Pharmaceutical Analysis, 2008, (01) : 14 - 18