Neighborhood rough set based heterogeneous feature subset selection

Cited by: 944
Authors
Hu, Qinghua [1]
Yu, Daren [1]
Liu, Jinfu [1]
Wu, Congxin [1]
Affiliations
[1] Harbin Institute of Technology, Harbin 150001, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
categorical feature; numerical feature; heterogeneous feature; feature selection; neighborhood; rough sets;
DOI
10.1016/j.ins.2008.05.024
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Feature subset selection is an important preprocessing step in pattern recognition, machine learning and data mining. Most research focuses on homogeneous feature selection, that is, on data whose features are either all numerical or all categorical. In this paper, we introduce a neighborhood rough set model to address heterogeneous feature subset selection. Because the classical rough set model can only evaluate categorical features, we generalize it with neighborhood relations. The proposed model degenerates to the classical one when the neighborhood size is set to zero. The neighborhood model reduces numerical and categorical features by assigning different thresholds to different kinds of attributes. In this model, the sizes of the neighborhood lower and upper approximations of the decisions reflect the discriminating capability of feature subsets. The size of the lower approximation is computed as the dependency between the decision and the condition attributes. We use the neighborhood dependency to evaluate the significance of a subset of heterogeneous features and construct forward feature subset selection algorithms. The proposed algorithms are compared with some classical techniques. Experimental results show that the method based on the neighborhood model is more flexible in dealing with heterogeneous data. (C) 2008 Elsevier Inc. All rights reserved.
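
The ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and every name in it (in_neighborhood, dependency, forward_select, delta, the toy data) is hypothetical. It assumes that numerical features are compared by Euclidean distance against a threshold delta, that categorical features use a threshold of zero (values must match exactly), and that the dependency is the fraction of samples whose whole neighborhood shares their decision label. The greedy loop adds the feature with the largest dependency gain and stops when no feature improves it.

```python
from typing import List, Sequence, Set, Tuple

Sample = Sequence[object]


def in_neighborhood(x: Sample, z: Sample, features: List[int],
                    numeric: Set[int], delta: float) -> bool:
    """True if z lies in the neighborhood of x on the given features.

    Assumption: numerical features use Euclidean distance <= delta,
    categorical features use threshold zero (exact match).
    """
    total = 0.0
    for f in features:
        if f in numeric:
            total += (float(x[f]) - float(z[f])) ** 2
        elif x[f] != z[f]:
            return False
    return total ** 0.5 <= delta


def dependency(X: List[Sample], y: List[int], features: List[int],
               numeric: Set[int], delta: float) -> float:
    """Fraction of samples whose neighborhood contains a single decision label."""
    consistent = 0
    for i in range(len(X)):
        if all(y[j] == y[i] for j in range(len(X))
               if in_neighborhood(X[i], X[j], features, numeric, delta)):
            consistent += 1
    return consistent / len(X)


def forward_select(X: List[Sample], y: List[int], n_features: int,
                   numeric: Set[int], delta: float,
                   eps: float = 1e-12) -> Tuple[List[int], float]:
    """Greedy forward selection driven by the dependency gain."""
    selected: List[int] = []
    remaining = list(range(n_features))
    best = 0.0
    while remaining:
        scores = [(dependency(X, y, selected + [f], numeric, delta), f)
                  for f in remaining]
        top_score, top_f = max(scores, key=lambda s: s[0])
        if top_score - best <= eps:  # no feature adds discriminating power
            break
        selected.append(top_f)
        remaining.remove(top_f)
        best = top_score
    return selected, best


# Toy heterogeneous data (made up for illustration):
# feature 0 numerical, feature 1 categorical, feature 2 numerical noise.
X = [
    (0.10, "a", 0.50),
    (0.15, "a", 0.52),
    (0.20, "b", 0.48),
    (0.90, "b", 0.51),
    (0.95, "a", 0.49),
]
y = [0, 0, 1, 1, 1]
numeric = {0, 2}

print(forward_select(X, y, n_features=3, numeric=numeric, delta=0.2))
```

On this toy data neither the numerical nor the categorical feature separates the classes alone, but together they do, which is the situation heterogeneous selection is meant to handle. Setting delta to zero makes each neighborhood an equivalence class of identical values, matching the abstract's remark that the model then reduces to the classical rough set model.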
Pages: 3577 - 3594
Page count: 18
Related papers
50 in total
  • [21] A hybrid genetic algorithm for feature subset selection in rough set theory
    Jing, Si-Yuan
    [J]. SOFT COMPUTING, 2014, 18 (07) : 1373 - 1382
  • [22] A New Online Feature Selection Method Using Neighborhood Rough Set
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 135 - 142
  • [23] Online streaming feature selection using adapted Neighborhood Rough Set
    Zhou, Peng
    Hu, Xuegang
    Li, Peipei
    Wu, Xindong
    [J]. INFORMATION SCIENCES, 2019, 481 : 258 - 279
  • [24] Uncertainty optimization based feature subset selection model using rough set and uncertainty theory
    Sinha A.K.
    Shende P.
    Namdev N.
    [J]. International Journal of Information Technology, 2022, 14 (5) : 2723 - 2739
  • [25] Rough Set Based Feature Selection: A Review
    Anaraki, Javad Rahimipour
    Eftekhari, Mahdi
    [J]. 2013 5TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2013, : 301 - 306
  • [26] A novel hybrid feature selection method considering feature interaction in neighborhood rough set
    Wan, Jihong
    Chen, Hongmei
    Yuan, Zhong
    Li, Tianrui
    Yang, Xiaoling
    Sang, BinBin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 227
  • [27] Feature Subset Selection Approach Based on Fuzzy Rough Set for High-dimensional Data
    Guo, Changyou
    Zheng, Xuefeng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 72 - 75
  • [28] Neighborhood rough set based ensemble feature selection with cross-class sample granulation
    Liu, Keyu
    Li, Tianrui
    Yang, Xibei
    Yang, Xin
    Liu, Dun
    [J]. APPLIED SOFT COMPUTING, 2022, 131
  • [29] Fast feature selection algorithm for neighborhood rough set model based on Bucket and Trie structures
    Rachid Benouini
    Imad Batioua
    Soufiane Ezghari
    Khalid Zenkouar
    Azeddine Zahi
    [J]. Granular Computing, 2020, 5 : 329 - 347
  • [30] A new method for feature selection based on weighted k-nearest neighborhood rough set
    Wang, Ning
    Zhao, Enhui
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238