Feature Subset Selection Approach Based on Fuzzy Rough Set for High-dimensional Data

被引:0
|
作者
Guo, Changyou [1 ,2 ]
Zheng, Xuefeng [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing, Peoples R China
关键词
Feature subset selection; fuzzy rough set; rough set; high-dimensional data; data mining; ATTRIBUTE REDUCTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature subset selection, as an important processing step to knowledge discovery and machine learning, is effective method in reducing irrelevant and or redundant features, compressing repeated data, and improving classification accuracy. Rough set theory is an important tool to select feature subset from high-dimensional data. In this work, feature subset selection based on fuzzy rough set is introduced, and the efficient measure of feature significance is designed. Based on the fuzzy rough set model, a quick feature subset selection approach is presented, which can efficiently identify relevant features as well as redundancy among all features. In addition, the KNN-based classifier based on the proposed approach is constructed. The experimental results show that the proposed feature subset selection approach achieves better classification on UCI datasets.
引用
收藏
页码:72 / 75
页数:4
相关论文
共 50 条
  • [1] A new population initialization of metaheuristic algorithms based on hybrid fuzzy rough set for high-dimensional gene data feature selection
    Guo, Xuanming
    Hu, Jiao
    Yu, Helong
    Wang, Mingjing
    Yang, Bo
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [2] Online Streaming Feature Selection for High-Dimensional and Class-Imbalanced Data Based on Neighborhood Rough Set
    Chen, Xiangyan
    Lin, Yaojin
    Wang, Chenxi
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (08): : 726 - 735
  • [3] A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Song, Qinbao
    Ni, Jingjie
    Wang, Guangtao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 1 - 14
  • [4] On fuzzy feature selection in designing fuzzy classifiers for high-dimensional data
    Mansoori E.G.
    Shafiee K.S.
    [J]. Evolving Systems, 2016, 7 (4) : 255 - 265
  • [5] A hybrid approach using rough set theory and hypergraph for feature selection on high-dimensional medical datasets
    Raman, M. R. Gauthama
    Nivethitha, Somu
    Kannan, Krithivasan
    Sriram, V. S. Shankar
    [J]. SOFT COMPUTING, 2019, 23 (23) : 12655 - 12672
  • [6] A hybrid approach using rough set theory and hypergraph for feature selection on high-dimensional medical datasets
    M. R. Gauthama Raman
    Somu Nivethitha
    Krithivasan Kannan
    V. S. Shankar Sriram
    [J]. Soft Computing, 2019, 23 : 12655 - 12672
  • [7] A Feature Subset Selection Method Based On High-Dimensional Mutual Information
    Zheng, Yun
    Kwoh, Chee Keong
    [J]. ENTROPY, 2011, 13 (04) : 860 - 901
  • [8] Implementation of FAST Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data
    Shilu, Smit
    Sheth, Kushal
    Mehul, Ekata
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT ICT4SD 2015, VOL 2, 2016, 409 : 203 - 213
  • [9] Fault feature subset selection based on rough set theory
    Zhao, Yueling
    Xu, Lin
    Wang, Jianhui
    Gu, Shusheng
    [J]. Complexity Analysis and Control for Social, Economical and Biological Systems, 2006, 1 : 162 - 171
  • [10] Neighborhood rough set based heterogeneous feature subset selection
    Hu, Qinghua
    Yu, Daren
    Liu, Jinfu
    Wu, Congxin
    [J]. INFORMATION SCIENCES, 2008, 178 (18) : 3577 - 3594