A Study on Feature Subset Selection Using Rough Set Theory

被引:3
|
作者
Han, Jianchao [1 ]
机构
[1] Calif State Univ Dominguez Hills, Dept Comp Sci, 1000 E Victoria St, Carson, CA 90747 USA
关键词
Feature Selection; Data Reduction; Rough Set Theory; Information Entropy; Classification;
D O I
10.1166/jama.2012.1018
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Feature subset selection is an important component of knowledge discovery and data mining systems to help reduce the data dimensionality. Rough sets theory provides a mechanism of selecting feature subsets. Most existing rough set-based feature selection algorithms suffer from intensive computation of either discernibility functions or positive regions to find attribute reduct. In order to improve the efficiency, we propose a new concept, called relative attribute dependency, with which we present a sufficient and necessary condition of the minimum conditional attribute reduct of a decision table and develop a new computation model to find the minimum reduct of condition attributes. The relative attribute dependency can be calculated by counting the distinct rows of the sub-decision table, instead of generating discernibility functions or positive regions. Thus the computation efficiency of minimum reducts is highly improved. Two novel algorithms to find optimal reducts of condition attributes based on the relative attribute dependency are proposed and implemented using Java, one brute-force algorithm and the other heuristic algorithm using attribute entropy as the heuristic function. The algorithms proposed are experimented with 10 data sets from UCI Machine Learning Repository. We conduct the comparison of data classification using C4.5 with the original their their usefulness and are analyzed for further reserach.
引用
收藏
页码:239 / 249
页数:11
相关论文
共 50 条
  • [31] An approach for selective ensemble feature selection based on rough set theory
    Yang, Yong
    Wang, Guoyin
    He, Kun
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2007, 4481 : 518 - +
  • [32] Feature Selection Based on Ant Colony Optimization and Rough Set Theory
    He, Ming
    [J]. ISCSCT 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND COMPUTATIONAL TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 247 - 250
  • [33] New filter approaches for feature selection using differential evolution and fuzzy rough set theory
    Hancer, Emrah
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2929 - 2944
  • [34] A group incremental feature selection for classification using rough set theory based genetic algorithm
    Das, Asit K.
    Sengupta, Shampa
    Bhattacharyya, Siddhartha
    [J]. APPLIED SOFT COMPUTING, 2018, 65 : 400 - 411
  • [35] New filter approaches for feature selection using differential evolution and fuzzy rough set theory
    Emrah Hancer
    [J]. Neural Computing and Applications, 2020, 32 : 2929 - 2944
  • [36] Feature Selection in Gene Expression Data Using Principal Component Analysis and Rough Set Theory
    Mishra, Debahuti
    Dash, Rajashree
    Rath, Amiya Kumar
    Acharya, Milu
    [J]. SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 91 - 100
  • [37] A Rough Set Based Feature Selection Approach using Random Feature Vectors
    Raza, Muhammad Summair
    Qamar, Usman
    [J]. PROCEEDINGS OF 14TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY PROCEEDINGS - FIT 2016, 2016, : 229 - 234
  • [38] Feature Subset Selection Approach Based on Fuzzy Rough Set for High-dimensional Data
    Guo, Changyou
    Zheng, Xuefeng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC), 2014, : 72 - 75
  • [39] Gene Subset Selection for Cancer Classification Using Statsitical and Rough Set Approach
    Das, Asit Kumar
    Pati, Soumen Kumar
    [J]. SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, (SEMCCO 2012), 2012, 7677 : 294 - +
  • [40] SUBSET SELECTION USING ROUGH SET IN WAVELET PACKET BASED TEXTURE CLASSIFICATION
    Wang, Qiong
    Li, Hong
    Liu, Jian
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, : 662 - +