Structural risk minimization of rough set-based classifier

Cited by: 0
Authors
Jinfu Liu
Mingliang Bai
Na Jiang
Daren Yu
Affiliation
[1] Harbin Institute of Technology, School of Energy Science and Engineering
Source
Soft Computing | 2020 / Vol. 24
Keywords
Structural risk minimization; Rough set-based classifiers; Complexity control; Genetic multi-objective optimization
DOI
Not available
Abstract
Classification ability on unseen objects, namely generalization ability, remains a long-standing challenge for rough set-based classifiers. Current research mainly focuses on introducing thresholds to tolerate some errors on seen objects, but both the rationale for introducing thresholds and the selection of threshold values still lack sufficient theoretical support. The structural risk minimization (SRM) inductive principle is one of the most effective theories for controlling generalization ability; it suggests a trade-off between errors on seen objects and classifier complexity. Therefore, this paper introduces the SRM principle into rough set-based classifiers and proposes an SRM algorithm for rough set-based classifiers, called the SRM-R algorithm. The SRM-R algorithm uses the number of rules to characterize the actual complexity of a rough set-based classifier and obtains the optimal trade-off between errors on seen objects and complexity through genetic multi-objective optimization. A tenfold cross-validation experiment on 12 UCI datasets shows that the SRM-R algorithm can significantly improve generalization ability compared with the conventional threshold algorithm. In addition, this paper uses two other possible complexity metrics, the number of attributes and the attribute space, to construct corresponding SRM algorithms and compares their classification accuracy with that of the SRM-R algorithm. The comparison shows that the SRM-R algorithm obtains the best classification accuracy, which indicates that the number of rules characterizes complexity more effectively than the number of attributes or the attribute space. Further experiments show that the SRM-R algorithm obtains fewer rules with larger support coefficients, which means it extracts stronger rules. This explains, to some extent, why it achieves better generalization ability.
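The trade-off between errors on seen objects and the number of rules can be pictured as a two-objective problem. The sketch below is not the SRM-R algorithm and does not implement a genetic search; it only shows the non-dominated (Pareto) filtering step that multi-objective optimizers generally rely on. The candidate values, the `Candidate` type, and the function names `dominates` and `pareto_front` are assumptions made up for illustration and do not come from the paper.

```python
from typing import List, Tuple

# Hypothetical candidate classifiers, each summarized as
# (training_error, number_of_rules). The numbers are invented.
Candidate = Tuple[float, int]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b on both objectives
    and strictly better on at least one (both are minimized)."""
    no_worse = a[0] <= b[0] and a[1] <= b[1]
    strictly_better = a[0] < b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep the candidates that no other candidate dominates,
    sorted by increasing number of rules."""
    front = [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]
    return sorted(set(front), key=lambda c: c[1])


candidates = [
    (0.00, 40),  # fits the seen objects perfectly, but with many rules
    (0.02, 25),
    (0.05, 12),
    (0.05, 30),  # dominated by (0.02, 25)
    (0.10, 12),  # dominated by (0.05, 12)
    (0.20, 5),   # very few rules, but many training errors
]

front = pareto_front(candidates)
for error, n_rules in front:
    print(f"rules={n_rules:3d}  training_error={error:.2f}")
```

Every point on the resulting front is a different compromise between fitting the seen objects and keeping the rule set small. How the paper selects a single solution from such a front is not stated in the abstract, so that step is deliberately left out here.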
Pages: 2049-2066
Number of pages: 17
Related papers
50 in total
  • [1] Structural risk minimization of rough set-based classifier
    Liu, Jinfu
    Bai, Mingliang
    Jiang, Na
    Yu, Daren
    [J]. SOFT COMPUTING, 2020, 24 (03) : 2049 - 2066
  • [2] Apply a rough set-based classifier to dependency parsing
    Ji, Yangsheng
    Shang, Lin
    Dai, Xinyu
    Ma, Ruoce
    [J]. ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 97 - 105
  • [3] Rough Set-based SVM Classifier for Text Categorization
    Chen, Peng
    Liu, Shuang
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 153 - +
  • [4] Rough Set-Based Analysis of Characteristic Features for ANN Classifier
    Stanczyk, Urszula
    [J]. HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, PT 1, 2010, 6076 : 565 - 572
  • [5] A Rough Set-based SVM Classifier for ATR on the Basis of Invariant Moment
    Huang, Lei
    Ma, Ying-jun
    Guo, Lei
    [J]. 2009 WRI INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND MOBILE COMPUTING: CMC 2009, VOL 3, 2009, : 620 - +
  • [6] A rough set-based fuzzy clustering
    Zhao, YQ
    Zhou, XZ
    Tang, GZ
    [J]. INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2005, 3689 : 401 - 409
  • [7] On the Definability of a Set and Rough Set-Based Rule Generation
    Sakai, Hiroshi
    Wu, Mao
    Yamaguchi, Naoto
    [J]. 2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 122 - 125
  • [8] Dataset condensation using OWA fuzzy-rough set-based nearest neighbor classifier
    Amiri, Mehran
    Jensen, Richard
    Eftekhari, Mahdi
    Mac Parthalain, Neil
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 1934 - 1941
  • [9] A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis
    Chen, Hui-Ling
    Yang, Bo
    Liu, Jie
    Liu, Da-You
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 9014 - 9022
  • [10] A rough set-based approach to text classification
    Chouchoulas, A
    Shen, Q
    [J]. NEW DIRECTIONS IN ROUGH SETS, DATA MINING, AND GRANULAR-SOFT COMPUTING, 1999, 1711 : 118 - 127