New feature selection and voting scheme to improve classification accuracy

被引:0
|
作者
Cheng-Jung Tsai
机构
[1] National Changhua University of Education,Department of Mathematics, Graduate Institute of Statistics and Information Science
来源
Soft Computing | 2019年 / 23卷
关键词
Data mining; Classification; Decision tree; Ensemble learning; Feature selection; Voting;
D O I
暂无
中图分类号
学科分类号
摘要
Classification is a classic technique employed in data mining. Many ensemble learning methods have been introduced to improve the predictive accuracy of classification. A typical ensemble learning method consists of three steps: selection, building, and integration. Of the three steps, the first and third significantly affect the predictive accuracy of the classification. In this paper, we propose a new selection and integration scheme. Our method can improve the accuracy of subtrees and maintain their diversity. Through a new voting scheme, the predictive accuracy of ensemble learning is improved. We also theoretically analyzed the selection and integration steps of our method. The results of experimental analyses show that our method can achieve better accuracy than two state-of-the-art tree-based ensemble learning approaches.
引用
收藏
页码:12017 / 12030
页数:13
相关论文
共 50 条
  • [41] Fault classification on vibration data with wavelet based feature selection scheme
    Yen, GG
    Leong, WF
    ISA TRANSACTIONS, 2006, 45 (02) : 141 - 151
  • [42] An Efficient Traffic Classification Scheme Using Embedded Feature Selection and LightGBM
    Hua, Yanpei
    2020 INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC), 2020, : 125 - 130
  • [43] Fault classification on vibration data with wavelet based feature selection scheme
    Yen, GG
    Leong, WF
    IECON 2005: THIRTY-FIRST ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-3, 2005, : 1644 - 1649
  • [44] PCA-based feature selection scheme for machine defect classification
    Malhi, A
    Gao, RX
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2004, 53 (06) : 1517 - 1525
  • [45] Variable Global Feature Selection Scheme for automatic classification of text documents
    Agnihotri, Deepak
    Verma, Kesari
    Tripathi, Priyanka
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 81 : 268 - 281
  • [46] A new optimal feature selection scheme for classification of power quality disturbances based on ant colony framework
    Singh, Utkarsh
    Singh, Shyam Narain
    APPLIED SOFT COMPUTING, 2019, 74 : 216 - 225
  • [47] Does Feature Selection Improve Classification? A Large Scale Experiment in OpenML
    Post, Martijn J.
    van der Putten, Peter
    van Rijn, Jan N.
    ADVANCES IN INTELLIGENT DATA ANALYSIS XV, 2016, 9897 : 158 - 170
  • [48] An Approach Based on Resampling and Feature Selection to Improve the Classification of Microarray Data
    Soleymani, Nafiseh
    Moattar, Mohammad Hussein
    2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 61 - 64
  • [49] Derivation of an artificial gene to improve classification accuracy upon gene selection
    Seo, Minseok
    Oh, Sejong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2012, 36 : 1 - 12
  • [50] Soft voting technique to improve the performance of global filter based feature selection in text corpus
    Agnihotri, Deepak
    Verma, Kesari
    Tripathi, Priyanka
    Singh, Bikesh Kumar
    APPLIED INTELLIGENCE, 2019, 49 (04) : 1597 - 1619