The random subspace method for constructing decision forests

被引:24
|
作者
Ho, TK [1 ]
机构
[1] AT&T Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
关键词
pattern recognition; decision tree; decision forest; stochastic discrimination; decision combination; classifier combination; multiple-classifier system; bootstrapping;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much of previous attention on decision trees focuses on the splitting criteria and optimization of tree sizes. The dilemma between overfitting and achieving maximum accuracy is seldom resolved. A method to construct a decision tree based classifier is proposed that maintains highest accuracy on training data and improves on generalization accuracy as it grows in complexity. The classifier consists of multiple trees constructed systematically by pseudorandomly selecting subsets of components of the feature vector, that is, trees constructed in randomly chosen subspaces. The subspace method is compared to single-tree classifiers and other forest construction methods by experiments on publicly available datasets, where the method's superiority is demonstrated. We also discuss independence between trees in a forest and relate that to the combined classification accuracy.
引用
收藏
页码:832 / 844
页数:13
相关论文
共 50 条
  • [1] Local Random Subspace Method for Constructing Multiple Decision Stumps
    Kotsiantis, S. B.
    2009 INTERNATIONAL CONFERENCE ON INFORMATION AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 125 - 129
  • [2] Research and application of a method for constructing decision forests
    Qiang, Zhu
    2007 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1-3, 2007, : 490 - 495
  • [3] Efficient random subspace decision forests with a simple probability dimensionality setting scheme
    Wang, Quan
    Wang, Fei
    Li, Zhongheng
    Jiang, Peilin
    Ren, Fuji
    Nie, Feiping
    INFORMATION SCIENCES, 2023, 638
  • [4] Constructing rough decision forests
    Hu, QH
    Yu, DR
    Wang, MY
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 2, PROCEEDINGS, 2005, 3642 : 147 - 156
  • [5] A Novel Feature Subspace Selection Method in Random Forests for High Dimensional Data
    Wang, Yisen
    Xia, Shu-Tao
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4383 - 4389
  • [6] The application of TOPSIS decision and random forests method in tone recognition
    Guo, Rui
    Zhang, Chi
    Hanachi, Houman
    Liu, Jie
    2018 INTERNATIONAL CONFERENCE ON SENSING, DIAGNOSTICS, PROGNOSTICS, AND CONTROL (SDPC), 2018, : 214 - 220
  • [7] Decision trees and random forests
    Becker, Thijs
    Rousseau, Axel-Jan
    Geubbelmans, Melvin
    Burzykowski, Tomasz
    Valkenborg, Dirk
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2023, 164 (06) : 894 - 897
  • [8] Constructing Regression Models with High Prediction Accuracy and Interpretability Based on Decision Tree and Random Forests
    Shimizu, Naoto
    Kaneko, Hiromasa
    JOURNAL OF COMPUTER CHEMISTRY-JAPAN, 2021, 20 (02) : 71 - 87
  • [9] Boosting random subspace method
    Garcia-Pedrajas, Nicolas
    Ortiz-Boyer, Domingo
    NEURAL NETWORKS, 2008, 21 (09) : 1344 - 1362
  • [10] The deterministic subspace method for constructing classifier ensembles
    Michał Koziarski
    Bartosz Krawczyk
    Michał Woźniak
    Pattern Analysis and Applications, 2017, 20 : 981 - 990