A data complexity analysis of comparative advantages of decision forest constructors

Cited by: 133
Author
Ho, TK [1]
Affiliation
[1] Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
Keywords
bagging; classifier combination; data complexity; decision forest; decision tree; random subspace method;
DOI
10.1007/s100440200009
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Using a number of measures for characterising the complexity of classification problems, we studied the comparative advantages of two methods for constructing decision forests - bootstrapping and random subspaces. We investigated a collection of 392 two-class problems from the UCI depository, and observed that there are strong correlations between the classifier accuracies and measures of length of class boundaries, thickness of the class manifolds, and nonlinearities of decision boundaries. We found characteristics of both difficult and easy cases where combination methods are no better than single classifiers. Also, we observed that the bootstrapping method is better when the training samples are sparse, and the subspace method is better when the classes are compact and the boundaries are smooth.
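The two forest constructors named in the abstract differ in how each tree's training view is derived: bootstrapping resamples the training rows with replacement, while the random subspace method keeps every row but restricts each tree to a random subset of features. The following is a minimal sketch of that difference only, not the paper's implementation. The function names, the `forest_views` helper, and the default subspace size of half the features are assumptions made for illustration.

```python
import random


def bootstrap_sample(n_samples, rng):
    """Bootstrapping: draw n_samples row indices with replacement."""
    return [rng.randrange(n_samples) for _ in range(n_samples)]


def random_subspace(n_features, k, rng):
    """Random subspace method: choose k distinct feature indices."""
    return sorted(rng.sample(range(n_features), k))


def forest_views(n_samples, n_features, n_trees, method, k=None, seed=0):
    """Return one (row_indices, feature_indices) pair per tree.

    Hypothetical helper: each pair says which rows and which features
    a single tree of the forest would be trained on.
    """
    rng = random.Random(seed)
    all_rows = list(range(n_samples))
    all_features = list(range(n_features))
    views = []
    for _ in range(n_trees):
        if method == "bootstrap":
            # Rows are resampled; every feature is kept.
            views.append((bootstrap_sample(n_samples, rng), all_features))
        elif method == "subspace":
            # Every row is kept; features are subsampled.
            # Assumed default: half of the features, at least one.
            size = k if k is not None else max(1, n_features // 2)
            views.append((all_rows, random_subspace(n_features, size, rng)))
        else:
            raise ValueError("method must be 'bootstrap' or 'subspace'")
    return views
```

Each returned pair could then be used to slice a data matrix before fitting one decision tree, with the forest's prediction formed by combining the trees' outputs.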
Pages: 102-112
Page count: 11
Related papers
  • [1] A Data Complexity Analysis of Comparative Advantages of Decision Forest Constructors
    Tin Kam Ho
    [J]. Pattern Analysis & Applications, 2002, 5 : 102 - 112