Splitting Choice and Computational Complexity Analysis of Decision Trees

被引:1
|
作者
Zhao, Xin [1 ]
Nie, Xiaokai [2 ]
机构
[1] Southeast Univ, Sch Math, Nanjing 211189, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
decision tree; splitting bias; splitting criteria; computational complexity; noise variable; CLASSIFICATION; SELECTION;
D O I
10.3390/e23101241
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Some theories are explored in this research about decision trees which give theoretical support to the applications based on decision trees. The first is that there are many splitting criteria to choose in the tree growing process. The splitting bias that influences the criterion chosen due to missing values and variables with many possible values has been studied. Results show that the Gini index is superior to entropy information as it has less bias regarding influences. The second is that noise variables with more missing values have a better chance to be chosen while informative variables do not. The third is that when there are many noise variables involved in the tree building process, it influences the corresponding computational complexity. Results show that the computational complexity increase is linear to the number of noise variables. So methods that decompose more information from the original data but increase the variable dimension can also be considered in real applications.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Statistical analysis of various splitting criteria for decision trees
    Aaboub, Fadwa
    Chamlal, Hasna
    Ouaderhman, Tayeb
    [J]. JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [2] Computational Complexity Analysis of Decision Tree Algorithms
    Sani, Habiba Muhammad
    Lei, Ci
    Neagu, Daniel
    [J]. ARTIFICIAL INTELLIGENCE XXXV (AI 2018), 2018, 11311 : 191 - 197
  • [3] Computational Complexity Analysis of Determinant Decision Diagram
    Shi, Guoyong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 57 (10) : 828 - 832
  • [4] Computational Complexity Reduction and Interpretability Improvement of Distance-Based Decision Trees
    Blachnik, Marcin
    Kordos, Miroslaw
    [J]. HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT I, 2012, 7208 : 288 - 297
  • [5] Time complexity of decision trees
    Moshkov, MJ
    [J]. TRANSACTIONS ON ROUGH SETS III, 2005, 3400 : 244 - 459
  • [6] The Computational Complexity of Choice Sets
    Brandt, Felix
    Fischer, Felix
    Harrenstein, Paul
    [J]. MATHEMATICAL LOGIC QUARTERLY, 2009, 55 (04) : 444 - 459
  • [7] Computational Social Choice and Computational Complexity: BFFs?
    Hemaspaandra, Lane A.
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7971 - 7977
  • [8] Mode Choice Analysis Using Random Forrest Decision Trees
    Sekhar, Ch. Ravi
    Minal
    Madhu, E.
    [J]. INTERNATIONAL CONFERENCE ON TRANSPORTATION PLANNING AND IMPLEMENTATION METHODOLOGIES FOR DEVELOPING COUNTRIES (11TH TPMDC) SELECTED PROCEEDINGS, 2016, 17 : 644 - 652
  • [9] Complexity of decision trees for boolean functions
    Freivalds, R
    Miyakawa, M
    Rosenberg, IG
    [J]. 33RD INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC, PROCEEDINGS, 2003, : 253 - 255
  • [10] SOCIAL CHOICE AND COMPUTATIONAL-COMPLEXITY
    KELLY, JS
    [J]. JOURNAL OF MATHEMATICAL ECONOMICS, 1988, 17 (01) : 1 - 8