Efficient evolution of decision trees via fully matrix-based fitness evaluation

Cited by: 6
Authors
Costa, Vinicius G. [1 ]
Salcedo-Sanz, Sancho [2 ]
Pedreira, Carlos E. [1 ]
Affiliations
[1] Univ Fed Rio de Janeiro, Syst Engn & Comp Sci Dept, Rio De Janeiro, Brazil
[2] Univ Alcala, Dept Signal Proc & Commun, Madrid 28805, Spain
Keywords
Decision trees; Interpretability; Problem encoding; Coral reef optimization; Evolutionary computation; CORAL-REEFS OPTIMIZATION; ALGORITHMS; SELECTION; DESIGN
DOI
10.1016/j.asoc.2023.111045
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Decision Trees (DTs) are a class of supervised learning models that are widely used for both classification and regression. They are well known for their interpretability and robustness, which have kept them popular even 60 years after they were first proposed. However, because traditional tree algorithms use greedy methods that are prone to suboptimality, several works have explored the use of evolutionary algorithms instead. Although these algorithms are often reported to outperform the traditional greedy approach, their computational cost is much higher, since the evolutionary component requires a large number (millions or billions) of function evaluations to produce a single tree. To reduce this computational cost, in this work we propose an encoding that allows DTs to be trained and evaluated using only matrix operations. The proposed procedure is shown to be much faster than the traditional tree implementation for complete trees with depths ranging from 2 to 6 and for datasets ranging in size from 100 to 100,000 observations. In particular, the results show speedups of up to nearly 20 times, especially when the dataset is large and the desired tree is small enough to be interpretable. The proposed procedure also benefits from GPU parallelization, although it still performs well without it. Furthermore, we propose an evolutionary algorithm, called Coral Reef Optimization for Decision Trees (CRO-DT), that integrates this encoding with a pre-existing ensemble algorithm to evolve better univariate trees. The results show that the proposed CRO-DT is competitive with traditional and modern tree algorithms, consistently producing models of good quality across 14 tested UCI datasets. We conclude that, for most relevant situations, the proposed matrix encoding provides significant speedups over the traditional implementation and may also serve as a basis for high-quality evolutionary DT algorithms.
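The abstract does not spell out the encoding itself, so the following is only an illustrative sketch of how a complete univariate tree could be fitted and scored with matrix operations alone. It is not the authors' implementation. The names `path_matrix` and `evaluate_tree`, the heap ordering of the nodes, and the rule "go left when the feature value is at most the threshold" are all assumptions made for this example.

```python
import numpy as np

def path_matrix(depth):
    """Hypothetical helper: rows are leaves, columns are internal nodes (heap order).
    Entry is +1 if the leaf's path goes left at that node, -1 if right, 0 if off-path."""
    n_internal, n_leaves = 2**depth - 1, 2**depth
    M = np.zeros((n_leaves, n_internal))
    for leaf in range(n_leaves):
        node = 0
        for level in range(depth):
            go_right = (leaf >> (depth - 1 - level)) & 1
            M[leaf, node] = -1.0 if go_right else 1.0
            node = 2 * node + 1 + go_right  # heap-order child index
    return M

def evaluate_tree(X, y, attributes, thresholds, depth, n_classes):
    """Assumed encoding: one attribute index and one threshold per internal node."""
    n_internal = 2**depth - 1
    # One-hot attribute selector, so X @ W.T picks the tested feature per node.
    W = np.zeros((n_internal, X.shape[1]))
    W[np.arange(n_internal), attributes] = 1.0
    # Split decisions for every (sample, node) pair: +1 means left, -1 means right.
    Z = np.where(X @ W.T <= thresholds, 1.0, -1.0)
    # A sample reaches a leaf only if it agrees with all `depth` nodes on the path.
    leaf_onehot = (Z @ path_matrix(depth).T == depth).astype(float)
    # Class counts per leaf, then majority label and training accuracy.
    counts = leaf_onehot.T @ np.eye(n_classes)[y]
    leaf_labels = counts.argmax(axis=1)
    accuracy = counts.max(axis=1).sum() / len(y)
    return leaf_labels, accuracy

# Toy XOR data: a depth-2 tree splitting on feature 0, then feature 1.
X = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])
y = np.array([0, 1, 1, 0])
leaf_labels, accuracy = evaluate_tree(
    X, y, attributes=[0, 1, 1], thresholds=np.array([0.5, 0.5, 0.5]),
    depth=2, n_classes=2,
)
print(leaf_labels, accuracy)
```

Because every step is a matrix product or an elementwise comparison, a sketch like this maps directly onto batched CPU or GPU array libraries, which is the kind of workload the abstract reports as benefiting from parallelization.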
Pages: 16