Realization of Random Forest for Real-Time Evaluation through Tree Framing

Cited by: 22
Authors
Buschjaeger, Sebastian [1]
Chen, Kuan-Hsun [2]
Chen, Jian-Jia [2]
Morik, Katharina [1]
Affiliations
[1] TU Dortmund Univ, Artificial Intelligence Unit, Dortmund, Germany
[2] TU Dortmund Univ, Design Automat Embedded Syst Grp, Dortmund, Germany
Keywords
random forest; decision trees; caching; computer architecture
DOI
10.1109/ICDM.2018.00017
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The optimization of learning has always been of particular concern for big data analytics. However, the ongoing integration of machine learning models into everyday life also demands that evaluation be extremely fast and in real time. Moreover, in the Internet of Things, the computing facilities that run the learned model are restricted. Hence, the implementation of the model application must take the characteristics of the executing platform into account. Although there exist some heuristics that optimize the code, principled approaches for fast execution of learned models are rare. In this paper, we introduce a method that optimizes the execution of Decision Trees (DT). Decision Trees form the basis of many ensemble methods, such as Random Forests (RF) or Extremely Randomized Trees (ET). For these methods to work best, trees should be as large as possible. This challenges the data and instruction caches of modern CPUs and thus demands a more careful memory layout. Based on a probabilistic view of decision tree execution, we optimize the two most common implementation schemes of decision trees. We discuss the advantages and disadvantages of both implementations and present a theoretically well-founded memory layout which maximizes locality during execution in both cases. The method is applied to three computer architectures, namely ARM (RISC), PPC (Extended RISC) and Intel (CISC), and is automatically adapted to the specific architecture by a code generator. We perform over 1800 experiments on several real-world data sets and report an average speed-up by a factor of 2 to 4 across all three architectures when using the proposed memory layout. Moreover, we find that our implementation outperforms sklearn, which was used to train the models, by a factor of 1500.
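The idea of a probability-driven memory layout can be illustrated with a minimal sketch. It is not the paper's algorithm or code generator: it assumes a tree stored as a flat node array, a hypothetical per-node estimate `p_left` of how often the left branch is taken, and a simple greedy ordering that places the more likely child directly after its parent. All names (`Node`, `predict`, `relayout`) and the toy tree are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Node:
    feature: int = -1       # feature index tested here (unused for leaves)
    threshold: float = 0.0  # go left if x[feature] <= threshold
    left: int = -1          # array index of left child (-1 marks a leaf)
    right: int = -1         # array index of right child
    p_left: float = 0.5     # assumed probability of taking the left branch
    label: int = -1         # prediction stored in leaves


def predict(nodes: List[Node], x: Sequence[float]) -> int:
    """Walk the node array from the root (index 0) down to a leaf."""
    i = 0
    while nodes[i].left != -1:
        n = nodes[i]
        i = n.left if x[n.feature] <= n.threshold else n.right
    return nodes[i].label


def relayout(nodes: List[Node]) -> List[Node]:
    """Greedy reordering: the more probable child follows its parent."""
    order: List[int] = []
    stack = [0]
    while stack:
        i = stack.pop()
        order.append(i)
        n = nodes[i]
        if n.left != -1:
            if n.p_left >= 0.5:
                likely, unlikely = n.left, n.right
            else:
                likely, unlikely = n.right, n.left
            stack.append(unlikely)
            stack.append(likely)  # popped next, so stored right after parent
    new_index = {old: new for new, old in enumerate(order)}
    out: List[Node] = []
    for old in order:
        n = nodes[old]
        if n.left != -1:
            out.append(Node(n.feature, n.threshold, new_index[n.left],
                            new_index[n.right], n.p_left, n.label))
        else:
            out.append(Node(label=n.label))
    return out


# Toy tree with made-up branch probabilities (illustration only).
tree = [
    Node(0, 0.5, 1, 2, p_left=0.1),  # root: the right branch is the hot path
    Node(label=0),
    Node(1, 1.5, 3, 4, p_left=0.8),
    Node(label=1),
    Node(label=2),
]
laid = relayout(tree)
```

After reordering, the most frequently visited nodes sit next to each other in the array, so a typical traversal touches fewer cache lines, while `predict` returns the same labels for the original and the reordered array.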
Pages: 19-28
Number of pages: 10