Realization of Random Forest for Real-Time Evaluation through Tree Framing

被引:22
|
作者
Buschjaeger, Sebastian [1 ]
Chen, Kuan-Hsun [2 ]
Chen, Jian-Jia [2 ]
Morik, Katharina [1 ]
机构
[1] TU Dortmund Univ, Artificial Intelligence Unit, Dortmund, Germany
[2] TU Dortmund Univ, Design Automat Embedded Syst Grp, Dortmund, Germany
关键词
random forest; decision trees; caching; computer architecture;
D O I
10.1109/ICDM.2018.00017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The optimization of learning has always been of particular concern for big data analytics. However, the ongoing integration of machine learning models into everyday life also demand the evaluation to be extremely fast and in real-time. Moreover, in the Internet of Things, the computing facilities that run the learned model are restricted. Hence, the implementation of the model application must take the characteristics of the executing platform into account Although there exist some heuristics that optimize the code, principled approaches for fast execution of learned models are rare. In this paper, we introduce a method that optimizes the execution of Decision Trees (DT). Decision Trees form the basis of many ensemble methods, such as Random Forests (RF) or Extremely Randomized Trees (ET). For these methods to work best, trees should be as large as possible. This challenges the data and the instruction cache of modern CPUs and thus demand a more careful memory layout. Based on a probabilistic view of decision tree execution, we optimize the two most common implementation schemes of decision trees. We discuss the advantages and disadvantages of both implementations and present a theoretically well-founded memory layout which maximizes locality during execution in both cases. The method is applied to three computer architectures, namely ARM (RISC), PPC (Extended RISC) and Intel (CISC) and is automatically adopted to the specific architecture by a code generator. We perform over 1800 experiments on several real-world data sets and report an average speed-up of 2 to 4 across all three architectures by using the proposed memory layout. Moreover, we find that our implementation outperforms sklearn, which was used to train the models by a factor of 1500.
引用
收藏
页码:19 / 28
页数:10
相关论文
共 50 条
  • [41] Real-time multicast tree visualization and monitoring
    Makofske, DB
    Almeroth, KC
    SOFTWARE-PRACTICE & EXPERIENCE, 2000, 30 (09): : 1047 - 1065
  • [43] Real-Time Monitoring of Electrical Faults in Industrial Machinery using IoT and Random Forest Regression
    Kumar, B. V. Praveen
    Sivalakshmi, P.
    Muthumarilakshmi, Dr S.
    Suresh, G.
    Vijayalakshmi, Kuppan
    Srinivasan, C.
    2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, : 425 - 430
  • [44] Random Forest Classification for Automatic Delineation of Myocardium in Real-Time 3D Echocardiography
    Lempitsky, Victor
    Verhoek, Michael
    Noble, J. Alison
    Blake, Andrew
    FUNCTIONAL IMAGING AND MODELING OF THE HEART, PROCEEDINGS, 2009, 5528 : 447 - +
  • [45] Real-Time Anomaly Detection in Network Traffic Using Graph Neural Networks and Random Forest
    Hassan, Waseem
    Hosseini, Seyed Ebrahim
    Pervez, Shahbaz
    INTERNET OF THINGS, SMART SPACES, AND NEXT GENERATION NETWORKS AND SYSTEMS, PT I, NEW2AN 2023, RUSMART 2023, 2024, 14542 : 194 - 207
  • [46] How to Tune a Random Forest for Real-time Segmentation in Safe Human-Robot Collaboration?
    Sharma, Vivek
    Dittrich, Frank
    Yildirim-Yayilgan, Sule
    Imran, Ali Shariq
    Worn, Heinz
    HCI INTERNATIONAL 2015 - POSTERS' EXTENDED ABSTRACTS, PT I, 2015, 528 : 700 - 704
  • [47] A Real-Time Road Boundary Detection Approach in Surface Mine Based on Meta Random Forest
    Ai, Yunfeng
    Song, Ruiqi
    Huang, Chongqing
    Cui, Chenglin
    Tian, Bin
    Chen, Long
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 1989 - 2001
  • [48] A real-time early warning classification method for natural gas leakage based on random forest
    Tan, Qiong
    Fu, Ming
    Wang, Zhengxing
    Yuan, Hongyong
    Sun, Jinhua
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 251
  • [49] Issues in realization of an execution time analyzer for distributed real-time objects
    Kim, KH
    Choi, L
    Kim, MH
    3RD IEEE SYMPOSIUM ON APPLICATION SPECIFIC SYSTEMS AND SOFTWARE ENGINEERING TECHNOLOGY, PROCEEDINGS, 2000, : 171 - 178
  • [50] A Bayesian Network Model for Real-time Crash Prediction Based on Selected Variables by Random Forest
    Wu, Mingxian
    Shan, Donghui
    Wang, Zuo
    Sun, Xiaoduan
    Liu, Jianbei
    Sun, Ming
    2019 5TH INTERNATIONAL CONFERENCE ON TRANSPORTATION INFORMATION AND SAFETY (ICTIS 2019), 2019, : 670 - 677