Scalable Inference of Decision Tree Ensembles: Flexible Design for CPU-FPGA Platforms

被引:0
|
作者
Owaida, Muhsen [1 ]
Zhang, Hantian [1 ]
Zhang, Ce [1 ]
Alonso, Gustavo [1 ]
机构
[1] ETH, Syst Grp, Dept Comp Sci, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Decision tree ensembles are commonly used in a wide range of applications and becoming the de facto algorithm for decision tree based classifiers. Different trees in an ensemble can be processed in parallel during tree inference, making them a suitable use case for FPGAs. Large tree ensembles, however, require careful mapping of trees to on-chip memory and management of memory accesses. As a result, existing FPGA solutions suffer from the inability to scale beyond tens of trees and lack the flexibility to support different tree ensembles. In this paper we present an FPGA tree ensemble classifier together with a software driver to efficiently manage the FPGA's memory resources. The classifier architecture efficiently utilizes the FPGA's resources to fit half a million tree nodes in on-chip memory, delivering up to 20x speedup over a 10-threaded CPU implementation when fully processing the tree ensemble on the FPGA. It can also combine the CPU and FPGA to scale to tree ensembles that do not fit in on-chip memory, achieving up to an order of magnitude speedup compared to a pure CPU implementation. In addition, the classifier architecture can be programmed at runtime to process varying tree ensemble sizes.
引用
收藏
页数:8
相关论文
共 45 条
  • [21] System Design of ATSC3.0 Broadcast Gateway Based on CPU-FPGA
    Ding, Jianhao
    Xiong, Shuai
    Liu, Yifan
    He, Dazhi
    Zhang, Wenjun
    Wang, Kaikai
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [22] SecureComm: A Secure Data Transfer Framework for Neural Network Inference on CPU-FPGA Heterogeneous Edge Devices
    Chen, Tian
    Tan, Yu-An
    Li, Chunying
    Zhang, Zheng
    Meng, Weizhi
    Li, Yuanzhang
    IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 2024, 14 (04) : 811 - 822
  • [23] Applying frame layout to hardware design in FPGA for seamless support of cross calls in CPU-FPGA coupling architecture
    Giang Nguyen Thi Huong
    Na, Yeoul
    Kim, Seon Wook
    MICROPROCESSORS AND MICROSYSTEMS, 2011, 35 (05) : 462 - 472
  • [24] STPAcc: Structural TI-Based Pruning for Accelerating Distance-Related Algorithms on CPU-FPGA Platforms
    Wang, Yuke
    Feng, Boyuan
    Li, Gushu
    Deng, Lei
    Xie, Yuan
    Ding, Yufei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (05) : 1358 - 1370
  • [25] Distributed Inference over Decision Tree Ensembles on Clusters of FPGAs
    Owaida, Muhsen
    Kulkarni, Amit
    Alonso, Gustavo
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2019, 12 (04)
  • [26] Exploring HW/SW Co-Design for Video Analysis on CPU-FPGA Heterogeneous Systems
    Zhang, Xiaofan
    Ma, Yuan
    Xiong, Jinjun
    Hwu, Wen-Mei W.
    Kindratenko, Volodymyr
    Chen, Deming
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (06) : 1606 - 1619
  • [27] A Framework for Monte-Carlo Tree Search on CPU-FPGA Heterogeneous Platform via on-chip Dynamic Tree Management
    Meng, Yuan
    Kannan, Rajgopal
    Prasanna, Viktor
    PROCEEDINGS OF THE 2023 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, FPGA 2023, 2023, : 235 - 245
  • [28] gpuRF and gpuERT: Efficient and Scalable GPU Algorithms for Decision Tree Ensembles
    Jansson, Karl
    Sundell, Hakan
    Bostrom, Henrik
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 1612 - 1621
  • [29] FULL-KV: Flexible and Ultra-Low-Latency In-Memory Key-Value Store System Design on CPU-FPGA
    Qiu, Yunhui
    Xie, Jinyu
    Lv, Hankun
    Yin, Wenbo
    Luk, Wai-Shing
    Wang, Lingli
    Yu, Bowei
    Chen, Hua
    Ge, Xianjun
    Liao, Zhijian
    Shi, Xiaozhong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1828 - 1844
  • [30] Accelerating Queries of Big Data Systems by Storage-Side CPU-FPGA Co-Design
    Zhan, Jinyu
    Jiang, Wei
    Li, Ying
    Wu, Junting
    Zhu, Jianping
    Yu, Jinghuan
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (07) : 2128 - 2141