Scalable Inference of Decision Tree Ensembles: Flexible Design for CPU-FPGA Platforms

被引:0
|
作者
Owaida, Muhsen [1 ]
Zhang, Hantian [1 ]
Zhang, Ce [1 ]
Alonso, Gustavo [1 ]
机构
[1] ETH, Syst Grp, Dept Comp Sci, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Decision tree ensembles are commonly used in a wide range of applications and becoming the de facto algorithm for decision tree based classifiers. Different trees in an ensemble can be processed in parallel during tree inference, making them a suitable use case for FPGAs. Large tree ensembles, however, require careful mapping of trees to on-chip memory and management of memory accesses. As a result, existing FPGA solutions suffer from the inability to scale beyond tens of trees and lack the flexibility to support different tree ensembles. In this paper we present an FPGA tree ensemble classifier together with a software driver to efficiently manage the FPGA's memory resources. The classifier architecture efficiently utilizes the FPGA's resources to fit half a million tree nodes in on-chip memory, delivering up to 20x speedup over a 10-threaded CPU implementation when fully processing the tree ensemble on the FPGA. It can also combine the CPU and FPGA to scale to tree ensembles that do not fit in on-chip memory, achieving up to an order of magnitude speedup compared to a pure CPU implementation. In addition, the classifier architecture can be programmed at runtime to process varying tree ensemble sizes.
引用
收藏
页数:8
相关论文
共 45 条
  • [1] A Quantitative Analysis on Microarchitectures of Modern CPU-FPGA Platforms
    Choi, Young-Kyu
    Cong, Jason
    Fang, Zhenman
    Hao, Yuchen
    Reinman, Glenn
    Wei, Peng
    2016 ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2016,
  • [2] Accelerating Real-Valued FFT on CPU-FPGA Platforms
    Qian, Zhuo
    Gan, Guoyou
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (08) : 2532 - 2536
  • [3] Accelerating Proximal Policy Optimization on CPU-FPGA Heterogeneous Platforms
    Meng, Yuan
    Kuppannagari, Sanmukh
    Prasanna, Viktor
    28TH IEEE INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2020, : 19 - 27
  • [4] HeteroKV: A Scalable Line-rate Key-Value Store on Heterogeneous CPU-FPGA Platforms
    Yang, Haichang
    Li, Zhaoshi
    Wang, Jiawei
    Yin, Shouyi
    Wei, Shaojun
    Liu, Leibo
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 834 - 837
  • [5] A Hybrid Approach to Cache Management in Heterogeneous CPU-FPGA Platforms
    Feng, Liang
    Sinha, Sharad
    Zhang, Wei
    Liang, Yun
    2017 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2017, : 937 - 944
  • [6] GraphACT: Accelerating GCN Training on CPU-FPGA Heterogeneous Platforms
    Zeng, Hanqing
    Prasanna, Viktor
    2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 255 - 265
  • [7] NEURAGHE: Exploiting CPU-FPGA Synergies for Efficient and Flexible CNN Inference Acceleration on Zynq SoCs
    Meloni, Paolo
    Capotondi, Alessandro
    Deriu, Gianfranco
    Brian, Michele
    Conti, Francesco
    Rossi, Davide
    Raffo, Luigi
    Benini, Luca
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2018, 11 (03)
  • [8] Application Partitioning on FPGA Clusters: Inference over Decision Tree Ensembles
    Owaida, Muhsen
    Alonso, Gustavo
    2018 28TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2018, : 295 - 300
  • [9] CAMAS: Static and Dynamic Hybrid Cache Management for CPU-FPGA Platforms
    Feng, Liang
    Sinha, Sharad
    Zhang, Wei
    Liang, Yun
    PROCEEDINGS 26TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2018), 2018, : 165 - 172
  • [10] In-Depth Analysis on Microarchitectures of Modern Heterogeneous CPU-FPGA Platforms
    Choi, Young-Kyu
    Cong, Jason
    Fang, Zhenman
    Hao, Yuchen
    Reinman, Glenn
    Wei, Peng
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2019, 12 (01)