An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引:13
|
作者
Wang, Chao [1 ]
Gong, Lei [1 ]
Jia, Fahui [2 ]
Zhou, Xuehai [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
基金
美国国家科学基金会;
关键词
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
D O I
10.1109/TC.2020.2995761
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.
引用
收藏
页码:725 / 732
页数:8
相关论文
共 50 条
  • [41] An FPGA-based Integrated MapReduce Accelerator Platform
    Kachris, Christoforos
    Diamantopoulos, Dionysios
    Sirakoulis, Georgios Ch.
    Soudris, Dimitrios
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 87 (03): : 357 - 369
  • [42] FPGA-Based Vehicle Detection and Tracking Accelerator
    Zhai, Jiaqi
    Li, Bin
    Lv, Shunsen
    Zhou, Qinglei
    SENSORS, 2023, 23 (04)
  • [43] Architecture of an FPGA Accelerator for LDA-Based Inference
    Ono, Taisuke
    Waidyasooriya, Hasitha Muthumala
    Hariyama, Masanori
    Ishigaki, Tsukasa
    2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 357 - 362
  • [44] An FPGA-based Hardware Accelerator for Iris Segmentation
    Avey, Joe
    Jones, Phillip
    Zambreno, Joseph
    2018 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG), 2018,
  • [45] Custom Instructions for Networked Processor Templates
    Todman, Tim
    Luk, Wayne
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (07) : 3096 - 3100
  • [46] An FPGA-Based Accelerator for Frequent Itemset Mining
    Zhang, Yan
    Zhang, Fan
    Jin, Zheming
    Bakos, Jason D.
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2013, 6 (01)
  • [47] A Market Data Feeds Processing Accelerator Based on FPGA
    Shen, Xiaoyang
    Jiang, Jiang
    Zhou, Liyuan
    Yang, Tianyi
    Chen, Li
    COMPUTER ENGINEERING AND TECHNOLOGY, NCCET 2013, 2013, 396 : 44 - 52
  • [48] Advanced accelerator control and instrumentation modules based on FPGA
    Messmer, P.
    Ranjbar, V.
    Wade-Stein, D.
    Schoessow, P.
    2007 IEEE PARTICLE ACCELERATOR CONFERENCE, VOLS 1-11, 2007, : 1076 - +
  • [49] Packet Filtering for FPGA-Based Routing Accelerator
    Antos, David
    Rehak, Vojtech
    Holub, Petr
    CESNET CONFERENCE 2006: FIRST CESNET CONFERENCE ON ADVANCED COMMUNICATIONS AND GRIDS, 2006, : 161 - 173
  • [50] FPGA-Based Hardware Accelerator for Matrix Inversion
    Kokkiligadda V.S.K.
    Naikoti V.
    Patkotwar G.S.
    Sabat S.L.
    Peesapati R.
    SN Computer Science, 4 (2)