An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引:13
|
作者
Wang, Chao [1 ]
Gong, Lei [1 ]
Jia, Fahui [2 ]
Zhou, Xuehai [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
基金
美国国家科学基金会;
关键词
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
D O I
10.1109/TC.2020.2995761
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.
引用
收藏
页码:725 / 732
页数:8
相关论文
共 50 条
  • [21] Optimizing Density-Based Ant Colony Stream Clustering Using FPGA-Based Hardware Accelerator
    Graf, Jeremy R.
    Perera, Darshika G.
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [22] Prospects of FPGA-Based Custom Supercomputing
    Hariyama M.
    Waidyasooriya H.M.
    Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2019, 73 (01): : 40 - 46
  • [23] Predictive maintenance with custom instructions
    不详
    HYDROCARBON PROCESSING, 2002, 81 (04): : 35 - 36
  • [24] Parallel Enumeration of Custom Instructions Based on Multidepth Graph Partitioning
    Wang, Shanshan
    Xiao, Chenglong
    Liu, Wanjun
    IEEE EMBEDDED SYSTEMS LETTERS, 2019, 11 (01) : 1 - 4
  • [25] FPGA based accelerator for parallel DBSCAN algorithm
    Shi, Shaobo
    Yue, Qi
    Wang, Qin
    Wang, Q. (337816437@qq.com), 1600, Transport and Telecommunication Institute (18): : 135 - 142
  • [26] An FPGA-Based accelerator for multiphysics modeling
    Huang, XM
    Ma, J
    ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 209 - 212
  • [27] Deep Neural Network Accelerator based on FPGA
    Thang Viet Huynh
    2017 4TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2017, : 254 - 257
  • [28] A FPGA based C runtime hardware accelerator
    Garcia, P.
    Salgado, F.
    Cardoso, P.
    Cabral, J.
    Ekpanyapong, M.
    Tavares, A.
    2011 9TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2011,
  • [29] An Accelerator for Decoding Market Data Based on FPGA
    Dou, Yuhao
    Zhou, Yisu
    Xin, Bo
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (03)
  • [30] FPGA based accelerator for visual features detection
    Brenot, Francois
    Fillatreau, Philippe
    Piat, Jonathan
    2015 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2015,