An FPGA Based Accelerator for Clustering Algorithms With Custom Instructions

被引:13
|
作者
Wang, Chao [1 ]
Gong, Lei [1 ]
Jia, Fahui [2 ]
Zhou, Xuehai [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Peoples R China
基金
美国国家科学基金会;
关键词
Clustering algorithms; Hardware; Field programmable gate arrays; Machine learning algorithms; Arrays; Logic arrays; Acceleration; Accelerators; clustering; custom instructions; machine learning; FPGA;
D O I
10.1109/TC.2020.2995761
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering algorithms are becoming popular and widely applied in many academic fields, such as machine learning, pattern recognition, and artificial intelligence. It has posed significant challenges to accelerate the algorithms due to the explosive data scale and wide variety of applications. However, previous studies mainly focus on the raw speedup with insufficient attention to the flexibility of the accelerator to support various applications. In order to accelerate different clustering algorithms in one accelerator, in this article, we design an accelerating framework based on FPGA for four state-of-the-art clustering methods, including K-means, PAM, SLINK, and DBSCAN algorithms. Moreover, we provide both euclidean and Manhattan distances as similarity metrics in the accelerator design paradigm. Moreover, we provide a custom instruction set to operate the accelerators within each application. In order to evaluate the performance and hardware cost of the accelerator, we constructed a hardware prototype on the state-of-the-art Xilinx FPGA platform. Experimental results demonstrate that the accelerator framework is able to achieve up to 23x speedup than Intel Xeon processor, and is 9.46x more energy efficient than NVIDIA GTX 750 GPU accelerators.
引用
收藏
页码:725 / 732
页数:8
相关论文
共 50 条
  • [1] WooKong: A Ubiquitous Accelerator for Recommendation Algorithms With Custom Instruction Sets on FPGA
    Wang, Chao
    Gong, Lei
    Ma, Xiang
    Li, Xi
    Zhou, Xuehai
    IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (07) : 1071 - 1082
  • [2] DES decoding using FPGA and custom instructions
    Lee, Tai-Chi
    Zeien, Richard
    Roach, Adam
    Robinson, Patrick
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, PROCEEDINGS, 2006, : 575 - +
  • [3] Tracking Accelerator Based on RISC-V Custom Instructions for GNSS Receiver
    Wang, Shaoha
    Ding, Xiangji
    Yang, Ying
    IEICE ELECTRONICS EXPRESS, 2024,
  • [4] Tracking accelerator based on RISC-V custom instructions for GNSS receiver
    Wang, Shaohao
    Ding, Xiangjie
    Yang, Ying
    IEICE ELECTRONICS EXPRESS, 2024, 21 (10): : 5 - 6
  • [5] FPGA Based Custom Accelerator Architecture Framework for Complex Event Processing
    Ekanayaka, Kavinga Upul Bandara
    Pasqual, Ajith
    TENCON 2014 - 2014 IEEE REGION 10 CONFERENCE, 2014,
  • [6] Estimating area costs of custom instructions for FPGA-based reconfigurable processors
    Lam, Siew-Kei
    Srikanthan, Thambipillai
    2007 IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, 2007, : 89 - 94
  • [7] An Unbiased MCMC FPGA-Based Accelerator in the Land of Custom Precision Arithmetic
    Liu, Shuanglong
    Mingas, Grigorios
    Bouganis, Christos-Savvas
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (05) : 745 - 758
  • [8] High level area estimation of custom instructions for FPGA-based reconfigurable processors
    Lam, Siew-Kei
    Li, Wen
    Srikanthan, Thambipillai
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 48 - +
  • [9] FPGA ACCELERATOR FOR PROTEIN STRUCTURE PREDICTION ALGORITHMS
    Jain, Advait
    Gambhir, Pulkit
    Jindal, Priyanka
    Balakrishnan, M.
    Paul, Kolin
    2009 5TH SOUTHERN CONFERENCE ON PROGRAMMABLE LOGIC, PROCEEDINGS, 2009, : 123 - 128
  • [10] Exploiting FPGA-Aware Merging of Custom Instructions for Runtime Reconfiguration
    Lam, Siew-Kei
    Clarke, Christopher T.
    Srikanthan, Thambipillai
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (03)