An Efficient Selection-Based kNN Architecture for Smart Embedded Hardware Accelerators

被引:0
|
作者
Younes H. [1 ,2 ]
Ibrahim A. [1 ,2 ]
Rizk M. [2 ]
Valle M. [1 ]
机构
[1] Department of Electrical, Electronic and Telecommunications Engineering and Naval Architecture, University of Genova, Genova
[2] Department of Computer and Communication Engineering, Lebanese International University, Beirut
关键词
Approximate computing; embedded implementation; energy efficiency; fPGA; hardware accelerators; high level synthesis; k-nearest neighbor; real-Time processing; tactile sensing;
D O I
10.1109/OJCAS.2021.3108835
中图分类号
学科分类号
摘要
K-Nearest Neighbor (kNN) is an efficient algorithm used in many applications, e.g., text categorization, data mining, and predictive analysis. Despite having a high computational complexity, kNN is a candidate for hardware acceleration since it is a parallelizable algorithm. This paper presents an efficient novel architecture and implementation for a kNN hardware accelerator targeting modern System-on-Chips (SoCs). The architecture adopts a selection-based sorter dedicated for kNN that outperforms traditional sorters in terms of hardware resources, time latency, and energy efficiency. The kNN architecture has been designed using High-Level Synthesis (HLS) and implemented on the Xilinx Zynqberry platform. Compared to similar state-of-The-Art implementations, the proposed kNN provides speedups between $1.4times $ and $875times $ with 41% to 94% reductions in energy consumption. To further enhance the proposed architecture, algorithmic-level Approximate Computing Techniques (ACTs) have been applied. The proposed approximate kNN implementation accelerates the classification process by $2.3times $ with an average reduced area size of 56% for a real-Time tactile data processing case study. The approximate kNN consumes 69% less energy with an accuracy loss of less than 3% when compared to the proposed Exact kNN. © 2020 IEEE.
引用
收藏
页码:534 / 545
页数:11
相关论文
共 50 条
  • [31] EFFICIENT HARDWARE ARCHITECTURE FOR PARTICLE FILTER BASED OBJECT TRACKING
    Abd El-Halym, Howida A.
    Mahmoud, Imbaby I.
    Habib, S. E. -D.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 4497 - 4500
  • [32] Efficient GHA-Based Hardware Architecture for Texture Classification
    Lin, Shiow-Jyu
    Hung, Yi-Tsan
    Hwang, Wen-Jyi
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II, 2010, 6422 : 203 - 212
  • [33] An Efficient Hardware Architecture for Block Based Image Processing Algorithms
    Kryjak, Tomasz
    Gorgon, Marek
    Komorkiewicz, Mateusz
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 54 - 65
  • [34] An Efficient Embryonic Hardware Architecture based on Network-on-Chip
    Khalil, Kasem
    Eldash, Omar
    Dey, Bappaditya
    Kumar, Ashok
    Bayoumi, Magdy
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 449 - 452
  • [35] Efficient Hardware Approximation for Bit-Decomposition Based Deep Neural Network Accelerators
    Soliman, Taha
    Eldebiky, Amro
    De La Parra, Cecilia
    Guntoro, Andre
    Wehn, Norbert
    2022 IEEE 35TH INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (IEEE SOCC 2022), 2022, : 77 - 82
  • [36] Energy-efficient Feedback Tracking on Embedded Smart Cameras by Hardware-level Optimization
    Casares, Mauricio
    Velipasalar, Senem
    Santinelli, Paolo
    Cucchiara, Rita
    Prati, Andrea
    2011 FIFTH ACM/IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED SMART CAMERAS (ICDSC), 2011,
  • [37] Toward Composing Efficient FPGA-Based Hardware Accelerators for Physics-Based Model Predictive Control Smart Sensor for HEV Battery Cell Management
    Madsen, Anne K.
    Perera, Darshika G.
    IEEE ACCESS, 2023, 11 : 106141 - 106171
  • [38] An Efficient SRAM-Based Reconfigurable Architecture for Embedded Processors
    Tamimi, Sajjad
    Ebrahimi, Zahra
    Khaleghi, Behnam
    Asadi, Hossein
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (03) : 466 - 479
  • [39] Efficient Hardware Architecture for Template Matching-Based Spike Sorting
    Valencia, Daniel
    Alimohammad, Amirhossein
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2019, 13 (03) : 481 - 492
  • [40] An Efficient FPGA-Based Parallel Phase Unwrapping Hardware Architecture
    Chen, Huan-Yuan
    Hsu, Shu-Hao
    Hwang, Wen-Jyi
    Cheng, Chau-Jern
    IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2017, 3 (04): : 996 - 1007