The 3D-DTW Custom IP based FPGA Hardware Acceleration for Action Recognition

Cited by: 1
Authors
Vidhyapathi, C. M. [1]
Raj, Alex Noel Joseph [2]
Sundar, S. [1]
Affiliations
[1] Vellore Inst Technol, Sch Elect Engn, Dept Embedded Technol, Vellore, Tamil Nadu, India
[2] Shantou Univ, Coll Engn, Dept Elect Engn, Shantou, Peoples R China
DOI
10.2352/J.ImagingSci.Technol.2021.65.1.010401
Chinese Library Classification (CLC) number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
This article proposes an implementation of an action recognition system that allows the user to perform operations in real time. The Microsoft Kinect (RGB-D) sensor plays a central role in this system because it provides human skeletal joint information directly. Computationally efficient skeletal joint position features are used to describe each action. The dynamic time warping (DTW) algorithm is widely used in applications such as sequence similarity search, classification, and speech recognition. It provides the highest accuracy compared to all other algorithms. However, the computation time of the DTW algorithm is a major drawback in real-world applications. To speed up the basic DTW algorithm, a novel three-dimensional dynamic time warping (3D-DTW) classification algorithm is proposed in this work. The proposed 3D-DTW algorithm is implemented both in software and in field programmable gate array (FPGA) hardware. The performance of the 3D-DTW algorithm is evaluated for 12 actions, each described by a feature vector of size 576 over 32 frames. The software modeling results show that the proposed algorithm classifies the actions accurately. However, the computation time of the 3D-DTW algorithm increases linearly with either the number of actions or the feature vector size of each action. For further speedup, an efficient custom 3D-DTW intellectual property (IP) core is developed using the Xilinx Vivado high-level synthesis (HLS) tool to accelerate the 3D-DTW algorithm in FPGA hardware. The CPU-centric software model of the 3D-DTW algorithm is compared with its hardware-accelerated custom IP core. The developed 3D-DTW custom IP core is shown to compute 40 times faster than its software counterpart. As the hardware results are promising, a parallel hardware/software co-design architecture is proposed for the Xilinx Zynq-7020 System on Chip (SoC) FPGA for action recognition. HLS simulation and synthesis results are provided to support the practical implementation of the proposed architecture. The proposed approach outperforms many existing state-of-the-art DTW-based action recognition techniques, achieving the highest accuracy of 97.77%. © 2021 Society for Imaging Science and Technology.
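For readers unfamiliar with DTW-based classification, the following is a minimal sketch of the classic (baseline) DTW distance applied to sequences of multi-dimensional frame vectors, with nearest-template classification. It is not the authors' 3D-DTW algorithm, whose details the abstract does not give. The function names `dtw_distance` and `classify`, the Euclidean frame distance, and the toy templates are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Classic DTW cost between two sequences of equal-dimension frame vectors.

    Assumption: frames are compared with the Euclidean distance.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal accumulated cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a repeated against b
                cost[i][j - 1],      # frame of b repeated against a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def classify(query, templates):
    """Return the label of the template closest to the query under DTW."""
    return min(templates, key=lambda label: dtw_distance(query, templates[label]))


# Hypothetical toy data: one 3D joint position per frame.
templates = {
    "wave": [(0, 0, 0), (1, 0, 0), (2, 0, 0)],
    "push": [(0, 0, 0), (0, 1, 0), (0, 2, 0)],
}
query = [(0, 0, 0), (0, 0, 0), (1, 0, 0), (2, 0, 0)]  # "wave" with a held first frame
print(classify(query, templates))
```

Each comparison fills an n by m cost table, and the query is compared against every stored template, so the total time of this baseline grows in proportion to the number of action templates and to the per-frame feature dimension. That matches the linear growth the abstract reports for the software model.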
Pages: 10