Convolutional neural network with spatial pyramid pooling for hand gesture recognition

Cited by: 0
Authors
Yong Soon Tan
Kian Ming Lim
Connie Tee
Chin Poo Lee
Cheng Yaw Low
Affiliation
[1] Multimedia University, Faculty of Information Science and Technology (FIST)
Source
Keywords
Convolutional neural network (CNN); Spatial pyramid pooling (SPP); Hand gesture recognition; Sign language recognition
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Hand gestures provide a means for humans to interact. Besides playing a significant role in human–computer interaction, hand gestures help break down the communication barrier between the general public and the hearing-impaired community. This paper presents a convolutional neural network (CNN) integrated with spatial pyramid pooling (SPP), dubbed CNN–SPP, for vision-based hand gesture recognition. SPP mitigates a limitation of conventional pooling by stacking multi-level pooling outputs, which extends the features fed into the fully connected layer. SPP also yields a fixed-length feature representation when given inputs of varying sizes. Extensive experiments evaluate CNN–SPP on two well-known American Sign Language (ASL) datasets and one NUS hand gesture dataset. The empirical results show that CNN–SPP outperforms the other deep learning-based methods it is compared against.
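As a rough illustration of the fixed-length property described in the abstract, the pure-Python sketch below max-pools a single 2-D feature map over a pyramid of n x n grids and concatenates the results. The function name `spp_max_pool`, the pyramid levels (1, 2, 4), and the floor/ceil bin-edge rule are assumptions made for illustration; this record does not state the configuration actually used in CNN–SPP.

```python
import math


def spp_max_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool one 2-D feature map into a fixed-length vector.

    For each pyramid level n (an assumed setting), the map is split into an
    n x n grid of bins and the maximum of each bin is kept. The output length
    is sum(n * n for n in levels), independent of the input height and width.
    """
    height, width = len(feature_map), len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            # Bin edges: floor for the start, ceil for the end, so every bin
            # is non-empty and the bins together cover the whole map.
            r0 = (i * height) // n
            r1 = math.ceil((i + 1) * height / n)
            for j in range(n):
                c0 = (j * width) // n
                c1 = math.ceil((j + 1) * width / n)
                pooled.append(
                    max(
                        feature_map[r][c]
                        for r in range(r0, r1)
                        for c in range(c0, c1)
                    )
                )
    return pooled


# Two hypothetical feature maps of different sizes.
small = [[r * 4 + c for c in range(4)] for r in range(4)]  # 4 x 4 map
large = [[r * 7 + c for c in range(7)] for r in range(5)]  # 5 x 7 map

# Both vectors have 1 + 4 + 16 = 21 entries.
print(len(spp_max_pool(small)), len(spp_max_pool(large)))
```

Because the vector length depends only on the pyramid levels, a fully connected layer placed after this step can accept feature maps of any spatial size.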
Pages: 5339-5351
Page count: 12
Related papers
50 in total
  • [1] Convolutional neural network with spatial pyramid pooling for hand gesture recognition
    Tan, Yong Soon
    Lim, Kian Ming
    Tee, Connie
    Lee, Chin Poo
    Low, Cheng Yaw
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): 5339-5351
  • [2] Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling
    Li, Min
    Zheng, Ruirui
    Xu, Shuang
    Fu, Yu
    Huang, Di
    [J]. 2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018
  • [3] Compact Spatial Pyramid Pooling Deep Convolutional Neural Network Based Hand Gestures Decoder
    Ashiquzzaman, Akm
    Lee, Hyunmin
    Kim, Kwangki
    Kim, Hye-Young
    Park, Jaehyung
    Kim, Jinsul
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (21): 1-22
  • [4] CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE FOR HAND GESTURE RECOGNITION
    Pinzon Arenas, Javier Orlando
    Useche Murillo, Paula Catalina
    Jimenez Moreno, Robinson
    [J]. PROCEEDINGS OF THE 2017 IEEE XXIV INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND COMPUTING (INTERCON), 2017
  • [5] Hand Gesture Recognition Using Convolutional Neural Network
    Ahlawat, Savita
    Batra, Vaibhav
    Banerjee, Snehashish
    Saha, Joydeep
    Garg, Aman K.
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56: 179-186
  • [6] A Spatial Pyramid Pooling Convolutional Neural Network for Smoky Vehicle Detection
    Cao, Yichao
    Lu, Chang
    Lu, Xiaobo
    Xia, Xue
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018: 9170-9175
  • [7] Graph Convolutional Neural Network Gesture Recognition Based on Pooling Algorithm
    Chen, Hong
    Qi, Baoqiang
    Zhao, Hongdong
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [8] Evaluation of Robust Spatial Pyramid Pooling Based on Convolutional Neural Network for Traffic Sign Recognition System
    Dewi, Christine
    Chen, Rung-Ching
    Tai, Shao-Kuo
    [J]. ELECTRONICS, 2020, 9 (06)
  • [9] Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition
    Wang, Peng
    Cao, Yuanzhouhan
    Shen, Chunhua
    Liu, Lingqiao
    Shen, Heng Tao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12): 2613-2622
  • [10] DeepScene: Scene classification via convolutional neural network with spatial pyramid pooling
    Yee, Pui Sin
    Lim, Kian Ming
    Lee, Chin Poo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193