Convolutional neural network with spatial pyramid pooling for hand gesture recognition

被引:0
|
作者
Yong Soon Tan
Kian Ming Lim
Connie Tee
Chin Poo Lee
Cheng Yaw Low
机构
[1] Multimedia University,Faculty of Information Science and Technology (FIST)
来源
关键词
Convolutional neural network (CNN); Spatial pyramid pooling (SPP); Hand gesture recognition; Sign language recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Hand gesture provides a means for human to interact through a series of gestures. While hand gesture plays a significant role in human–computer interaction, it also breaks down the communication barrier and simplifies communication process between the general public and the hearing-impaired community. This paper outlines a convolutional neural network (CNN) integrated with spatial pyramid pooling (SPP), dubbed CNN–SPP, for vision-based hand gesture recognition. SPP is discerned mitigating the problem found in conventional pooling by having multi-level pooling stacked together to extend the features being fed into a fully connected layer. Provided with inputs of varying sizes, SPP also yields a fixed-length feature representation. Extensive experiments have been conducted to scrutinize the CNN–SPP performance on two well-known American sign language (ASL) datasets and one NUS hand gesture dataset. Our empirical results disclose that CNN–SPP prevails over other deep learning-driven instances.
引用
收藏
页码:5339 / 5351
页数:12
相关论文
共 50 条
  • [1] Convolutional neural network with spatial pyramid pooling for hand gesture recognition
    Tan, Yong Soon
    Lim, Kian Ming
    Tee, Connie
    Lee, Chin Poo
    Low, Cheng Yaw
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 5339 - 5351
  • [2] Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling
    Li, Min
    Zheng, Ruirui
    Xu, Shuang
    Fu, Yu
    Huang, Di
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [3] Compact Spatial Pyramid Pooling Deep Convolutional Neural Network Based Hand Gestures Decoder
    Ashiquzzaman, Akm
    Lee, Hyunmin
    Kim, Kwangki
    Kim, Hye-Young
    Park, Jaehyung
    Kim, Jinsul
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 22
  • [4] CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE FOR HAND GESTURE RECOGNITION
    Pinzon Arenas, Javier Orlando
    Useche Murillo, Paula Catalina
    Jimenez Moreno, Robinson
    PROCEEDINGS OF THE 2017 IEEE XXIV INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND COMPUTING (INTERCON), 2017,
  • [5] Hand Gesture Recognition Using Convolutional Neural Network
    Ahlawat, Savita
    Batra, Vaibhav
    Banerjee, Snehashish
    Saha, Joydeep
    Garg, Aman K.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 179 - 186
  • [6] A Spatial Pyramid Pooling Convolutional Neural Network for Smoky Vehicle Detection
    Cao, Yichao
    Lu, Chang
    Lu, Xiaobo
    Xia, Xue
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9170 - 9175
  • [7] Graph Convolutional Neural Network Gesture Recognition Based on Pooling Algorithm
    Chen, Hong
    Qi, Baoqiang
    Zhao, Hongdong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (15)
  • [8] Evaluation of Robust Spatial Pyramid Pooling Based on Convolutional Neural Network for Traffic Sign Recognition System
    Dewi, Christine
    Chen, Rung-Ching
    Tai, Shao-Kuo
    ELECTRONICS, 2020, 9 (06)
  • [9] Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition
    Wang, Peng
    Cao, Yuanzhouhan
    Shen, Chunhua
    Liu, Lingqiao
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (12) : 2613 - 2622
  • [10] DeepScene: Scene classification via convolutional neural network with spatial pyramid pooling
    Yee, Pui Sin
    Lim, Kian Ming
    Lee, Chin Poo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 193