Convolutional neural network with spatial pyramid pooling for hand gesture recognition

被引:0
|
作者
Yong Soon Tan
Kian Ming Lim
Connie Tee
Chin Poo Lee
Cheng Yaw Low
机构
[1] Multimedia University,Faculty of Information Science and Technology (FIST)
来源
关键词
Convolutional neural network (CNN); Spatial pyramid pooling (SPP); Hand gesture recognition; Sign language recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Hand gesture provides a means for human to interact through a series of gestures. While hand gesture plays a significant role in human–computer interaction, it also breaks down the communication barrier and simplifies communication process between the general public and the hearing-impaired community. This paper outlines a convolutional neural network (CNN) integrated with spatial pyramid pooling (SPP), dubbed CNN–SPP, for vision-based hand gesture recognition. SPP is discerned mitigating the problem found in conventional pooling by having multi-level pooling stacked together to extend the features being fed into a fully connected layer. Provided with inputs of varying sizes, SPP also yields a fixed-length feature representation. Extensive experiments have been conducted to scrutinize the CNN–SPP performance on two well-known American sign language (ASL) datasets and one NUS hand gesture dataset. Our empirical results disclose that CNN–SPP prevails over other deep learning-driven instances.
引用
收藏
页码:5339 / 5351
页数:12
相关论文
共 50 条
  • [31] A Spatial Pyramid Pooling-Based Deep Convolutional Neural Network for the Classification of Electrocardiogram Beats
    Li, Jia
    Si, Yujuan
    Lang, Liuqi
    Liu, Lixun
    Xu, Tao
    APPLIED SCIENCES-BASEL, 2018, 8 (09):
  • [32] Comprehensive analysis of network robustness evaluation based on convolutional neural networks with spatial pyramid pooling
    Jiang, Wenjun
    Fan, Tianlong
    Li, Changhao
    Zhang, Chuanfu
    Zhang, Tao
    Luo, Zong-fu
    CHAOS SOLITONS & FRACTALS, 2024, 184
  • [33] Hand Gesture Recognition using Convolutional Neural Networks
    Lan, Shengchang
    He, Zonglong
    Chen, Weichu
    Chen, Lijia
    2018 USNC-URSI RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2018, : 147 - 148
  • [34] Hand gesture recognition based on convolutional neural networks
    Hu, Yu-lu
    Wang, Lian-ming
    LIDAR IMAGING DETECTION AND TARGET RECOGNITION 2017, 2017, 10605
  • [35] Human action recognition based on convolutional neural network and spatial pyramid representation
    Xiao, Jihai
    Cui, Xiaohong
    Li, Feng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71 (71)
  • [36] Spatial-temporal pyramid based Convolutional Neural Network for action recognition
    Zheng, Zhenxing
    An, Gaoyun
    Wu, Dapeng
    Ruan, Qiuqi
    NEUROCOMPUTING, 2019, 358 : 446 - 455
  • [37] TEXT DETECTION BASED ON CONVOLUTIONAL NEURAL NETWORKS WITH SPATIAL PYRAMID POOLING
    Zhu, Rui
    Mao, Xiao-Jiao
    Zhu, Qi-Hai
    Li, Ning
    Yang, Yu-Bin
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1032 - 1036
  • [38] Convolutional Neural Network Based American Sign Language Static Hand Gesture Recognition
    Ahuja, Ravinder
    Jain, Daksh
    Sachdeva, Deepanshu
    Garg, Archit
    Rajput, Chirag
    INTERNATIONAL JOURNAL OF AMBIENT COMPUTING AND INTELLIGENCE, 2019, 10 (03) : 60 - 73
  • [39] Demo: Efficient Convolutional Neural Network for FMCW Radar Based Hand Gesture Recognition
    Cai, Xiaodong
    Ma, Jingyi
    Liu, Wei
    Han, Hemin
    Ma, Lili
    UBICOMP/ISWC'19 ADJUNCT: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2019 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2019, : 17 - 20
  • [40] 3D separable convolutional neural network for dynamic hand gesture recognition
    Hu, Zhongxu
    Hu, Youmin
    Liu, Jie
    Wu, Bo
    Han, Dongmin
    Kurfess, Thomas
    NEUROCOMPUTING, 2018, 318 : 151 - 161