Gesture Recognition Based on YOLO Algorithm

被引:0
|
作者
Wang F.-H. [1 ,2 ,3 ]
Huang C. [1 ]
Zhao B. [1 ]
Zhang Q. [1 ]
机构
[1] School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing
[2] The Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing
[3] Beijing Engineering Research Center of Industrial Spectrum Imaging, Beijing
关键词
Gesture recognition; Mean average precision; YOLO; YOLOv3-tiny-T algorithm;
D O I
10.15918/j.tbit1001-0645.2019.030
中图分类号
学科分类号
摘要
The application of YOLO (you only look once) algorithm in gesture recognition was studied to improve the speed and accuracy of detection under the background near the skin color, light and shade. Based on the end-to-end detection function, the YOLO algorithm could be arranged to improve operation speed greatly by automatically extracting target feature from convolution neural networks. Considering the excellent performance in target detection process, YOLO algorithm was applied to gesture recognition. Comparing with other application results with YOLO series algorithm, this application result of YOLO algorithm shows better performance in gesture recognition. At the same time, based on a YOLOv3-tiny algorithm, the fast version of YOLOv3 algorithm, a YOLOv3-tiny-T algorithm was proposed. The YOLOv3-tiny-T algorithm can achieve a mean average precision of 92.24% on the UST dataset with five gestures, increasing about 5% combined with YOLOv3-tiny. © 2020, Editorial Department of Transaction of Beijing Institute of Technology. All right reserved.
引用
收藏
页码:873 / 879
页数:6
相关论文
共 20 条
  • [1] A report on deafness and hearing loss, (2015)
  • [2] Kramer J, Leifer L., The talking glove, Acm Sigcaph Computers & the Physically Handicapped, 39, 39, pp. 12-16, (1988)
  • [3] Assan M, Grobel K., Video-based sign language recognition using hidden markov models, Proceedings of International Gesture Workshop, pp. 97-109, (1997)
  • [4] Yang Hongling, Xuan Shibin, Mo Yuanbin, Gesture recognition based on convolutional neural network, Computer Technology and Development, 28, 7, pp. 17-20, (2018)
  • [5] Dominio F, Donadeo M, Zanuttigh P., Combining multiple depth-based descriptors for hand gesture recognition, Pattern Recognition Letters, 50, pp. 101-111, (2014)
  • [6] Zhang Xiaoyan, Static gesture recognition based on skin color and template matching, Electronic World, 15, pp. 7714-7720, (2019)
  • [7] Qiang Yan, Dong Linjia, Zhao Juanjuan, Et al., A fast gesture recognition method based on stacked sparse autoencoders multi-feature fusion, Transactions of Beijing Institute of Technology, 39, 6, pp. 638-643, (2019)
  • [8] Girshick R, Donahue J, Darrell T, Et al., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [9] Girshick R., Fast r-cnn, Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [10] Ren S, He K, Girshick R, Et al., Faster r-cnn: towards real-time object detection with region proposal networks, Proceedings of Advances in Neural Information Processing Systems, pp. 91-99, (2015)