Smart training: Mask R-CNN oriented approach

被引:3
|
作者
Su, Mu-Chun [1 ]
Chen, Jieh-Haur [2 ,3 ]
Azzizi, Vidya Trisandini [3 ]
Chang, Hsiang-Ling [1 ]
Wei, Hsi-Hsien [4 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan
[2] Natl Cent Univ, Res Ctr Smart Construct, Taoyuan 32001, Taiwan
[3] Natl Cent Univ, Dept Civil Engn, Taoyuan 32001, Taiwan
[4] Hong Kong Polytech Univ, Dept Bldg & Real Estate, Hung Hom, Kowloon, Hong Kong, Peoples R China
关键词
Smart training; Augmented reality; Hand gesture recognition; Finger-pointing analysis; Mask Regions with Convolutional Neural; Network (R-CNN); AUGMENTED REALITY; RECOGNITION; SERVICES;
D O I
10.1016/j.eswa.2021.115595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is aimed at the usage of an augmented reality assisted system set up on the smart-glasses for training activities. Literature review leads us to a comparison among related technologies, yielding that Mask Regions with Convolutional Neural Network (R-CNN) oriented approach fits the study needs. The proposed method including (1) pointing gesture capture, (2) finger-pointing analysis, and (3) virtual tool positioning and rotation angle are developed. Results show that the recognition of object detection is 95.5%, the Kappa value of recognition of gesture detection is 0.93, and the average time for detecting pointing gesture is 0.26 seconds. Furthermore, even under different lighting, such as indoor and outdoor, the pointing analysis accuracy is up to 79%. The error between the analysis angle and the actual angle is only 1.32 degrees. The results proved that the system is well suited to present the effect of augmented reality, making it applicable for real world usage.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Mask R-CNN
    He, Kaiming
    Gkioxari, Georgia
    Dollar, Piotr
    Girshick, Ross
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2980 - 2988
  • [2] Mask R-CNN
    He, Kaiming
    Gkioxari, Georgia
    Dollar, Piotr
    Girshick, Ross
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 386 - 397
  • [3] Oriented Cascade Mask R-CNN for Biomedical Image Segmentation
    Zhang, Bing
    Xu, Li
    [J]. 2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2372 - 2377
  • [4] Mask Scoring R-CNN
    Huang, Zhaojin
    Huang, Lichao
    Gong, Yongchao
    Huang, Chang
    Wang, Xinggang
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6402 - 6411
  • [5] Faster training of Mask R-CNN by focusing on instance boundaries
    Zimmermann, Roland S.
    Siems, Julien N.
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 188
  • [6] Nuclei R-CNN: Improve Mask R-CNN for Nuclei Segmentation
    Lv, Guofeng
    Wen, Ke
    Wu, Zheng
    Jin, Xu
    An, Hong
    He, Jie
    [J]. 2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 357 - 362
  • [7] IEMask R-CNN: Information-Enhanced Mask R-CNN
    Bi, Xiuli
    Hu, Jinwu
    Xiao, Bin
    Li, Weisheng
    Gao, Xinbo
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (02) : 688 - 700
  • [8] SE-Mask R-CNN: An improved Mask R-CNN for apple detection and segmentation
    Liu, Yikun
    Yang, Gongping
    Huang, Yuwen
    Yin, Yilong
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 6715 - 6725
  • [9] Oriented R-CNN and Beyond
    Xie, Xingxing
    Cheng, Gong
    Wang, Jiabao
    Li, Ke
    Yao, Xiwen
    Han, Junwei
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2420 - 2442
  • [10] Mask R-CNN for Ear Detection
    Bizjak, Matic
    Peer, Peter
    Emersic, Ziga
    [J]. 2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1624 - 1628