YogNet: A two-stream network for realtime multiperson yoga action recognition and posture correction

被引:14
|
作者
Yadav, Santosh Kumar [1 ,2 ]
Agarwal, Aayush [3 ]
Kumar, Ashish [3 ]
Tiwari, Kamlesh [3 ]
Pandey, Hari Mohan [4 ]
Akbar, Shaik Ali [1 ,2 ]
机构
[1] Acad Sci & Innovat Res AcSIR, Ghaziabad 201002, Uttar Pradesh, India
[2] Cent Elect Engn Res Inst CEERI, Cyber Phys Syst, CSIR, Pilani 333031, India
[3] Birla Inst Technol & Sci Pilani, Dept CSIS, Pilani Campus, Pilani 333031, Rajasthan, India
[4] Bournemouth Univ, Dept Comp & informat, Poole, England
关键词
Action recognition; Computer vision; Posture correction; Yoga and exercise;
D O I
10.1016/j.knosys.2022.109097
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Yoga is a traditional Indian exercise. It specifies various body postures called asanas, practicing them is beneficial for the physical, mental, and spiritual well-being. To support the yoga practitioners, there is a need of an expert yoga asanas recognition system that can automatically analyze practitioner's postures and could provide suitable posture correction instructions. This paper proposes YogNet, a multi-person yoga expert system for 20 asanas using a two-stream deep spatiotemporal neural network architecture. The first stream utilizes a keypoint detection approach to detect the practitioner's pose, followed by the formation of bounding boxes across the subject. The model then applies time distributed convolutional neural networks (CNNs) to extract frame-wise postural features, followed by regularized long shortterm memory (LSTM) networks to give temporal predictions. The second stream utilizes 3D-CNNs for spatiotemporal feature extraction from RGB videos. Finally, the scores of two streams are fused using multiple fusion techniques. A yoga asana recognition database (YAR) containing 1206 videos is collected using a single 2D web camera for 367 min with the help of 16 participants and contains four view variations i.e. front, back, left, and right sides. The proposed system is novel as this is the earliest two-stream deep learning-based system that can perform multi-person yoga asanas recognition and correction in realtime. Simulation result reveals that YogNet system achieved 77.29%, 89.29%, and 96.31% accuracies using pose stream, RGB stream, and via fusion of both streams, respectively. These results are impressive and sufficiently high for recommendation towards general adaption of the system.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] An Accurate Device-Free Action Recognition System Using Two-Stream Network
    Sheng, Biyun
    Fang, Yuanrun
    Xiao, Fu
    Sun, Lijuan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (07) : 7930 - 7939
  • [32] Interactive two-stream graph neural network for skeleton-based action recognition
    Yang, Dun
    Zhou, Qing
    Wen, Ju
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (03)
  • [33] Human Action Recognition Combining Sequential Dynamic Images and Two-Stream Convolutional Network
    Zhang Wenqiang
    Wang Zengqiang
    Zhang Liang
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (02)
  • [34] A Two-Stream Network For Driving Hand Gesture Recognition
    Zhou, Yefan
    Lv, Zhao
    Wang, Chaoqun
    Zhang, Shengli
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2020), 2020, : 553 - 560
  • [35] Two-stream spatiotemporal feature fusion for human action recognition
    Abdelbaky, Amany
    Aly, Saleh
    VISUAL COMPUTER, 2021, 37 (07): : 1821 - 1835
  • [36] Early Stopping for Two-Stream Fusion Applied to Action Recognition
    Maia, Helena de Almeida
    Souza, Marcos Roberto E.
    Sousa E Santos, Anderson Carlos
    Mendoza Bobadilla, Julio Cesar
    Vieira, Marcelo Bernardes
    Pedrini, Helio
    COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VISIGRAPP 2020, 2022, 1474 : 319 - 333
  • [37] Two-stream spatiotemporal feature fusion for human action recognition
    Amany Abdelbaky
    Saleh Aly
    The Visual Computer, 2021, 37 : 1821 - 1835
  • [38] Two-stream Graph Attention Convolutional for Video Action Recognition
    Zhang, Deyuan
    Gao, Hongwei
    Dai, Hailong
    Shi, Xiangbin
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 23 - 27
  • [39] SALIENCY-CONTEXT TWO-STREAM CONVNETS FOR ACTION RECOGNITION
    Chen, Quan-Qi
    Liu, Feng
    Li, Xue
    Liu, Bao-Di
    Zhang, Yu-Jin
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3076 - 3080
  • [40] A Novel Scheme for Training Two-Stream CNNs for Action Recognition
    Oves Garcia, Reinier
    Morales, Eduardo F.
    Enrique Sucar, L.
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 729 - 739