Yoga pose classification: a CNN and MediaPipe inspired deep learning approach for real-world application

被引:12
|
作者
Garg S. [1 ]
Saxena A. [1 ]
Gupta R. [1 ]
机构
[1] Department of Computer Science and Engineering, Amity School of Engineering and Technology, Amity University, Uttar Pradesh, Noida
关键词
Classification; Computer vision; Convolutional neural networks; Deep learning; MediaPipe; Skeletonization;
D O I
10.1007/s12652-022-03910-0
中图分类号
学科分类号
摘要
Yoga is a centuries-old style of exercise followed by sports personnel, patients, and physiotherapist as their regime. A correct posture and technique are the key points in yoga to reap the maximum benefits. Hence, developing a model to classify yoga postures correctly is a recently emerging research topic. The paper presents a novel architecture that aims to classify various yoga poses. The proposed model estimates and classifies yoga poses into five broad categories with low latency. In the proposed architecture, the images are skeletonized before inputting into the model. The skeletonization process is done using the MediaPipe library for body keypoint detection. The paper compares the performance of various deep learning models with and without skeletonization. Different learning models showed the optimum result with the training of skeletonized images to the network. The comparison is drawn to establish the positive impact of skeletonization on the results obtained by various models. VGG16 achieves the highest validation accuracy on non-skeletonized images (95.6%), followed by InceptionV3, NASNetMobile, YogaConvo2d (proposed model) (89.9%), and lastly, InceptionResNetV2. In contrast, the proposed model YogaConvo2d using skeletonized images reports a validation accuracy of 99.62%, followed by VGG16, InceptionResNetV2, NASNetMobile, and InceptionV3. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
引用
收藏
页码:16551 / 16562
页数:11
相关论文
共 50 条
  • [1] Three-dimensional CNN-inspired deep learning architecture for Yoga pose recognition in the real-world environment
    Jain, Shrajal
    Rustagi, Aditya
    Saurav, Sumeet
    Saini, Ravi
    Singh, Sanjay
    [J]. NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12): : 6427 - 6441
  • [2] Three-dimensional CNN-inspired deep learning architecture for Yoga pose recognition in the real-world environment
    Shrajal Jain
    Aditya Rustagi
    Sumeet Saurav
    Ravi Saini
    Sanjay Singh
    [J]. Neural Computing and Applications, 2021, 33 : 6427 - 6441
  • [3] Social Media Toxicity Classification Using Deep Learning: Real-World Application UK Brexit
    Fan, Hong
    Du, Wu
    Dahou, Abdelghani
    Ewees, Ahmed A.
    Yousri, Dalia
    Abd Elaziz, Mohamed
    Elsheikh, Ammar H.
    Abualigah, Laith
    Al-qaness, Mohammed A. A.
    [J]. ELECTRONICS, 2021, 10 (11)
  • [4] Exploration of deep learning architectures for real-time yoga pose recognition
    Saurav, Sumeet
    Gidde, Prashant
    Singh, Sanjay
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024,
  • [5] A Real-Time Deep Learning Approach for Real-World Video Anomaly Detection
    Petrocchi, Stefano
    Giorgi, Giacomo
    Cimino, Mario G. C. A.
    [J]. ARES 2021: 16TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, 2021,
  • [6] Real-World Spatial Synchronization of Event-CMOS Cameras through Deep Learning: A Novel CNN-DGCNN Approach
    Mizrahi, Dor
    Laufer, Ilan
    Zuckerman, Inon
    [J]. SENSORS, 2024, 24 (13)
  • [7] Real-world application of Spirometry Quality Control Deep-Learning Algorithm
    Stanojevic, Sanja
    Powell, Pippa
    Hall, Graham
    Topalovic, Marko
    Das, Nilakash
    Ray, Kevin
    Ratjen, Felix
    Jensen, Renee
    Jimenez-Ruiz, Carlos
    Hernandez, Carmen
    Burgos, Felip
    Janssens, Wim
    [J]. EUROPEAN RESPIRATORY JOURNAL, 2021, 58
  • [8] A Bayesian Classification Approach to Improving Performance for a Real-World Sales Forecasting Application
    Gallagher, Claire
    Madden, Michael G.
    D'Arcy, Brian
    [J]. 2015 IEEE 14TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2015, : 475 - 480
  • [9] The Effect of Real-World Interference on CNN Feature Extraction and Machine Learning Classification of Unmanned Aerial Systems
    Swinney, Carolyn J.
    Woods, John C.
    [J]. AEROSPACE, 2021, 8 (07)
  • [10] ConvGRU-CNN: Spatiotemporal Deep Learning for Real-World Anomaly Detection in Video Surveillance System
    Gandapur, Maryam Qasim
    Verdu, Elena
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 88 - 95