CAM-Vtrans: real-time sports training utilizing multi-modal robot data

Times cited: 0
Authors
Hong, LinLin [1]
Lee, Sangheang [1]
Song, GuanTing [2]
Affiliations
[1] Jeonju Univ, Coll Phys Educ, Jeonju, Jeonrabug Do, South Korea
[2] Gongqing Inst Sci & Technol, Jiujiang, Jiangxi, Peoples R China
Source
Keywords
assistive robotics; human-machine interaction; balance control; movement recovery; vision-transformer; CLIP; cross-attention; representation; classification
DOI
10.3389/fnbot.2024.1453571
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Introduction: Assistive robots and human-robot interaction have become integral parts of sports training. However, existing methods often fail to provide real-time and accurate feedback, and they often lack integration of comprehensive multi-modal data.
Methods: To address these issues, we propose a groundbreaking and innovative approach: CAM-Vtrans (Cross-Attention Multi-modal Visual Transformer). By leveraging the strengths of state-of-the-art techniques such as Visual Transformers (ViT) and models like CLIP, along with cross-attention mechanisms, CAM-Vtrans harnesses the power of visual and textual information to provide athletes with highly accurate and timely feedback. Through the utilization of multi-modal robot data, CAM-Vtrans offers valuable assistance, enabling athletes to optimize their performance while minimizing potential injury risks. This novel approach represents a significant advancement in the field, offering an innovative solution to overcome the limitations of existing methods and enhance the precision and efficiency of sports training programs.
Pages: 15
Related papers
50 in total
  • [21] A MULTI-MODAL SPECTROSCOPY INSTRUMENT FOR REAL-TIME EARLY DETECTION OF SKIN CANCER
    Sharma, Manu
    Lim, Liang
    Marple, Eric
    Riggs, William
    Tunnell, James W.
    LASERS IN SURGERY AND MEDICINE, 2013, 45 : 39 - 39
  • [22] Live Demonstration: Real-time Multi-modal Hearing Assistive Technology Prototype
    Gogate, Mandar
    Hussain, Adeel
    Dashtipour, Kia
    Hussain, Amir
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [23] CheXstray: A Real-Time Multi-Modal Monitoring Workflow for Medical Imaging AI
    Merkow, Jameson
    Soin, Arjun
    Long, Jin
    Cohen, Joseph Paul
    Saligrama, Smitha
    Bridge, Christopher
    Yang, Xiyu
    Kaiser, Stephen
    Borg, Steven
    Tarapov, Ivan
    Lungren, Matthew P.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT III, 2023, 14222 : 326 - 336
  • [24] A Model for Reconfiguration of Multi-Modal Real-Time Systems under Energy Constraints
    Nassiffe, Riad
    Camponogara, Eduardo
    Lima, George
    2011 BRAZILIAN SYMPOSIUM ON COMPUTING SYSTEM ENGINEERING (SBESC), 2011, : 127 - 132
  • [25] Multi-modal deep-learning model for real-time prediction of recurrence in early-stage esophageal cancer: A multi-modal approach
    Jung, H. A.
    Lee, D.
    Park, B.
    Lee, K.
    Lee, H. Y.
    Kim, T. J.
    Jeon, Y. J.
    Lee, J.
    Cho, J. H.
    Kim, H. K.
    Choi, Y. S.
    Park, S.
    Sun, J-M.
    Lee, S-H.
    Ahn, J. S.
    Ahn, M-J.
    ANNALS OF ONCOLOGY, 2024, 35 : S883 - S883
  • [26] A Multi-modal System for Public Speaking Pilot Study on Evaluation of Real-Time Feedback
    Dermody, Fiona
    Sutherland, Alistair
    Farren, Margaret
    HUMAN-COMPUTER INTERACTION - INTERACT 2015, PT IV, 2015, 9299 : 499 - 501
  • [27] Real-Time Control Strategy of Exoskeleton Locomotion Trajectory Based on Multi-modal Fusion
    Tao Zhen
    Lei Yan
    Journal of Bionic Engineering, 2023, 20 : 2670 - 2682
  • [28] Real-Time Control Strategy of Exoskeleton Locomotion Trajectory Based on Multi-modal Fusion
    Zhen, Tao
    Yan, Lei
    JOURNAL OF BIONIC ENGINEERING, 2023, 20 (06) : 2670 - 2682
  • [29] COSM2IC: Optimizing Real-Time Multi-Modal Instruction Comprehension
    Weerakoon, Dulanga
    Subbaraju, Vigneshwaran
    Tran, Tuan
    Misra, Archan
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 10697 - 10704
  • [30] COMPUTER VISION AND COMPUTATIONAL INTELLIGENCE FOR REAL-TIME MULTI-MODAL SPACE DOMAIN AWARENESS
    Bolden, Mark
    Schumacher, Paul
    Spencer, David
    Hussein, Islam
    Wilkins, Matthew
    Roscoe, Christopher
    SPACEFLIGHT MECHANICS 2017, PTS I - IV, 2017, 160 : 2165 - 2178