Action recognition based on multimode fusion for VR online platform

被引:0
|
作者
Xuan Li
Hengxin Chen
Shengdong He
Xinrun Chen
Shuang Dong
Ping Yan
Bin Fang
机构
[1] Chongqing University,College of Computer Science
来源
Virtual Reality | 2023年 / 27卷
关键词
Data augmentation; Action recognition; Virtual reality online platform; Remote education;
D O I
暂无
中图分类号
学科分类号
摘要
The current popular online communication platforms can convey information only in the form of text, voice, pictures, and other electronic means. The richness and reliability of information is not comparable to traditional face-to-face communication. The use of virtual reality (VR) technology for online communication is a viable alternative to face-to-face communication. In the current VR online communication platform, users are in a virtual world in the form of avatars, which can achieve “face-to-face” communication to a certain extent. However, the actions of the avatar do not follow the user, which makes the communication process less realistic. Decision-makers need to make decisions based on the behavior of VR users, but there are no effective methods for action data collection in VR environments. In our work, three modalities of nine actions from VR users are collected using a virtual reality head-mounted display (VR HMD) built-in sensors, RGB cameras and human pose estimation. Using these data and advanced multimodal fusion action recognition networks, we obtained a high accuracy action recognition model. In addition, we take advantage of the VR HMD to collect 3D position data and design a 2D key point augmentation scheme for VR users. Using the augmented 2D key point data and VR HMD sensor data, we can train action recognition models with high accuracy and strong stability. In data collection and experimental work, we focus our research on classroom scenes, and the results can be extended to other scenes.
引用
收藏
页码:1797 / 1812
页数:15
相关论文
共 50 条
  • [1] Action recognition based on multimode fusion for VR online platform
    Li, Xuan
    Chen, Hengxin
    He, Shengdong
    Chen, Xinrun
    Dong, Shuang
    Yan, Ping
    Fang, Bin
    VIRTUAL REALITY, 2023, 27 (03) : 1797 - 1812
  • [2] Human Action Recognition Based on Fusion Features
    Yang, Shiqiang
    Yang, Jiangtao
    Li, Fei
    Fan, Guohao
    Li, Dexin
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 569 - 579
  • [3] Sensor fusion based manipulative action recognition
    Ye Gu
    Meiqin Liu
    Weihua Sheng
    Yongsheng Ou
    Yongqiang Li
    Autonomous Robots, 2021, 45 : 1 - 13
  • [4] Human Action Recognition Based on Multifeature Fusion
    Zhang, Shasha
    Zhang, Weicun
    Li, Yunluo
    PROCEEDINGS OF 2016 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL II, 2016, 405 : 183 - 192
  • [5] Sensor fusion based manipulative action recognition
    Gu, Ye
    Liu, Meiqin
    Sheng, Weihua
    Ou, Yongsheng
    Li, Yongqiang
    AUTONOMOUS ROBOTS, 2021, 45 (01) : 1 - 13
  • [6] Study on recognition of coal and gangue based on multimode feature and image fusion
    Zhao, Lijuan
    Han, Liguo
    Zhang, Haining
    Liu, Zifeng
    Gao, Feng
    Yang, Shijie
    Wang, Yadong
    PLOS ONE, 2023, 18 (02):
  • [7] Online Action Recognition
    Suarez-Hernandez, Alejandro
    Segovia-Aguas, Javier
    Torras, Carme
    Alenya, Guillem
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11981 - 11989
  • [8] Multimode fusion perception for transparent glass recognition
    Zhang, Shixin
    Shan, Jianhua
    Sun, Fuchun
    Fang, Bin
    Yang, Yiyong
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2022, 49 (04): : 625 - 633
  • [9] Online robust action recognition based on a hierarchical model
    Jiang, Xinbo
    Zhong, Fan
    Peng, Qunsheng
    Qin, Xueying
    VISUAL COMPUTER, 2014, 30 (09): : 1021 - 1033
  • [10] Online robust action recognition based on a hierarchical model
    Xinbo Jiang
    Fan Zhong
    Qunsheng Peng
    Xueying Qin
    The Visual Computer, 2014, 30 : 1021 - 1033