Multimodal Deep Reinforcement Learning for Visual Security of Virtual Reality Applications

被引:0
|
作者
Andam, Amine [1 ]
Bentahar, Jamal [2 ,3 ]
Hedabou, Mustapha [1 ]
机构
[1] Mohammed VI Polytech Univ, Sch Comp Sci, Ben Guerir 43150, Morocco
[2] Khalifa Univ, Res Ctr 6G, Abu Dhabi, U Arab Emirates
[3] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
来源
IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 24期
基金
加拿大自然科学与工程研究理事会;
关键词
Security; Visualization; Avatars; Internet of Things; Web conferencing; Three-dimensional displays; Deep reinforcement learning; Deep reinforcement learning (DRL); multimodal neural network; output security; virtual reality (VR); SELECTIVE ATTENTION; DOMINANCE; ONSETS;
D O I
10.1109/JIOT.2024.3450686
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid development of virtual reality (VR) technologies is bringing unprecedented immersive experiences and unusual digital content. Nevertheless, these advancements introduce new security challenges, especially in safeguarding the visual content displayed by VR devices like VR glasses and head-mounted displays. Most existing approaches for visual output security rely exclusively on numerical data, such as object attributes and overlook the need of visual information necessary for thorough VR protection. Moreover, these approaches typically assume a fixed size input, failing to address the dynamic nature of VR where the number of virtual items is constantly changing. This article presents a multimodal deep reinforcement learning (MMDRL) approach to secure the visual outputs in VR applications. We formalize a Markov decision process (MDP) framework for the MMDRL agent that integrates both numerical and image data into the state space to effectively mitigate visual threats. Furthermore, our MMDRL agent is engineered to handle data of varying sizes, which makes it more suitable for VR environments. Results from our experiments demonstrate the agent's ability to successfully counteract visual attacks, significantly outperforming previous approaches. The ablation study confirms the important role of image data in improving the agent's performance, highlighting the efficacy of our multimodal approach. In addition, we provide a video demonstration to showcase these results. Finally, we open-source our VR testbed and source code for further testing and benchmarking.
引用
收藏
页码:39890 / 39900
页数:11
相关论文
共 50 条
  • [41] Overview of Deep Reinforcement Learning Improvements and Applications
    Zhang, Junjie
    Zhang, Cong
    Chien, Wei-Che
    JOURNAL OF INTERNET TECHNOLOGY, 2021, 22 (02): : 239 - 255
  • [42] Construction of a Virtual Reality Platform for UAV Deep Learning
    Wang, Shubo
    Chen, Jian
    Zhang, Zichao
    Wang, Guangqi
    Tan, Yu
    Zheng, Yongjun
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 3912 - 3916
  • [43] Hand pose estimation in object-interaction based on deep learning for virtual reality applications
    Wu, Min-Yu
    Ting, Pai-Wen
    Tang, Ya-Hui
    Chou, En-Te
    Fu, Li-Chen
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
  • [44] Visuotactile-RL: Learning Multimodal Manipulation Policies with Deep Reinforcement Learning
    Hansen, Johanna
    Hogan, Francois
    Rivkin, Dmitriy
    Meger, David
    Jenkin, Michael
    Dudek, Gregory
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 8298 - 8304
  • [45] Visual Rationalizations in Deep Reinforcement Learning for Atari Games
    Weitkamp, Laurens
    van der Pol, Elise
    Akata, Zeynep
    ARTIFICIAL INTELLIGENCE, BNAIC 2018, 2019, 1021 : 151 - 165
  • [46] Deep Reinforcement Learning with Iterative Shift for Visual Tracking
    Ren, Liangliang
    Yuan, Xin
    Lu, Jiwen
    Yang, Ming
    Zhou, Jie
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 697 - 713
  • [47] Visual Tracking via Hierarchical Deep Reinforcement Learning
    Zhang, Dawei
    Zheng, Zhonglong
    Jia, Riheng
    Li, Minglu
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 3315 - 3323
  • [48] On the Use of Deep Reinforcement Learning for Visual Tracking: A Survey
    Cruciata, Giorgio
    Lo Presti, Liliana
    La Cascia, Marco
    IEEE ACCESS, 2021, 9 : 120880 - 120900
  • [49] Deep Reinforcement Learning for Visual Semantic Navigation with Memory
    de Andrade Santos, Iury Batista
    Romero, Roseli A. F.
    2020 XVIII LATIN AMERICAN ROBOTICS SYMPOSIUM, 2020 XII BRAZILIAN SYMPOSIUM ON ROBOTICS AND 2020 XI WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2020), 2020, : 114 - 119
  • [50] Deep reinforcement learning for optimized visual field analysis
    Tao, Yudong
    Khodeiry, Mohamed
    Ma, Rui
    Alawa, Karam
    Mendoza, Ximena
    Liu, Xiangxiang
    Shyu, Mei-Ling
    Lee, Richard K.
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2021, 62 (08)