Feature semantic space-based sim2real decision model

Cited by: 3
Authors
Xiao, Wenwen [1]
Luo, Xiangfeng [1]
Xie, Shaorong [1]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep reinforcement learning; Semantic segmentation; Sim2real; Asynchronous multiple deep deterministic policy gradient;
DOI
10.1007/s10489-022-03566-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
At present, the intelligent decision models of unmanned systems can only be applied to virtual scenes and are difficult to migrate to real scenes, because the image gap between virtual and real scenes is relatively large. The main solutions are domain randomization, domain adaptation, and image translation. However, these methods simply add noise to or transform the perceptual information and do not consider the semantic information of the agent's perceptual space, which leads to low accuracy when virtual-scene decision models are migrated to real scenes. To address these problems, we propose a feature semantic space-based sim2real decision model that comprises an environment representation module, a policy optimization module, and an intelligent decision module. The framework narrows the image gap between real and virtual scenes. First, in the environment representation module, the virtual scene and the real scene are both mapped to the feature semantic space through semantic segmentation. Then, in the policy optimization module, we propose the AMDDPG (asynchronous multiple deep deterministic policy gradient) policy optimization algorithm, which gathers local and global experience during learning through a global and local network architecture and addresses the slow learning rate of sim2real. Finally, in the intelligent decision module, the data in the semantic space, which integrates virtual-scene and real-scene features, are used as the training data for the agent's autonomous decision model. Experimental results confirm that the model generalizes better and is more robust in the real scene, and that it can be migrated to the real scene more effectively.
Pages: 4890-4906
Number of pages: 17