Feature semantic space-based sim2real decision model

被引:3
|
作者
Xiao, Wenwen [1 ]
Luo, Xiangfeng [1 ]
Xie, Shaorong [1 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; Semantic segmentation; Sim2real; Asychronous multiple deep deterministic policy gradient;
D O I
10.1007/s10489-022-03566-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, the intelligent decision model of unmanned systems can only be applied to virtual scenes, which makes it difficult to migrate to real scenes because the image gap between virtual scenes and real scenes is relatively large. The main solutions are domain randomization, domain adaptation, and image translation. However, these methods simply add noise and transform the perceptual information and do not consider the semantic information of the agent's perceptual space. This causes the problem of low accuracy in the migration of virtual scene decision models to real scenes. Considering the above problems, we propose a feature semantic space-based sim2real decision model, which includes an environment representation module, policy optimization module and intelligent decision module. The model framework can narrow the image gap between real-world scenes and virtual scenes. First, using the environment representation module, the virtual scene and real scene are simultaneously mapped to the feature semantic space through semantic segmentation. Then, in the policy optimization module, we propose an AMDDPG policy optimization algorithm. The algorithm obtains the local and global experience in the learning process through the global and local network architecture. It also solves the problem of the slow learning rate of sim2real. Finally, in the intelligent decision module, the data in the semantic space integrating virtual scene and real scene features are used as the training data of the agent autonomous decision model. Experimental results confirm that our method has more effective generalization and robustness of the model in the real scene and can be better migrated to the real scene.
引用
收藏
页码:4890 / 4906
页数:17
相关论文
共 50 条
  • [41] Towards Sim2Real Transfer of Autonomy Algorithms using AutoDRIVE Ecosystem
    Samak, Chinmay
    Samak, Tanmay
    Krovi, Venkat
    IFAC PAPERSONLINE, 2023, 56 (03): : 277 - 282
  • [42] Sim2Real Rope Cutting With a Surgical Robot Using Vision-Based Reinforcement Learning
    Haiderbhai, Mustafa
    Gondokaryono, Radian
    Wu, Andrew
    Kahrs, Lueder A.
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 12
  • [43] Sim2Real Neural Controllers for Physics-Based Robotic Deployment of Deformable Linear Objects
    Tong, Dezhong
    Choi, Andrew
    Qin, Longhui
    Huang, Weicheng
    Joo, Jungseock
    Jawed, Mohammad Khalid
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2024, 43 (06): : 791 - 810
  • [44] Segmented Encoding for Sim2Real of RL-based End-to-End Autonomous Driving
    Chung, Seung-Hwan
    Kong, Seung-Hyun
    Cho, Sangjae
    Nahrendra, I. Made Aswin
    2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1290 - 1296
  • [45] Object Detection Using Sim2Real Domain Randomization for Robotic Applications
    Horvath, Daniel
    Erdos, Gabor
    Istenes, Zoltan
    Horvath, Tomas
    Foldi, Sandor
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (02) : 1225 - 1243
  • [46] Sim2real for Autonomous Vehicle Control using Executable Digital Twin
    Allamaa, Jean Pierre
    Patrinos, Panagiotis
    Van der Auweraer, Herman
    Son, Tong Duy
    IFAC PAPERSONLINE, 2022, 55 (24): : 385 - 391
  • [47] Domain Randomization for Sim2real Transfer of Automatically Generated Grasping Datasets
    Huber, Johann
    Helenon, Francois
    Watrelot, Hippolyte
    Ben Amar, Faiz
    Doncieux, Stephane
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 4112 - 4118
  • [48] Research on Force Control Based on Sim2Real Transfer for Stiffness of Thin-walled Parts
    Chen P.
    Li X.
    He X.
    Cai Y.
    Zhao H.
    Ding H.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2021, 57 (17): : 53 - 63
  • [49] Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?
    Kadian, Abhishek.
    Truong, Joanne
    Gokaslan, Aaron
    Clegg, Alexander
    Wijmans, Erik
    Lee, Stefan
    Savva, Manolis
    Chernova, Sonia
    Batra, Dhruv
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 6670 - 6677
  • [50] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
    Miao, Qinghai
    Lv, Yisheng
    Huang, Min
    Wang, Xiao
    Wang, Fei-Yue
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 603 - 631