Feature semantic space-based sim2real decision model

被引：3

作者：

Xiao, Wenwen ^{[1
]}

Luo, Xiangfeng ^{[1
]}

Xie, Shaorong ^{[1
]}

机构：

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning; Semantic segmentation; Sim2real; Asychronous multiple deep deterministic policy gradient;

D O I：

10.1007/s10489-022-03566-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

At present, the intelligent decision model of unmanned systems can only be applied to virtual scenes, which makes it difficult to migrate to real scenes because the image gap between virtual scenes and real scenes is relatively large. The main solutions are domain randomization, domain adaptation, and image translation. However, these methods simply add noise and transform the perceptual information and do not consider the semantic information of the agent's perceptual space. This causes the problem of low accuracy in the migration of virtual scene decision models to real scenes. Considering the above problems, we propose a feature semantic space-based sim2real decision model, which includes an environment representation module, policy optimization module and intelligent decision module. The model framework can narrow the image gap between real-world scenes and virtual scenes. First, using the environment representation module, the virtual scene and real scene are simultaneously mapped to the feature semantic space through semantic segmentation. Then, in the policy optimization module, we propose an AMDDPG policy optimization algorithm. The algorithm obtains the local and global experience in the learning process through the global and local network architecture. It also solves the problem of the slow learning rate of sim2real. Finally, in the intelligent decision module, the data in the semantic space integrating virtual scene and real scene features are used as the training data of the agent autonomous decision model. Experimental results confirm that our method has more effective generalization and robustness of the model in the real scene and can be better migrated to the real scene.

引用

页码：4890 / 4906

页数：17

共 50 条

[41] Towards Sim2Real Transfer of Autonomy Algorithms using AutoDRIVE Ecosystem
Samak, Chinmay
Samak, Tanmay
Krovi, Venkat
IFAC PAPERSONLINE, 2023, 56 (03): : 277 - 282
[42] Sim2Real Rope Cutting With a Surgical Robot Using Vision-Based Reinforcement Learning
Haiderbhai, Mustafa
Gondokaryono, Radian
Wu, Andrew
Kahrs, Lueder A.
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 12
[43] Sim2Real Neural Controllers for Physics-Based Robotic Deployment of Deformable Linear Objects
Tong, Dezhong
Choi, Andrew
Qin, Longhui
Huang, Weicheng
Joo, Jungseock
Jawed, Mohammad Khalid
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2024, 43 (06): : 791 - 810
[44] Segmented Encoding for Sim2Real of RL-based End-to-End Autonomous Driving
Chung, Seung-Hwan
Kong, Seung-Hyun
Cho, Sangjae
Nahrendra, I. Made Aswin
2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 1290 - 1296
[45] Object Detection Using Sim2Real Domain Randomization for Robotic Applications
Horvath, Daniel
Erdos, Gabor
Istenes, Zoltan
Horvath, Tomas
Foldi, Sandor
IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (02) : 1225 - 1243
[46] Sim2real for Autonomous Vehicle Control using Executable Digital Twin
Allamaa, Jean Pierre
Patrinos, Panagiotis
Van der Auweraer, Herman
Son, Tong Duy
IFAC PAPERSONLINE, 2022, 55 (24): : 385 - 391
[47] Domain Randomization for Sim2real Transfer of Automatically Generated Grasping Datasets
Huber, Johann
Helenon, Francois
Watrelot, Hippolyte
Ben Amar, Faiz
Doncieux, Stephane
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 4112 - 4118
[48] Research on Force Control Based on Sim2Real Transfer for Stiffness of Thin-walled Parts
Chen P.
Li X.
He X.
Cai Y.
Zhao H.
Ding H.
Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2021, 57 (17): : 53 - 63
[49] Sim2Real Predictivity: Does Evaluation in Simulation Predict Real-World Performance?
Kadian, Abhishek.
Truong, Joanne
Gokaslan, Aaron
Clegg, Alexander
Wijmans, Erik
Lee, Stefan
Savva, Manolis
Chernova, Sonia
Batra, Dhruv
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04): : 6670 - 6677
[50] Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real
Miao, Qinghai
Lv, Yisheng
Huang, Min
Wang, Xiao
Wang, Fei-Yue
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 603 - 631

← 1 2 3 4 5 →