Feature semantic space-based sim2real decision model

被引：3

作者：

Xiao, Wenwen ^{[1
]}

Luo, Xiangfeng ^{[1
]}

Xie, Shaorong ^{[1
]}

机构：

[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China

来源：

APPLIED INTELLIGENCE | 2023年 / 53卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning; Semantic segmentation; Sim2real; Asychronous multiple deep deterministic policy gradient;

D O I：

10.1007/s10489-022-03566-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

At present, the intelligent decision model of unmanned systems can only be applied to virtual scenes, which makes it difficult to migrate to real scenes because the image gap between virtual scenes and real scenes is relatively large. The main solutions are domain randomization, domain adaptation, and image translation. However, these methods simply add noise and transform the perceptual information and do not consider the semantic information of the agent's perceptual space. This causes the problem of low accuracy in the migration of virtual scene decision models to real scenes. Considering the above problems, we propose a feature semantic space-based sim2real decision model, which includes an environment representation module, policy optimization module and intelligent decision module. The model framework can narrow the image gap between real-world scenes and virtual scenes. First, using the environment representation module, the virtual scene and real scene are simultaneously mapped to the feature semantic space through semantic segmentation. Then, in the policy optimization module, we propose an AMDDPG policy optimization algorithm. The algorithm obtains the local and global experience in the learning process through the global and local network architecture. It also solves the problem of the slow learning rate of sim2real. Finally, in the intelligent decision module, the data in the semantic space integrating virtual scene and real scene features are used as the training data of the agent autonomous decision model. Experimental results confirm that our method has more effective generalization and robustness of the model in the real scene and can be better migrated to the real scene.

引用

页码：4890 / 4906

页数：17

共 50 条

[31] A SIM2REAL METHOD BASED ON DDQN FOR TRAINING A SELF-DRIVING SCALE CAR
Zhang, Qi
Du, Tao
Tian, Changzheng
MATHEMATICAL FOUNDATIONS OF COMPUTING, 2019, 2 (04): : 315 - 331
[32] length Sim2real kinematics modeling of industrial robots based on FPGA-acceleration
Liu, Wenzheng
Zhao, Chun
Liu, Yue
Wang, Hongwei
Zhao, Wei
Zhang, Heming
ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 77
[33] A feature space-based business model quality evaluation method
Harbin Institute of Technology, School of Computer Science and Technology, West Dazhi Street 92, Harbin, China
J. Compt. Inf. Technol., 2008, 1 (43-55):
[34] Sim2real Learning of Obstacle Avoidance for Robotic Manipulators in Uncertain Environments
Zhang, Tan
Zhang, Kefang
Lin, Jiatao
Louie, Wing-Yue Geoffrey
Huang, Hui
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01): : 65 - 72
[35] Sim2Real When Data Is Scarce: Image Transformation for Industrial Applications
Weisenboehler, Moritz
Augenstein, Philipp
Hein, Bjoern
Wurll, Christian
Furmans, Kai
INTELLIGENT AUTONOMOUS SYSTEMS 18, VOL 2, IAS18-2023, 2024, 794 : 65 - 76
[36] Learning Nonprehensile Dynamic Manipulation: Sim2real Vision-Based Policy With a Surgical Robot
Gondokaryono, Radian
Haiderbhai, Mustafa
Suryadevara, Sai Aneesh
Kahrs, Lueder A.
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6763 - 6770
[37] Sim2Real Rope Cutting With a Surgical Robot Using Vision-Based Reinforcement Learning
Haiderbhai, Mustafa
Gondokaryono, Radian
Wu, Andrew
Kahrs, Lueder A.
IEEE Transactions on Automation Science and Engineering, 2024, : 1 - 12
[38] Adaptability Preserving Domain Decomposition for Stabilizing Sim2Real Reinforcement Learning
Gao, Haichuan
Yang, Zhile
Su, Xin
Tan, Tian
Chen, Feng
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 4403 - 4410
[39] Sim2Real in Endoscopy Segmentation with a Novel Structure Aware Image Translation
Tomasini, Clara
Riazuelo, Luis
Murillo, Ana C.
SIMULATION AND SYNTHESIS IN MEDICAL IMAGING, SASHIMI 2024, 2025, 15187 : 89 - 101
[40] DeepRacer: Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Balaji, Bharathan
Mallya, Sunil
Genc, Sahika
Gupta, Saurabh
Dirac, Leo
Khare, Vineet
Roy, Gourav
Sun, Tao
Tao, Yunzhe
Townsend, Brian
Calleja, Eddie
Muralidhara, Sunil
Karuppasamy, Dhanasekar
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2746 - 2754

← 1 2 3 4 5 →