Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

Authors
Ma, Hang [1]
Zhang, Yuhang [1]
Li, Ziyang [1]
Zhang, Jiaqi [1]
Wu, Xibao [1]
Chen, Wenbai [1]
Affiliation
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China
Keywords
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE
DOI
10.1016/j.compeleceng.2024.109605
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Existing 3C assembly methods rely on precise contact-state models and suffer from low sampling efficiency and poor safety. To address these issues, this paper proposes a manipulator-based 3C assembly strategy using deep reinforcement learning. First, the study constructs a 3C assembly simulation task involving a UR manipulator and flexible printed circuit (FPC) buckling in the MuJoCo environment to mirror real-world assembly conditions. By combining a Gaussian policy network suited to continuous action spaces with the maximum entropy method, which strengthens the algorithm's exploration, the study develops an efficient method for training autonomous assembly policies. The resulting simulation environment accurately simulates key physical parameters such as position, contact force, and torque, and the assembly task is modeled as a Markov decision process. Because FPCs are semi-flexible, the magnitude of the adaptive contact force is controlled to achieve compliant assembly. Comprehensive simulation experiments show that the soft actor-critic (SAC) algorithm proposed in this study enables the robot to complete the 3C assembly task autonomously and compliantly, with good accuracy and stability. The assembly success rate reaches 93%, and after training with the reinforcement learning strategy the contact force stays within the preset range, achieving compliant assembly.
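The abstract's two algorithmic ingredients, a Gaussian policy for continuous actions and a maximum-entropy objective, can be illustrated with a minimal sketch. This is not the authors' implementation: the tanh squashing, the function names `sample_squashed_gaussian` and `soft_reward`, and the temperature value `alpha` are assumptions chosen for illustration, following the way soft actor-critic is commonly described.

```python
import math
import random


def sample_squashed_gaussian(mean, log_std, rng):
    """Sample a bounded action from a diagonal Gaussian policy.

    Returns (action, log_prob). The tanh squashing is an assumption
    made for this sketch, not a detail stated in the abstract.
    """
    action, log_prob = [], 0.0
    for mu, ls in zip(mean, log_std):
        std = math.exp(ls)
        eps = rng.gauss(0.0, 1.0)
        u = mu + std * eps  # reparameterised pre-squash sample
        a = math.tanh(u)    # squash into (-1, 1)
        # Log-density of u under N(mu, std^2), written in terms of eps.
        log_prob += -0.5 * eps * eps - ls - 0.5 * math.log(2.0 * math.pi)
        # Change-of-variables correction for the tanh squashing.
        log_prob -= math.log(1.0 - a * a + 1e-6)
        action.append(a)
    return action, log_prob


def soft_reward(reward, log_prob, alpha):
    """Entropy-augmented reward r - alpha * log pi(a|s)."""
    return reward - alpha * log_prob


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical 3-dimensional action, e.g. a Cartesian displacement command.
    action, logp = sample_squashed_gaussian([0.0, 0.5, -0.5], [-1.0] * 3, rng)
    print(action, logp, soft_reward(1.0, logp, alpha=0.2))
```

The `alpha` term rewards low-probability (more exploratory) actions, which is the sense in which a maximum-entropy method enhances exploration.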
Pages: 14