Research on 3C compliant assembly strategy method of manipulator based on deep reinforcement learning

被引:0
|
作者
Ma, Hang [1 ]
Zhang, Yuhang [1 ]
Li, Ziyang [1 ]
Zhang, Jiaqi [1 ]
Wu, Xibao [1 ]
Chen, Wenbai [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100101, Peoples R China
关键词
3C assembly task; Reward shaping; Reinforcement learning; Modeling of robotic arm; Physical constraints; DESIGN; STATE;
D O I
10.1016/j.compeleceng.2024.109605
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Addressing the issues of existing 3C assembly methods that rely on precise contact state models, low sampling efficiency, and poor safety, this paper proposes a research method for a manipulator-based 3C assembly strategy utilizing deep reinforcement learning. Initially, the study constructs a simulation task for 3C assembly involving a UR manipulator and flexible printed circuits (FPC) buckling within the MuJoCo development environment to mirror real-world assembly conditions. By incorporating a Gaussian distribution-based policy network suitable for continuous action spaces and employing the maximum entropy method to enhance the algorithm's exploratory capabilities, this study develops an efficient method for training autonomous assembly behavior strategies. We have successfully established a 3C assembly simulation environment that accurately simulates key physical parameters such as position, contact force, and torque, modeling the assembly task as a Markov decision process. Considering the semi-flexible nature of FPC, we control the magnitude of adaptive contact force to achieve compliant assembly of FPCs. Comprehensive simulation experiments demonstrate that the SAC algorithm proposed in this study enables the robot to autonomously and obediently complete the 3C assembly tasks, exhibiting good accuracy and stability. The assembly success rate reaches 93 %, and after training with the reinforcement learning strategy, the contact force meets the preset range, achieving the effect of compliant assembly.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Mastering the Complex Assembly Task With a Dual-Arm Robot Based on Deep Reinforcement Learning: A Novel Reinforcement Learning Method
    Jiang, Daqi
    Wang, Hong
    Lu, Yanzheng
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2023, 30 (02) : 57 - 66
  • [42] Using digital twin to enhance Sim2real transfer for reinforcement learning in 3C assembly
    Mu, Weiwen
    Chen, Wenbai
    Zhou, Huaidong
    Liu, Naijun
    Shi, Haobin
    Li, Jingchen
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2024, 51 (01): : 125 - 133
  • [43] Energy Management Strategy for Hybrid Electric Vehicle Based on the Deep Reinforcement Learning Method
    Chen Z.
    Fang Z.
    Yang R.
    Yu Q.
    Kang M.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2022, 37 (23): : 6157 - 6168
  • [44] Control strategy of robotic manipulator based on multi-task reinforcement learning
    Tao Wang
    Ziming Ruan
    Yuyan Wang
    Chong Chen
    Complex & Intelligent Systems, 2025, 11 (3)
  • [45] COLREGs-compliant multiship collision avoidance based on deep reinforcement learning
    Zhao, Luman
    Roh, Myung-Il
    OCEAN ENGINEERING, 2019, 191
  • [46] A Deep Reinforcement Learning Strategy Combining Expert Experience Guidance for a Fruit-Picking Manipulator
    Liu, Yuqi
    Gao, Po
    Zheng, Change
    Tian, Lijing
    Tian, Ye
    ELECTRONICS, 2022, 11 (03)
  • [47] Live Working Manipulator Control Technology Based on Hierarchical Deep Reinforcement Learning
    Yan D.
    Chen S.
    Peng G.
    Tan Y.
    Zhang Y.
    Wu K.
    Gaodianya Jishu/High Voltage Engineering, 2020, 46 (02): : 459 - 470
  • [48] Lightweight and fast visual detection method for 3C assembly
    Chen, Wenbai
    Yang, Genjian
    Zhang, Bo
    Li, Jingchen
    Wang, Yiqun
    Shi, Haobin
    DISPLAYS, 2024, 82
  • [49] Lightweight and fast visual detection method for 3C assembly
    Chen, Wenbai
    Yang, Genjian
    Zhang, Bo
    Li, Jingchen
    Wang, Yiqun
    Shi, Haobin
    Displays, 2024, 82
  • [50] Research on carbon asset trading strategy based on PSO-VMD and deep reinforcement learning
    Zhang, Jiayang
    Chen, Kaijie
    JOURNAL OF CLEANER PRODUCTION, 2024, 435