An experience-based policy gradient method for smooth manipulation

被引:0
|
作者
Wang, Yongchao [1 ]
Lan, Xuguang [1 ]
Feng, Chuzhen [1 ]
Wan, Lipeng [1 ]
Li, Jin [1 ]
Liu, Yuwang [1 ]
Li, Decai [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Shaanxi, Peoples R China
关键词
Policy Gradient; Robot Manipulation; Deep Reinforcement Learning;
D O I
10.1109/cyber46603.2019.9066580
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Policy gradient methods have achieved remarkable success in continuous controlling tasks. However, in robotic control, original policy gradient algorithms depend on the first succeed experience which is usually a suboptimal solution. To improve the performance, we propose an experience-based policy gradient method(EBDDPG) which guides the robot to move in a smooth way. Besides, extra OU-noise is added to the action space to improve exploration. We tested our algorithm on Gazebo simulation environment with Baxter robot. The experimental results show our method guides the robot to manipulate more smoothly and improves success rate of grasping tasks.
引用
收藏
页码:93 / 97
页数:5
相关论文
共 50 条
  • [21] Experience-Based Electronic Health Records
    Muro, Naiara
    Sanchez, Eider
    Toro, Carlos
    Carrasco, Eduardo
    Rios, Sebastian A.
    Guijarro, Frank
    Grana, Manuel
    CYBERNETICS AND SYSTEMS, 2016, 47 (1-2) : 126 - 139
  • [22] Choice bracketing and experience-based choice
    Hadar, Liat
    Danziger, Shai
    Morwitz, Vicki G.
    JOURNAL OF BEHAVIORAL DECISION MAKING, 2021, 34 (03) : 405 - 418
  • [23] Tool support for experience-based methodologies
    Henninger, S
    ADVANCES IN LEARNING SOFTWARE ORGANIZATIONS, 2002, 2640 : 44 - 59
  • [24] Experience-based learning in innovation and production
    Pruett, Mark
    Thomas, Howard
    R & D MANAGEMENT, 2008, 38 (02) : 141 - 153
  • [25] Salesforce design with experience-based learning
    Misra, S
    Pinker, EJ
    Shumsky, RA
    IIE TRANSACTIONS, 2004, 36 (10) : 941 - 952
  • [26] Experience-based Optimization of Robotic Perception
    Durner, Maximilian
    Kriegel, Simon
    Riedel, Sebastian
    Brucker, Manuel
    Marton, Zoltan-Csaba
    Balint-Benczedi, Ferenc
    Triebel, Rudolph
    2017 18TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2017, : 32 - 39
  • [27] An experience-based incident response system
    Capuzzi, G.
    Cardinale, E.
    Di Pietro, I.
    Spalazzi, L.
    21ST EUROPEAN CONFERENCE ON MODELLING AND SIMULATION ECMS 2007: SIMULATIONS IN UNITED EUROPE, 2007, : 750 - +
  • [28] Experience-based admission control (EBAC)
    Menth, M
    Milbrandt, J
    Oechsner, S
    ISCC2004: NINTH INTERNATIONAL SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 903 - 910
  • [29] Experience-based issues in construction education
    Singh, Amarjit
    Journal of Professional Issues in Engineering Education and Practice, 1992, 118 (04) : 388 - 402
  • [30] Experience-based design of behaviors in videogames
    Florez Puga, Gonzalo
    Diaz-Agudo, Belen
    Gonzalez-Calero, Pedro
    ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2008, 5239 : 180 - 194