An experience-based policy gradient method for smooth manipulation

Cited by: 0
Authors
Wang, Yongchao [1]
Lan, Xuguang [1]
Feng, Chuzhen [1]
Wan, Lipeng [1]
Li, Jin [1]
Liu, Yuwang [1]
Li, Decai [1]
Affiliation
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Shaanxi, Peoples R China
Keywords
Policy Gradient; Robot Manipulation; Deep Reinforcement Learning
DOI
10.1109/cyber46603.2019.9066580
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Policy gradient methods have achieved remarkable success in continuous control tasks. In robotic control, however, the original policy gradient algorithms depend on the first successful experience, which is usually a suboptimal solution. To improve performance, we propose an experience-based policy gradient method (EBDDPG) that guides the robot to move smoothly. In addition, extra Ornstein-Uhlenbeck (OU) noise is added to the action space to improve exploration. We tested our algorithm in the Gazebo simulation environment with a Baxter robot. The experimental results show that our method guides the robot to manipulate more smoothly and improves the success rate of grasping tasks.
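The abstract mentions OU noise added to the action space for exploration. As a rough illustration only, the sketch below shows one common way to generate Ornstein-Uhlenbeck noise and add it to a deterministic policy's action. It is not taken from the paper: the class name `OUNoise`, the helper `noisy_action`, the parameter values (`theta`, `sigma`, `dt`), the action bounds, and the 7-dimensional action vector are all assumptions chosen for the example.

```python
import numpy as np


class OUNoise:
    """Discretized Ornstein-Uhlenbeck process (hypothetical helper, not from the paper).

    Update rule: x <- x + theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
    """

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2, seed=None):
        # theta, sigma and dt are illustrative defaults, not values from the paper.
        self.mu = np.full(size, mu, dtype=np.float64)
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self):
        # Restart the process at its long-run mean (typically once per episode).
        self.state = self.mu.copy()

    def sample(self):
        # Mean-reverting drift plus Gaussian diffusion gives temporally correlated noise.
        drift = self.theta * (self.mu - self.state) * self.dt
        diffusion = self.sigma * np.sqrt(self.dt) * self.rng.standard_normal(self.state.shape)
        self.state = self.state + drift + diffusion
        return self.state.copy()


def noisy_action(action, noise, low=-1.0, high=1.0):
    # Add exploration noise, then clip to the (assumed) normalized action bounds.
    return np.clip(action + noise.sample(), low, high)


if __name__ == "__main__":
    noise = OUNoise(size=7, seed=0)   # 7 joints is an assumption for this example
    policy_action = np.zeros(7)       # stand-in for the actor network's output
    print(noisy_action(policy_action, noise))
```

Because successive samples are correlated, the perturbation drifts gradually instead of jumping independently at every step, which is one reason this kind of noise is often paired with deterministic policy gradient methods in continuous control.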
Pages: 93-97
Page count: 5