Module-Based Reinforcement Learning: Experiments with a Real Robot

被引:0
|
作者
Zsolt Kalmár
Csaba Szepesvári
András Lőrincz
机构
[1] University of Szeged,Department of Informatics, “József Attila”
[2] Uni versity of Szeged,Research Group on Artificial Intelligence, “József Attila”
[3] University of Szeged,Department of Adaptive Systems, “József Attila”
来源
Autonomous Robots | 1998年 / 5卷
关键词
reinforcement learning; module-based RL; robot learning; problem decomposition; Markovian decision problems; feature space; subgoals; local control; switching control;
D O I
暂无
中图分类号
学科分类号
摘要
The behavior of reinforcement learning (RL) algorithms is best understood in completely observable, discrete-time controlled Markov chains with finite state and action spaces. In contrast, robot-learning domains are inherently continuous both in time and space, and moreover are partially observable. Here we suggest a systematic approach to solve such problems in which the available qualitative and quantitative knowledge is used to reduce the complexity of learning task. The steps of the design process are to: (i) decompose the task into subtasks using the qualitative knowledge at hand; (ii) design local controllers to solve the subtasks using the available quantitative knowledge, and (iii) learn a coordination of these controllers by means of reinforcement learning. It is argued that the approach enables fast, semi-automatic, but still high quality robot-control as no fine-tuning of the local controllers is needed. The approach was verified on a non-trivial real-life robot task. Several RL algorithms were compared by ANOVA and it was found that the model-based approach worked significantly better than the model-free approach. The learnt switching strategy performed comparably to a handcrafted version. Moreover, the learnt strategy seemed to exploit certain properties of the environment which were not foreseen in advance, thus supporting the view that adaptive algorithms are advantageous to nonadaptive ones in complex environments.
引用
收藏
页码:273 / 295
页数:22
相关论文
共 50 条
  • [1] Module-based reinforcement learning: Experiments with a real robot
    Kalmar, Z
    Szepesvari, C
    Lorincz, A
    [J]. MACHINE LEARNING, 1998, 31 (1-3) : 55 - 85
  • [2] Module-based reinforcement learning: Experiments with a real robot
    Kalmar, Z
    Szepesvari, C
    Lorincz, A
    [J]. AUTONOMOUS ROBOTS, 1998, 5 (3-4) : 273 - 295
  • [3] Module-Based Reinforcement Learning: Experiments with a Real Robot
    Zsolt Kalmár
    Csaba Szepesvári
    András Lörincz
    [J]. Machine Learning, 1998, 31 : 55 - 85
  • [4] Middleware architecture for module-based robot
    Choi, Dong-Hee
    Kim, Seong-Hoon
    Lee, Kwang-Koog
    Beak, Bum-Hyun
    Park, Hong-Seong
    [J]. 2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 1072 - +
  • [5] Module-based structure design of wheeled mobile robot
    Luo, Zirong
    Shang, Jianzhong
    Wei, Guowu
    Ren, Lei
    [J]. MECHANICAL SCIENCES, 2018, 9 (01) : 103 - 121
  • [6] A Multi-Functional Module-Based Capsule Robot
    Zheng, Lingling
    Guo, Shuxiang
    Wang, Zixu
    Tamiya, Takashi
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (10) : 12057 - 12067
  • [7] TORP: The Open Robot Project A Framework for Module-Based Robots
    Simoes, Alexandre da Silva
    Colombini, Esther Luna
    Matsuura, Jackson Paul
    Franchin, Marcelo Nicoletti
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2012, 66 (1-2) : 3 - 22
  • [8] The PyBullet module-based approach to control the collaborative YuMi robot
    Michalik, Roman
    Janota, Ales
    [J]. 13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
  • [9] TORP: The Open Robot ProjectA Framework for Module-Based Robots
    Alexandre da Silva Simões
    Esther Luna Colombini
    Jackson Paul Matsuura
    Marcelo Nicoletti Franchin
    [J]. Journal of Intelligent & Robotic Systems, 2012, 66 : 3 - 22
  • [10] Module-Based Static Structural Design of a Modular Reconfigurable Robot
    Mohamed, Richard Phillip
    Xi, Fengfeng
    Finistauri, Allan Daniel
    [J]. JOURNAL OF MECHANICAL DESIGN, 2010, 132 (01) : 0145011 - 0145017