Automated design of adaptive controllers for modular robots using reinforcement learning

Cited by: 31
Authors
Varshavskaya, Paulina [1]
Kaelbling, Leslie Pack [1]
Rus, Daniela [1]
Affiliations
[1] MIT, Comp Sci & AI Lab, Cambridge, MA 02139 USA
Source
Keywords
learning and adaptive systems; cellular and modular robots; animation and simulation
DOI
10.1177/0278364907084983
Chinese Library Classification number
TP24 [Robotics]
Discipline classification codes
080202; 1405
Abstract
Designing distributed controllers for self-reconfiguring modular robots has been consistently challenging. We have developed a reinforcement learning approach that can be used both to automate controller design and to adapt robot behavior on-line. In this paper, we report on our study of reinforcement learning in the domain of self-reconfigurable modular robots: the underlying assumptions, the applicable algorithms, and the issues of partial observability, large search spaces, and local optima. We propose, and validate experimentally in simulation, a number of techniques designed to address these and other scalability issues that arise in applying machine learning to distributed systems such as modular robots. We discuss ways to make learning faster, more robust, and amenable to on-line application by giving scaffolding to the learning agents in the form of policy representation, structured experience, and additional information. With enough structure, modular robots can run learning algorithms both to automate the generation of distributed controllers and to adapt to the changing environment, delivering on the self-organization promise with less interference from human designers, programmers, and operators.
Pages: 505-526
Page count: 22
Related papers
50 in total
  • [31] Automated Testing with Temporal Logic Specifications for Robotic Controllers using Adaptive Experiment Design
    Innes, Craig
    Ramamoorthy, Subramanian
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 6814 - 6821
  • [32] Automated design and optimization of distributed filter circuits using reinforcement learning
    Gao, Peng
    Yu, Tao
    Wang, Fei
    Yuan, Ru-Yue
    [J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2024, 11 (05) : 60 - 76
  • [33] Deep Reinforcement Learning for the Autonomous Adaptive Behavior of Social Robots
    Maroto-Gomez, Marcos
    Malfaz, Maria
    Castro-Gonzalez, Alvaro
    Angel Salichs, Miguel
    [J]. SOCIAL ROBOTICS, ICSR 2022, PT I, 2022, 13817 : 208 - 217
  • [34] Meta Reinforcement Learning for Optimal Design of Legged Robots
    Belmonte-Baeza, Alvaro
    Lee, Joonho
    Valsecchi, Giorgio
    Hutter, Marco
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 12134 - 12141
  • [35] The brainstormers: Design principles of reinforcement learning autonomous robots
    Riedmiller, Martin
    Gabel, Thomas
    Hafner, Roland
    Lange, Sascha
    Lauer, Martin
    [J]. Informatik-Spektrum, 2006, 29 (03) : 175 - 190
  • [36] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
  • [37] Adaptive Modular Reinforcement Learning for Robot Controlled in Multiple Environments
    Iwata, Teppei
    Shibuya, Takeshi
    [J]. IEEE ACCESS, 2021, 9 : 103032 - 103043
  • [38] Modular Robot Design Synthesis with Deep Reinforcement Learning
    Whitman, Julian
    Bhirangi, Raunaq
    Travers, Matthew
    Choset, Howie
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10418 - 10425
  • [39] Adaptive Control of Modular Robots
    Demin, Alexander V.
    Vityaev, Evgenii E.
    [J]. BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES (BICA) FOR YOUNG SCIENTISTS, 2018, 636 : 204 - 212
  • [40] Reinforcement Learning based Design of Linear Fixed Structure Controllers
    Lawrence, Nathan P.
    Stewart, Gregory E.
    Loewen, Philip D.
    Forbes, Michael G.
    Backstrom, Johan U.
    Gopaluni, R. Bhushan
    [J]. IFAC PAPERSONLINE, 2020, 53 (02) : 237 - 242