Interactive Q-learning on heterogeneous agents system for autonomous adaptive interface

被引:0
|
作者
Ishiwaka, Y [1 ]
Yokoi, H [1 ]
Kakazu, Y [1 ]
机构
[1] Hakodate Natl Coll Technol, Dept Informat Engn, Hakodate, Hokkaido 0428501, Japan
关键词
Interactive Q-learning (IQL); POSMDP; heterogeneous multiagent system; Khepera;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose of this system is to adapt the bedridden people who cannot move their body easily, so the simple reinforcement signals are applied. The application is to control the behaviors of Khepera robot, which is a small mobile robot. For the simple reinforcement signals the on-off signals are employed when the operators as the training agent feels discomfort for the behaviors of the learning agent Khepera robot. We proposed the new reinforcement learning method called Interactive Q-learning and the heterogeneous multi agent system. Our multi agent system has three kinds of heterogeneous single agent: Learning agent, Training agent and Interface Agent. The system is hierarchic. There are also three hierarchies. It is impossible to iterate the many episodes and steps to converge the learning which is adopted in general reinforcement learning in simulation world. We show the results of experiments using the Khepera robot for 3 examinees, and discuss how to give the rewards according to each operator and the significance of heterogeneous multi agent system. We confirmed the effectiveness through the some experiments which are to control the behavior of Khepera robot in real world. The convergences of our teaming system are quite quick. Furthermore the importance of the interface agent is indicated. The individual differences for the timing to give the penalties are happened even though all operators are young.
引用
收藏
页码:475 / 484
页数:10
相关论文
共 50 条
  • [21] Autonomous reconfiguration of robot shape by using Q-learning
    Shiba, Satoshi
    Uchida, Masafumi
    Nozawa, Akio
    Asano, Hirotoshi
    Onogaki, Hitoshi
    Mizuno, Tota
    Ide, Hideto
    Yokoyama, Syuichi
    ARTIFICIAL LIFE AND ROBOTICS, 2009, 14 (02) : 213 - 218
  • [22] Autonomous Warehouse Robot using Deep Q-Learning
    Peyas, Ismot Sadik
    Hasan, Zahid
    Tushar, Md Rafat Rahman
    Al Musabbir
    Azni, Raisa Mehjabin
    Siddique, Shahnewaz
    2021 IEEE REGION 10 CONFERENCE (TENCON 2021), 2021, : 857 - 862
  • [23] Autonomous Exploration for Mobile Robot using Q-learning
    Liu, Yang
    Liu, Huaping
    Wang, Bowen
    2017 2ND INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM), 2017, : 614 - 619
  • [24] Q-Learning for Autonomous Mobile Robot Obstacle Avoidance
    Ribeiro, Tiago
    Goncalves, Fernando
    Garcia, Ines
    Lopes, Gil
    Fernando Ribeiro, A.
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019, : 243 - 249
  • [25] A Q-Learning Solution for Adaptive Video Streaming
    Marinca, Dana
    Barth, Dominique
    De Vleeschauwer, Danny
    2013 INTERNATIONAL CONFERENCE ON SELECTED TOPICS IN MOBILE AND WIRELESS NETWORKING (MOWNET), 2013, : 120 - 126
  • [26] Adaptive Estimation Q-learning with Uncertainty and Familiarity
    Gong, Xiaoyu
    Lu, Shuai
    Yu, Jiayu
    Zhu, Sheng
    Li, Zongze
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 3750 - 3758
  • [27] Q-learning with adaptive state space construction
    Murao, H
    Kitamura, S
    LEARNING ROBOTS, PROCEEDINGS, 1998, 1545 : 13 - 28
  • [28] Q-Learning with adaptive state segmentation (QLASS)
    Murao, H
    Kitamura, S
    1997 IEEE INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN ROBOTICS AND AUTOMATION - CIRA '97, PROCEEDINGS: TOWARDS NEW COMPUTATIONAL PRINCIPLES FOR ROBOTICS AND AUTOMATION, 1997, : 179 - 184
  • [29] An Online Home Energy Management System using Q-Learning and Deep Q-Learning
    Izmitligil, Hasan
    Karamancioglu, Abdurrahman
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2024, 43
  • [30] Q-learning scheduler and load balancer for heterogeneous systems
    Department of Computer Science, International Islamic University, Islamabad, Pakistan
    J. Appl. Sci., 2007, 11 (1504-1510):