Interactive Q-learning on heterogeneous agents system for autonomous adaptive interface

被引:0
|
作者
Ishiwaka, Y [1 ]
Yokoi, H [1 ]
Kakazu, Y [1 ]
机构
[1] Hakodate Natl Coll Technol, Dept Informat Engn, Hakodate, Hokkaido 0428501, Japan
关键词
Interactive Q-learning (IQL); POSMDP; heterogeneous multiagent system; Khepera;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose of this system is to adapt the bedridden people who cannot move their body easily, so the simple reinforcement signals are applied. The application is to control the behaviors of Khepera robot, which is a small mobile robot. For the simple reinforcement signals the on-off signals are employed when the operators as the training agent feels discomfort for the behaviors of the learning agent Khepera robot. We proposed the new reinforcement learning method called Interactive Q-learning and the heterogeneous multi agent system. Our multi agent system has three kinds of heterogeneous single agent: Learning agent, Training agent and Interface Agent. The system is hierarchic. There are also three hierarchies. It is impossible to iterate the many episodes and steps to converge the learning which is adopted in general reinforcement learning in simulation world. We show the results of experiments using the Khepera robot for 3 examinees, and discuss how to give the rewards according to each operator and the significance of heterogeneous multi agent system. We confirmed the effectiveness through the some experiments which are to control the behavior of Khepera robot in real world. The convergences of our teaming system are quite quick. Furthermore the importance of the interface agent is indicated. The individual differences for the timing to give the penalties are happened even though all operators are young.
引用
收藏
页码:475 / 484
页数:10
相关论文
共 50 条
  • [31] Q-Learning applied to the problem of scheduling on heterogeneous architectures
    Hajoui, Younes
    Bouattane, Omar
    Youssfi, Mohamed
    Illoussamen, Elhocein
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (02): : 153 - 159
  • [32] Distributed Q-Learning for Energy Harvesting Heterogeneous Networks
    Miozzo, Marco
    Giupponi, Lorenza
    Rossi, Michele
    Dini, Paolo
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION WORKSHOP (ICCW), 2015, : 2006 - 2011
  • [33] A Modular Autonomous Driving System for Electric Boats based on Fuzzy Controllers and Q-Learning
    Ferrandino, Emanuele
    Capillo, Antonino
    De Santis, Enrico
    Mascioli, Fabio M. F.
    Rizzi, Antonello
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 185 - 195
  • [34] Autonomous Driving in Roundabout Maneuvers Using Reinforcement Learning with Q-Learning
    Garcia Cuenca, Laura
    Puertas, Enrique
    Fernandez Andres, Javier
    Aliane, Nourdine
    ELECTRONICS, 2019, 8 (12)
  • [35] Q-Learning Applied to Genetic Algorithm-Fuzzy Approach for On-Line Control in Autonomous Agents
    Sarmadi, Hengameh
    JOURNAL OF INTELLIGENT SYSTEMS, 2009, 18 (1-2) : 1 - 31
  • [36] Fuzzy adaptive Q-learning method with dynamic learning parameters
    Maeda, Y
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 2778 - 2780
  • [37] Adaptive Learning Recommendation Strategy Based on Deep Q-learning
    Tan, Chunxi
    Han, Ruijian
    Ye, Rougang
    Chen, Kani
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2020, 44 (04) : 251 - 266
  • [38] Acquisition of coordinated behavior by modular Q-learning agents
    Ono, N
    Ikeda, O
    Fukumoto, K
    IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3, 1996, : 1525 - 1529
  • [39] Reinforcement distribution in a team of cooperative Q-learning agents
    Abbasi, Zahra
    Abbasi, Mohammad Ali
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 154 - +
  • [40] Autonomous Decentralized Traffic Control Using Q-Learning in LPWAN
    Kaburaki, Aoto
    Adachi, Koichi
    Takyu, Osamu
    Ohta, Mai
    Fujii, Takeo
    IEEE ACCESS, 2021, 9 : 93651 - 93661