Interactive Reinforcement Learning Strategy

被引:1
|
作者
Shi, Zhenjie [1 ]
Ma, Wenming [1 ]
Yin, Shuai [1 ]
Zhang, Hailiang [1 ]
Zhao, Xiaofan [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai, Peoples R China
关键词
Reinforcement learning; interactive learning; path planning; Q-learning;
D O I
10.1109/SWC50871.2021.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The birth of AlphaGo has set off a new wave of reinforcement learning technology. Reinforcement learning has become one of the most popular directions in the field of artificial intelligence. Its essence is the continuous integration and upgrading of various machine learning methods, and the agents continue to trial and error and obtain cumulative rewards. Q-learning is the most commonly used method in reinforcement learning, but it itself has many problems such as less early information, long learning time, low learning efficiency, and repeated trial and error. Therefore, Q-learning cannot be directly applied to the real environment. In response to this problem, the reinforcement learning discussed by the author is an interactive learning method that combines voice commands and Q-learning. This method uses part of the interaction between the agent and the human voice to find a larger target range in the early stage of learning. Then narrow the search range in turn, which can guide the agent to quickly achieve the learning effect and change the blindness of learning. Simulation experiments show that compared with the standard Q-learning algorithm, the proposed algorithm not only improves the convergence speed, shortens the learning time, but also reduces the number of collisions, enabling the agent to quickly find a better collision-free path.
引用
收藏
页码:507 / 512
页数:6
相关论文
共 50 条
  • [31] Transparent Interactive Reinforcement Learning Using Emotional Behaviours
    Angelopoulos, Georgios
    Rossi, Alessandra
    L'Arco, Gianluca
    Rossi, Silvia
    SOCIAL ROBOTICS, ICSR 2022, PT I, 2022, 13817 : 300 - 311
  • [32] Interactive Reinforcement Learning With Bayesian Fusion of Multimodal Advice
    Trick, Susanne
    Herbert, Franziska
    Rothkopf, Constantin A.
    Koert, Dorothea
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 7558 - 7565
  • [33] User modelling using evolutionary interactive reinforcement learning
    Nyongesa, H. O.
    Maleki-dizaji, S.
    INFORMATION RETRIEVAL, 2006, 9 (03): : 343 - 355
  • [34] Interactive Spoken Content Retrieval by Deep Reinforcement Learning
    Lee, Hung-Yi
    Chung, Pei-Hung
    Wu, Yen-Chen
    Lin, Tzu-Hsiang
    Wen, Tsung-Hsien
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2447 - 2459
  • [35] Integration of Jason Reinforcement Learning Agents into an Interactive Application
    Badica, Costin
    Becheru, Alex
    Felton, Samuel
    2017 19TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2017), 2017, : 361 - 368
  • [36] A General Offline Reinforcement Learning Framework for Interactive Recommendation
    Xiao, Teng
    Wang, Donglin
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4512 - 4520
  • [37] Interactive selection of visual features through reinforcement learning
    Jodogne, S
    Piater, JH
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXI, 2005, : 285 - 298
  • [38] Persistent rule-based interactive reinforcement learning
    Adam Bignold
    Francisco Cruz
    Richard Dazeley
    Peter Vamplew
    Cameron Foale
    Neural Computing and Applications, 2023, 35 : 23411 - 23428
  • [39] Planning of interactive information retrieval by means of reinforcement learning
    Vitsentiy, V
    Spink, A
    Sachenko, A
    IDAACS'2003: PROCEEDINGS OF THE SECOND IEEE INTERNATIONAL WORKSHOP ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2003, : 396 - 399
  • [40] Nonstrict Hierarchical Reinforcement Learning for Interactive Systems and Robots
    Cuayahuitl, Heriberto
    Kruijff-Korbayova, Ivana
    Dethlefs, Nina
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2014, 4 (03)