Interactive Reinforcement Learning Strategy

被引:1
|
作者
Shi, Zhenjie [1 ]
Ma, Wenming [1 ]
Yin, Shuai [1 ]
Zhang, Hailiang [1 ]
Zhao, Xiaofan [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai, Peoples R China
关键词
Reinforcement learning; interactive learning; path planning; Q-learning;
D O I
10.1109/SWC50871.2021.00075
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The birth of AlphaGo has set off a new wave of reinforcement learning technology. Reinforcement learning has become one of the most popular directions in the field of artificial intelligence. Its essence is the continuous integration and upgrading of various machine learning methods, and the agents continue to trial and error and obtain cumulative rewards. Q-learning is the most commonly used method in reinforcement learning, but it itself has many problems such as less early information, long learning time, low learning efficiency, and repeated trial and error. Therefore, Q-learning cannot be directly applied to the real environment. In response to this problem, the reinforcement learning discussed by the author is an interactive learning method that combines voice commands and Q-learning. This method uses part of the interaction between the agent and the human voice to find a larger target range in the early stage of learning. Then narrow the search range in turn, which can guide the agent to quickly achieve the learning effect and change the blindness of learning. Simulation experiments show that compared with the standard Q-learning algorithm, the proposed algorithm not only improves the convergence speed, shortens the learning time, but also reduces the number of collisions, enabling the agent to quickly find a better collision-free path.
引用
收藏
页码:507 / 512
页数:6
相关论文
共 50 条
  • [41] Application of Reinforcement Learning System to Interactive Digital Art
    Cho, Ok-Hue
    Lee, Won-Hyung
    JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (01): : 99 - 106
  • [42] Reinforcement Learning-Based Interactive Video Search
    Ma, Zhixin
    Wu, Jiaxin
    Hou, Zhijian
    Ngo, Chong-Wah
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 549 - 555
  • [43] Belief revision with reinforcement learning for interactive object recognition
    Leopold, Thomas
    Kern-Isberner, Gabriele
    Peters, Gabriele
    ECAI 2008, PROCEEDINGS, 2008, 178 : 65 - +
  • [44] Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning
    Scherf, Lisa
    Turan, Cigdem
    Koert, Dorothea
    2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 895 - 902
  • [45] Interactive Learning - Implementation of ChatGPT and Reinforcement Learning in Local Energy Trading
    Chen, Yong
    Chen, Guo
    2024 IEEE 34TH AUSTRALASIAN UNIVERSITIES POWER ENGINEERING CONFERENCE, AUPEC 2024, 2024,
  • [46] Equilibrium optimizer of interswarm interactive learning strategy
    Shao, Zhi-Yuan
    Pan, Jeng-Shyang
    Hu, Pei
    Chu, Shu-Chuan
    ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (01)
  • [47] A cooperative learning strategy for interactive video search
    Wei, Shikui
    Zhu, Zhenfeng
    Zhao, Yao
    Liu, Nan
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1601 - 1604
  • [48] The Study on Interactive Learning Strategy in Digital Campus
    Zhang Ling
    Liu Xiumin
    PROCEEDING OF 2012 INTERNATIONAL SYMPOSIUM - EDUCATIONAL RESEARCH AND EDUCATIONAL TECHNOLOGY, 2012, : 11 - +
  • [49] INCIDENCE OF THE USE OF INTERACTIVE MODULES AS A LEARNING STRATEGY
    Finol Valbuena, Loana del Carmen
    Leon Luxardo, Norelis del Carmen
    TELEMATIQUE, 2022, 21 (02): : 28 - 36
  • [50] Self-Augmenting Strategy for Reinforcement Learning
    Huang, Xin
    Xiao, Shuangjiu
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2017), 2017, : 1 - 4