Interactive Reinforcement Learning Strategy

被引：1

作者：

Shi, Zhenjie ^{[1
]}

Ma, Wenming ^{[1
]}

Yin, Shuai ^{[1
]}

Zhang, Hailiang ^{[1
]}

Zhao, Xiaofan ^{[1
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai, Peoples R China

来源：

2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021) | 2021年

关键词：

Reinforcement learning; interactive learning; path planning; Q-learning;

D O I：

10.1109/SWC50871.2021.00075

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The birth of AlphaGo has set off a new wave of reinforcement learning technology. Reinforcement learning has become one of the most popular directions in the field of artificial intelligence. Its essence is the continuous integration and upgrading of various machine learning methods, and the agents continue to trial and error and obtain cumulative rewards. Q-learning is the most commonly used method in reinforcement learning, but it itself has many problems such as less early information, long learning time, low learning efficiency, and repeated trial and error. Therefore, Q-learning cannot be directly applied to the real environment. In response to this problem, the reinforcement learning discussed by the author is an interactive learning method that combines voice commands and Q-learning. This method uses part of the interaction between the agent and the human voice to find a larger target range in the early stage of learning. Then narrow the search range in turn, which can guide the agent to quickly achieve the learning effect and change the blindness of learning. Simulation experiments show that compared with the standard Q-learning algorithm, the proposed algorithm not only improves the convergence speed, shortens the learning time, but also reduces the number of collisions, enabling the agent to quickly find a better collision-free path.

引用

页码：507 / 512

页数：6

共 50 条

[41] Application of Reinforcement Learning System to Interactive Digital Art
Cho, Ok-Hue
Lee, Won-Hyung
JOURNAL OF INTERNET TECHNOLOGY, 2013, 14 (01): : 99 - 106
[42] Reinforcement Learning-Based Interactive Video Search
Ma, Zhixin
Wu, Jiaxin
Hou, Zhijian
Ngo, Chong-Wah
MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 549 - 555
[43] Belief revision with reinforcement learning for interactive object recognition
Leopold, Thomas
Kern-Isberner, Gabriele
Peters, Gabriele
ECAI 2008, PROCEEDINGS, 2008, 178 : 65 - +
[44] Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning
Scherf, Lisa
Turan, Cigdem
Koert, Dorothea
2022 IEEE-RAS 21ST INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2022, : 895 - 902
[45] Interactive Learning - Implementation of ChatGPT and Reinforcement Learning in Local Energy Trading
Chen, Yong
Chen, Guo
2024 IEEE 34TH AUSTRALASIAN UNIVERSITIES POWER ENGINEERING CONFERENCE, AUPEC 2024, 2024,
[46] Equilibrium optimizer of interswarm interactive learning strategy
Shao, Zhi-Yuan
Pan, Jeng-Shyang
Hu, Pei
Chu, Shu-Chuan
ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (01)
[47] A cooperative learning strategy for interactive video search
Wei, Shikui
Zhu, Zhenfeng
Zhao, Yao
Liu, Nan
2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1601 - 1604
[48] The Study on Interactive Learning Strategy in Digital Campus
Zhang Ling
Liu Xiumin
PROCEEDING OF 2012 INTERNATIONAL SYMPOSIUM - EDUCATIONAL RESEARCH AND EDUCATIONAL TECHNOLOGY, 2012, : 11 - +
[49] INCIDENCE OF THE USE OF INTERACTIVE MODULES AS A LEARNING STRATEGY
Finol Valbuena, Loana del Carmen
Leon Luxardo, Norelis del Carmen
TELEMATIQUE, 2022, 21 (02): : 28 - 36
[50] Self-Augmenting Strategy for Reinforcement Learning
Huang, Xin
Xiao, Shuangjiu
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2017), 2017, : 1 - 4

← 1 2 3 4 5 →