Fast-maneuvering target seeking based on double-action Q-learning

被引：0

作者：

Ngai, Daniel C. K. ^{[1
]}

Yung, Nelson H. C. ^{[1
]}

机构：

[1] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China

来源：

MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS | 2007年 / 4571卷

关键词：

moving object navigation; reinforcement learning; Q-learning;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a reinforcement learning method called DAQL is proposed to solve the problem of seeking and homing onto a fast maneuvering target, within the context of mobile robots. This Q-learning based method considers both target and obstacle actions when determining its own action decisions, which enables the agent to learn more effectively in a dynamically changing environment. It particularly suits fast-maneuvering target cases, in which maneuvers of the target are unknown a priori. Simulation result depicts that the proposed method is able to choose a less convoluted path to reach the target when compared to the ideal proportional navigation (IPN) method in handling fast maneuvering and randomly moving target. Furthermore, it can learn to adapt to the physical limitation of the system and do not require specific initial conditions to be satisfied for successful navigation towards the moving target.

引用

页码：653 / +

页数：3

共 50 条

[41] Expertness based cooperative Q-learning
Ahmadabadi, MN
Asadpour, M
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
[42] Traffic Signal Control: a Double Q-learning Approach
Agafonov, Anton
Myasnikov, Vladislav
PROCEEDINGS OF THE 2021 16TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2021, : 365 - 369
[43] Optimizing Traffic Routes With Enhanced Double Q-Learning
Patil, Mayur
Tambolkar, Pooja
Midlam-Mohler, Shawn
IET INTELLIGENT TRANSPORT SYSTEMS, 2025, 19 (01)
[44] The Mean-Squared Error of Double Q-Learning
Weng, Wentao
Gupta, Harsh
He, Niao
Ying, Lei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[45] Double Q-learning Agent for Othello Board Game
Somasundaram, Thamarai Selvi
Panneerselvam, Karthikeyan
Bhuthapuri, Tarun
Mahadevan, Harini
Jose, Ashik
2018 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2018, : 216 - 223
[46] Q-Learning with Double Progressive Widening: Application to Robotics
Sokolovska, Nataliya
Teytaud, Olivier
Milone, Mario
NEURAL INFORMATION PROCESSING, PT III, 2011, 7064 : 103 - +
[47] Inverted pendulum control of double q-learning reinforcement learning algorithm based on neural network
Zhang, Daode
Wang, Xiaolong
Li, Xuesheng
Wang, Dong
UPB Scientific Bulletin, Series D: Mechanical Engineering, 2020, 82 (02): : 15 - 26
[48] Dueling double Q-learning based reinforcement learning approach for the flow shop scheduling problem
Kim S.J.
Kim B.W.
Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (10): : 1497 - 1508
[49] Glyph-Based Visual Analysis of Q-Learning Based Action Policy Ensembles on Racetrack
Gross, D.
Klauck, M.
Gros, T. P.
Steinmetz, M.
Hoffmann, J.
Gumhold, S.
2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 1 - 10
[50] A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
Hao, Bing
Du, He
Yan, Zheping
OCEAN ENGINEERING, 2023, 270

← 1 2 3 4 5 →