Fast-maneuvering target seeking based on double-action Q-learning

被引:0
|
作者
Ngai, Daniel C. K. [1 ]
Yung, Nelson H. C. [1 ]
机构
[1] Univ Hong Kong, Dept Elect & Elect Engn, Hong Kong, Hong Kong, Peoples R China
来源
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS | 2007年 / 4571卷
关键词
moving object navigation; reinforcement learning; Q-learning;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a reinforcement learning method called DAQL is proposed to solve the problem of seeking and homing onto a fast maneuvering target, within the context of mobile robots. This Q-learning based method considers both target and obstacle actions when determining its own action decisions, which enables the agent to learn more effectively in a dynamically changing environment. It particularly suits fast-maneuvering target cases, in which maneuvers of the target are unknown a priori. Simulation result depicts that the proposed method is able to choose a less convoluted path to reach the target when compared to the ideal proportional navigation (IPN) method in handling fast maneuvering and randomly moving target. Furthermore, it can learn to adapt to the physical limitation of the system and do not require specific initial conditions to be satisfied for successful navigation towards the moving target.
引用
收藏
页码:653 / +
页数:3
相关论文
共 50 条
  • [41] Expertness based cooperative Q-learning
    Ahmadabadi, MN
    Asadpour, M
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2002, 32 (01): : 66 - 76
  • [42] Traffic Signal Control: a Double Q-learning Approach
    Agafonov, Anton
    Myasnikov, Vladislav
    PROCEEDINGS OF THE 2021 16TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2021, : 365 - 369
  • [43] Optimizing Traffic Routes With Enhanced Double Q-Learning
    Patil, Mayur
    Tambolkar, Pooja
    Midlam-Mohler, Shawn
    IET INTELLIGENT TRANSPORT SYSTEMS, 2025, 19 (01)
  • [44] The Mean-Squared Error of Double Q-Learning
    Weng, Wentao
    Gupta, Harsh
    He, Niao
    Ying, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [45] Double Q-learning Agent for Othello Board Game
    Somasundaram, Thamarai Selvi
    Panneerselvam, Karthikeyan
    Bhuthapuri, Tarun
    Mahadevan, Harini
    Jose, Ashik
    2018 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2018, : 216 - 223
  • [46] Q-Learning with Double Progressive Widening: Application to Robotics
    Sokolovska, Nataliya
    Teytaud, Olivier
    Milone, Mario
    NEURAL INFORMATION PROCESSING, PT III, 2011, 7064 : 103 - +
  • [47] Inverted pendulum control of double q-learning reinforcement learning algorithm based on neural network
    Zhang, Daode
    Wang, Xiaolong
    Li, Xuesheng
    Wang, Dong
    UPB Scientific Bulletin, Series D: Mechanical Engineering, 2020, 82 (02): : 15 - 26
  • [48] Dueling double Q-learning based reinforcement learning approach for the flow shop scheduling problem
    Kim S.J.
    Kim B.W.
    Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (10): : 1497 - 1508
  • [49] Glyph-Based Visual Analysis of Q-Learning Based Action Policy Ensembles on Racetrack
    Gross, D.
    Klauck, M.
    Gros, T. P.
    Steinmetz, M.
    Hoffmann, J.
    Gumhold, S.
    2022 26TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2022, : 1 - 10
  • [50] A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
    Hao, Bing
    Du, He
    Yan, Zheping
    OCEAN ENGINEERING, 2023, 270