A Reinforcement Learning Approach to the Shepherding Task Using SARSA

Cited by: 0
Authors
Go, Clark Kendrick [1]
Lao, Bryan [1]
Yoshimoto, Junichiro [1]
Ikeda, Kazushi [1]
Affiliations
[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we present a reinforcement learning model of a dog shepherding a flock of sheep. The shepherding task, a heuristic model originally proposed by Strombom et al., describes the dynamics of sheep being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that, with discretized state and action spaces, the dog successfully herds a flock of sheep to the target position by first learning to reach a subgoal. The dog receives a reward when it reaches the neighbourhood of a subgoal and incurs a penalty each time the shepherding task is not completed. The stochasticity of the interaction among the sheep and the dog, together with the existence of multiple subgoals, affects the learning time of the agent. Finally, we present an example of the learned shepherding task in which the agent succeeds continuously after the 350th episode.
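The abstract names SARSA with discretized states and actions, a reward near a subgoal, and a penalty while the task is unfinished. The sketch below illustrates only the tabular SARSA update on a toy problem; it is not the authors' implementation. The one-dimensional corridor, the reward values (+1.0 at the subgoal, -0.01 per unfinished step), the hyperparameters, and the function names `epsilon_greedy`, `sarsa_update`, and `train` are all assumptions made for illustration.

```python
import random


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    n_actions = len(q[state])
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    best = max(q[state])
    # Break ties at random so untried actions still get explored.
    return rng.choice([a for a, v in enumerate(q[state]) if v == best])


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma, done):
    """On-policy TD update: Q(s,a) += alpha * (r + gamma*Q(s',a') - Q(s,a))."""
    target = r if done else r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


def train(n_states=10, episodes=500, max_steps=100,
          alpha=0.1, gamma=0.95, epsilon=0.1, seed=0):
    """Toy stand-in environment (assumption): the agent walks along a
    corridor of n_states cells; the last cell plays the role of a subgoal."""
    rng = random.Random(seed)
    goal = n_states - 1
    # Two discrete actions per state: 0 = step left, 1 = step right.
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        a = epsilon_greedy(q, s, epsilon, rng)
        for _ in range(max_steps):
            step = 1 if a == 1 else -1
            s_next = min(max(s + step, 0), n_states - 1)
            done = s_next == goal
            # Hypothetical reward scheme: bonus at the subgoal,
            # small penalty for every step on which it is not reached.
            r = 1.0 if done else -0.01
            a_next = epsilon_greedy(q, s_next, epsilon, rng)
            sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma, done)
            s, a = s_next, a_next
            if done:
                break
    return q


if __name__ == "__main__":
    q_table = train()
    for state, values in enumerate(q_table):
        print(state, [round(v, 3) for v in values])
```

SARSA is on-policy: the update uses the action `a_next` that the agent will actually take next, so exploration noise is reflected in the learned values. A full shepherding model would replace the corridor with a discretization of the dog and flock positions, which the abstract does not specify.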
Pages: 3833-3836
Number of pages: 4
Related papers
50 in total
  • [11] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
    Chen, Sheng-Lei
    Wei, Yan-Mei
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
  • [12] Task scheduling, resource provisioning, and load balancing on scientific workflows using parallel SARSA reinforcement learning agents and genetic algorithm
    Asghari, Ali
    Sohrabi, Mohammad Karim
    Yaghmaee, Farzin
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2800 - 2828
  • [13] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
  • [14] Deep Reinforcement Learning with Experience Replay Based on SARSA
    Zhao, Dongbin
    Wang, Haitao
    Shao, Kun
    Zhu, Yuanheng
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [15] Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors Using Deep Reinforcement Learning
    Zhi, Jixuan
    Lien, Jyh-Ming
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 4163 - 4168
  • [17] Deep SARSA-based reinforcement learning approach for anomaly network intrusion detection system
    Mohamed, Safa
    Ejbali, Ridha
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 22 (01) : 235 - 247
  • [18] Deep-Sarsa: A reinforcement learning algorithm for autonomous navigation
    Andrecut, M
    Ali, MK
    INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2001, 12 (10): : 1513 - 1523
  • [19] SARSA(0) Reinforcement Learning over Fully Homomorphic Encryption
    Suh, Jihoon
    Tanaka, Takashi
    2021 SICE INTERNATIONAL SYMPOSIUM ON CONTROL SYSTEMS (SICE ISCS 2021), 2021, : 1 - 7
  • [20] Autonomous Foraging with SARSA-based Deep Reinforcement Learning
    Mesquita, Anderson
    Nogueira, Yuri
    Vidal, Creto
    Cavalcante-Neto, Joaquim
    Serafim, Paulo
    2020 22ND SYMPOSIUM ON VIRTUAL AND AUGMENTED REALITY (SVR 2020), 2020, : 425 - 433