A Reinforcement Learning Approach to the Shepherding Task Using SARSA

Authors
Go, Clark Kendrick [1]
Lao, Bryan [1]
Yoshimoto, Junichiro [1]
Ikeda, Kazushi [1]
Affiliations
[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom et al., describes the dynamics of the sheep while being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that, with discretized state and action spaces, the dog is able to successfully herd a flock of sheep to the target position by first learning to reach a subgoal. A reward is given when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred each time the shepherding task is not completed. The stochasticity of the interaction between the sheep and the dog, together with the existence of multiple subgoals, affects the learning time of the agent. Finally, we present an example of the learned shepherding task, which shows the agent's continuous success after the 350th episode.
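The abstract describes tabular SARSA over discretized states and actions, with a reward near a subgoal and a penalty otherwise. The following is a minimal sketch of that update rule, not the authors' implementation: the one-dimensional corridor environment, the reward values (1.0 at the goal, -0.01 per step), the hyperparameters, and the function names `epsilon_greedy`, `sarsa_update`, and `train` are all assumptions chosen for illustration.

```python
import random


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = [q.get((state, a), 0.0) for a in range(n_actions)]
    return values.index(max(values))


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9, terminal=False):
    """On-policy update: Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a))."""
    # At a terminal transition there is no bootstrapped future value.
    target = r if terminal else r + gamma * q.get((s_next, a_next), 0.0)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (target - old)
    return q[(s, a)]


def train(n_cells=6, episodes=200, max_steps=50, epsilon=0.1, seed=0):
    """Toy stand-in for the task: an agent walks a corridor toward the last cell.

    Hypothetical environment: action 0 moves left, action 1 moves right.
    """
    rng = random.Random(seed)
    q = {}
    goal = n_cells - 1
    for _ in range(episodes):
        s = 0
        a = epsilon_greedy(q, s, 2, epsilon, rng)
        for _ in range(max_steps):
            step = 1 if a == 1 else -1
            s_next = max(0, min(goal, s + step))
            done = s_next == goal
            # Assumed reward scheme: bonus at the goal, small penalty otherwise.
            r = 1.0 if done else -0.01
            # SARSA chooses the next action before updating, then follows it.
            a_next = epsilon_greedy(q, s_next, 2, epsilon, rng)
            sarsa_update(q, s, a, r, s_next, a_next, terminal=done)
            s, a = s_next, a_next
            if done:
                break
    return q
```

The point of the sketch is the on-policy character of SARSA: the update bootstraps from the action the agent will actually take next (`a_next`), rather than from the greedy maximum as in Q-learning. The real shepherding state (dog and flock positions) would replace the integer cell index used here.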
Pages: 3833-3836
Page count: 4