A Reinforcement Learning Approach to the Shepherding Task Using SARSA

被引:0
|
作者
Go, Clark Kendrick [1 ]
Lao, Bryan [1 ]
Yoshimoto, Junichiro [1 ]
Ikeda, Kazushi [1 ]
机构
[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom, et al., describes the dynamics of the sheep while being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that with a discretized state and action space, the dog is able to successfully herd a flock of a sheep to the target position by first learning to reach a subgoal. A reward is awarded when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred for each time the shepherding task is not completed. The stochasticity of the interaction among sheep and dog, including the existence of multiple subgoals affect the learning time of the agent. Finally, we present an example of the learned shepherding task which shows the agent's continuous success after the 350th episode.
引用
收藏
页码:3833 / 3836
页数:4
相关论文
共 50 条
  • [31] Safe Reinforcement Learning for Single Train Trajectory Optimization via Shield SARSA
    Zhao, Zicong
    Xun, Jing
    Wen, Xuguang
    Chen, Jianqiu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 412 - 428
  • [32] A Novel Deep Reinforcement Learning Approach for Task Offloading in MEC Systems
    Liu, Xiaowei
    Jiang, Shuwen
    Wu, Yi
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [33] Autonomous RL: Autonomous Vehicle Obstacle Avoidance in a Dynamic Environment using MLP-SARSA Reinforcement Learning
    Arvind, C. S.
    Senthilnath, J.
    2019 IEEE 5TH INTERNATIONAL CONFERENCE ON MECHATRONICS SYSTEM AND ROBOTS (ICMSR 2019), 2019, : 120 - 124
  • [34] A Deep Reinforcement Learning Approach for Competitive Task Assignment in Enterprise Blockchain
    Volpe, Gaetano
    Mangini, Agostino Marcello
    Fanti, Maria Pia
    IEEE ACCESS, 2023, 11 : 48236 - 48247
  • [35] A Novel Task Provisioning Approach Fusing Reinforcement Learning for Big Data
    Cheng, Yongyi
    Xu, Gaochao
    IEEE ACCESS, 2019, 7 : 143699 - 143709
  • [36] A deep reinforcement learning approach for dynamic task scheduling of flight tests
    Tian, Bei
    Xiao, Gang
    Shen, Yu
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (13): : 18761 - 18796
  • [37] A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling
    Che, Haiying
    Bai, Zixing
    Zuo, Rong
    Li, Honglei
    COMPLEXITY, 2020, 2020
  • [38] A Multi-Task Reinforcement Learning Approach for Navigating Unsignalized Intersections
    Kai, Shixiong
    Wang, Bin
    Chen, Dong
    Hao, Jianye
    Zhang, Hongbo
    Liu, Wulong
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1682 - 1687
  • [39] Task distribution and human resource management using reinforcement learning
    Paduraru, Ciprian
    Paduraru, Miruna
    Camelia Patilea, Catalina
    2021 36TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING WORKSHOPS (ASEW 2021), 2021, : 96 - 101
  • [40] A Hybrid Multi-Task Learning Approach for Optimizing Deep Reinforcement Learning Agents
    Varghese, Nelson Vithayathil
    Mahmoud, Qusay H.
    IEEE ACCESS, 2021, 9 : 44681 - 44703