A Reinforcement Learning Approach to the Shepherding Task Using SARSA

被引：0

作者：

Go, Clark Kendrick ^{[1
]}

Lao, Bryan ^{[1
]}

Yoshimoto, Junichiro ^{[1
]}

Ikeda, Kazushi ^{[1
]}

机构：

[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan

来源：

2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom, et al., describes the dynamics of the sheep while being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that with a discretized state and action space, the dog is able to successfully herd a flock of a sheep to the target position by first learning to reach a subgoal. A reward is awarded when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred for each time the shepherding task is not completed. The stochasticity of the interaction among sheep and dog, including the existence of multiple subgoals affect the learning time of the agent. Finally, we present an example of the learned shepherding task which shows the agent's continuous success after the 350th episode.

引用

页码：3833 / 3836

页数：4

共 50 条

[1] Deep Reinforcement Learning with Sarsa and Q-Learning: A Hybrid Approach
Xu, Zhi-xiong
Cao, Lei
Chen, Xi-liang
Li, Chen-xi
Zhang, Yong-liang
Lai, Jun
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (09) : 2315 - 2322
[2] Genetic Network Programming with reinforcement learning using sarsa algorithm
Mabu, Shingo
Hatakeyama, Hiroyuki
Hirasawa, Kotaro
Hu, Jinglu
2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 463 - +
[3] Factored SARSA(λ) algorithm of reinforcement learning
Chen, H.W.
Xie, J.P.
Xie, L.J.
2001, Science Press (38):
[4] Multi-Drone Collaborative Shepherding Through Multi-Task Reinforcement Learning
Wang, Guanghui
Peng, Junkun
Guan, Chenyang
Chen, Jinhua
Guo, Bing
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11): : 10311 - 10318
[5] Reinforcement Learning for Solving Communication Problems in Shepherding
Mohamed, Reem E.
Elsayed, Saber
Hunjet, Robert
Abbass, Hussein
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1626 - 1635
[6] Task Offloading and Resource Allocation for Mobile Edge Computing by Deep Reinforcement Learning Based on SARSA
Alfakih, Taha
Hassan, Mohammad Mehedi
Gumaei, Abdu
Savaglio, Claudio
Fortino, Giancarlo
IEEE ACCESS, 2020, 8 : 54074 - 54084
[7] Improved SARSA and DQN algorithms for reinforcement learning
Yao, Guangyu
Zhang, Nan
Duan, Zhenhua
Tian, Cong
THEORETICAL COMPUTER SCIENCE, 2025, 1027
[8] Model Predictive Control-Based Reinforcement Learning Using Expected Sarsa
Moradimaryamnegari, Hoomaan
Frego, Marco
Peer, Angelika
IEEE ACCESS, 2022, 10 : 81177 - 81191
[9] Task scheduling, resource provisioning, and load balancing on scientific workflows using parallel SARSA reinforcement learning agents and genetic algorithm
Ali Asghari
Mohammad Karim Sohrabi
Farzin Yaghmaee
The Journal of Supercomputing, 2021, 77 : 2800 - 2828
[10] Smoothed Sarsa: Reinforcement Learning for Robot Delivery Tasks
Ramachandran, Deepak
Gupta, Rakesh
ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 3327 - +

← 1 2 3 4 5 →