A Reinforcement Learning Approach to the Shepherding Task Using SARSA

被引：0

作者：

Go, Clark Kendrick ^{[1
]}

Lao, Bryan ^{[1
]}

Yoshimoto, Junichiro ^{[1
]}

Ikeda, Kazushi ^{[1
]}

机构：

[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan

来源：

2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom, et al., describes the dynamics of the sheep while being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that with a discretized state and action space, the dog is able to successfully herd a flock of a sheep to the target position by first learning to reach a subgoal. A reward is awarded when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred for each time the shepherding task is not completed. The stochasticity of the interaction among sheep and dog, including the existence of multiple subgoals affect the learning time of the agent. Finally, we present an example of the learned shepherding task which shows the agent's continuous success after the 350th episode.

引用

页码：3833 / 3836

页数：4

共 50 条

[11] Least-Squares SARSA(λ) Algorithms for Reinforcement Learning
Chen, Sheng-Lei
Wei, Yan-Mei
ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 632 - +
[12] Task scheduling, resource provisioning, and load balancing on scientific workflows using parallel SARSA reinforcement learning agents and genetic algorithm
Asghari, Ali
Sohrabi, Mohammad Karim
Yaghmaee, Farzin
JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2800 - 2828
[13] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
Iima, Hitoshi
Kuroe, Yasuaki
2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
[14] Deep Reinforcement Learning with Experience Replay Based on SARSA
Zhao, Dongbin
Wang, Haitao
Shao, Kun
Zhu, Yuanheng
PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
[15] Learning to Herd Agents Amongst Obstacles: Training Robust Shepherding Behaviors Using Deep Reinforcement Learning
Zhi, Jixuan
Lien, Jyh-Ming
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02): : 4163 - 4168
[16] Deep SARSA-based reinforcement learning approach for anomaly network intrusion detection system
Safa Mohamed
Ridha Ejbali
International Journal of Information Security, 2023, 22 : 235 - 247
[17] Deep SARSA-based reinforcement learning approach for anomaly network intrusion detection system
Mohamed, Safa
Ejbali, Ridha
INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2023, 22 (01) : 235 - 247
[18] Deep-Sarsa: A reinforcement learning algorithm for autonomous navigation
Andrecut, M
Ali, MK
INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2001, 12 (10): : 1513 - 1523
[19] SARSA(0) Reinforcement Learning over Fully Homomorphic Encryption
Suh, Jihoon
Tanaka, Takashi
2021 SICE INTERNATIONAL SYMPOSIUM ON CONTROL SYSTEMS (SICE ISCS 2021), 2021, : 1 - 7
[20] Autonomous Foraging with SARSA-based Deep Reinforcement Learning
Mesquita, Anderson
Nogueira, Yuri
Vidal, Creto
Cavalcante-Neto, Joaquim
Serafim, Paulo
2020 22ND SYMPOSIUM ON VIRTUAL AND AUGMENTED REALITY (SVR 2020), 2020, : 425 - 433

← 1 2 3 4 5 →