A Reinforcement Learning Approach to the Shepherding Task Using SARSA

被引：0

作者：

Go, Clark Kendrick ^{[1
]}

Lao, Bryan ^{[1
]}

Yoshimoto, Junichiro ^{[1
]}

Ikeda, Kazushi ^{[1
]}

机构：

[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan

来源：

2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom, et al., describes the dynamics of the sheep while being herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that with a discretized state and action space, the dog is able to successfully herd a flock of a sheep to the target position by first learning to reach a subgoal. A reward is awarded when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred for each time the shepherding task is not completed. The stochasticity of the interaction among sheep and dog, including the existence of multiple subgoals affect the learning time of the agent. Finally, we present an example of the learned shepherding task which shows the agent's continuous success after the 350th episode.

引用

页码：3833 / 3836

页数：4

共 50 条

[21] An Improved Sarsa(λ) Reinforcement Learning Algorithm for Wireless Communication Systems
Jiang, Hao
Gui, Renjie
Chen, Zhen
Wu, Liang
Dang, Jian
Zhou, Jie
IEEE ACCESS, 2019, 7 : 115418 - 115427
[22] Using reinforcement learning to adapt an imitation task
Guenter, Florent
Billard, Aude G.
2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1028 - 1033
[23] A Core Task Abstraction Approach to Hierarchical Reinforcement Learning
Li, Zhuoru
Narayan, Akshay
Leong, Tze-Yun
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1411 - 1412
[24] Optimal Detection Task Allocation: A Reinforcement Learning Approach
Huang, Qilong
Bu, Qing
Qin, Ziyi
2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 369 - 374
[25] SARSA-based reinforcement learning for motion planning in Serial Manipulators
Aleo, Ignazio
Arena, Paolo
Patane, Luca
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[26] Optimizing Workflow Task Clustering Using Reinforcement Learning
Leong, Chin Poh
Liew, Chee Sun
Chan, Chee Seng
Rehman, Muhammad Habib Ur
IEEE ACCESS, 2021, 9 : 110614 - 110626
[27] Task Scheduling in Cloud Using Deep Reinforcement Learning
Swarup, Shashank
Shakshuki, Elhadi M.
Yasar, Ansar
12TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 4TH INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2021, 184 : 42 - 51
[28] Adaptive task scheduling in IoT using reinforcement learning
Pandit, Mohammad Khalid
Mir, Roohie Naaz
Chishti, Mohammad Ahsan
INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2020, 13 (03) : 261 - 282
[29] Analysis of Space Manipulator Route Planning Based on Sarsa (λ) Reinforcement Learning
Xu
Lu S.
Yuhang Xuebao/Journal of Astronautics, 2019, 40 (04): : 435 - 443
[30] A Sarsa reinforcement learning hybrid ensemble method for robotic battery power forecasting
Peng, Fei
Liu, Hui
Zheng, Li
JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2023, 30 (11) : 3867 - 3880

← 1 2 3 4 5 →