A Reinforcement Learning Approach to the Shepherding Task Using SARSA

Cited by: 0
Authors
Go, Clark Kendrick [1]
Lao, Bryan [1]
Yoshimoto, Junichiro [1]
Ikeda, Kazushi [1]
Affiliations
[1] Nara Inst Sci & Technol, Ikoma, Nara, Japan
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we present a reinforcement learning model of the shepherding of a flock of sheep by a dog. The shepherding task, a heuristic model originally proposed by Strombom et al., describes the dynamics of the sheep while they are herded by a dog to a predefined target. This study recreates the proposed model using SARSA, an algorithm for learning the optimal policy in reinforcement learning. Results show that, with discretized state and action spaces, the dog is able to successfully herd a flock of sheep to the target position by first learning to reach a subgoal. A reward is given when the dog reaches the neighbourhood of a subgoal, while a penalty is incurred each time the shepherding task is not completed. The stochasticity of the interaction among the sheep and the dog, together with the existence of multiple subgoals, affects the learning time of the agent. Finally, we present an example of the learned shepherding task that shows the agent's continuous success after the 350th episode.
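The abstract does not give implementation details, so the following is only a minimal sketch of tabular SARSA with discretized states and actions, not the authors' model. It replaces the flock dynamics with a hypothetical one-dimensional corridor in which a single agent must reach one subgoal cell. The function names, the reward of 1.0, the per-step penalty of -0.01, and all hyperparameters are assumptions chosen for illustration.

```python
import random


def epsilon_greedy(q, state, n_actions, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(n_actions)
    values = q[state]
    best = max(values)
    # Break ties at random so the untrained agent does not favour one action.
    return rng.choice([a for a, v in enumerate(values) if v == best])


def step(state, action, size, goal):
    """Hypothetical corridor: action 0 moves left, action 1 moves right."""
    nxt = min(max(state + (1 if action == 1 else -1), 0), size - 1)
    if nxt == goal:
        return nxt, 1.0, True   # assumed reward for reaching the subgoal
    return nxt, -0.01, False    # assumed penalty while the task is incomplete


def train_sarsa(size=8, goal=7, episodes=500, alpha=0.1, gamma=0.95,
                epsilon=0.1, max_steps=200, seed=0):
    """Learn a Q-table with the on-policy SARSA update."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(size)]
    for _ in range(episodes):
        s = 0
        a = epsilon_greedy(q, s, 2, epsilon, rng)
        for _ in range(max_steps):
            s2, r, done = step(s, a, size, goal)
            # SARSA picks the next action first, then bootstraps from it.
            a2 = epsilon_greedy(q, s2, 2, epsilon, rng)
            target = r if done else r + gamma * q[s2][a2]
            q[s][a] += alpha * (target - q[s][a])
            s, a = s2, a2
            if done:
                break
    return q


def greedy_path(q, size=8, goal=7, max_steps=50):
    """Follow the greedy policy from cell 0 and return the visited cells."""
    s, path = 0, [0]
    for _ in range(max_steps):
        a = max(range(2), key=lambda i: q[s][i])
        s, _, done = step(s, a, size, goal)
        path.append(s)
        if done:
            break
    return path


if __name__ == "__main__":
    q_table = train_sarsa()
    print(greedy_path(q_table))
```

The update uses the action actually selected in the next state, which is what makes SARSA on-policy; swapping `q[s2][a2]` for `max(q[s2])` would turn the sketch into Q-learning instead.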
Pages: 3833-3836
Page count: 4