Distributed reinforcement learning for sequential decision making

被引:0
|
作者
Rogova, G [1 ]
Scott, P [1 ]
Lolett, C [1 ]
机构
[1] Encompass Consulting, Ctr Multisource Informat Fus, Honeoye Falls, NY USA
关键词
distributed systems; reinforcement learning; neural network; evidence theory; pignistic likelihood ratios test; profit sharing strategy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper addresses a problem of reinforcement learning in a homogeneous non-communicating multi-agent system for sequential decision making. We introduce a particular reinforcement learning model composed of evidential reinforcement neural networks representing agents, a fusion center, and a decision maker. The fusion center combines beliefs in each hypothesis under consideration generated by the agents and produces pignistic probabilities of the hypotheses under consideration. These pignistic probabilities are used by a decision maker in a sequential pignistic probability ratio test to choose one of two actions: "defer decision" or "decide hypothesis k "The test is shaped to encourage early decisions and incorporates a finite decision deadline. Upon each decision, a non-binary reinforcement signal is computed by the environment, and is then fed back to the agents, which utilize it to learn an optimizing belief function. The learning algorithm adapts the "profit sharing strategy" to the sequential decision making setting.
引用
收藏
页码:1263 / 1268
页数:6
相关论文
共 50 条
  • [1] Sequential Decision Making with "Sequential Information" in Deep Reinforcement Learning
    Xu, Aimin
    Yuan, Linghui
    Liu, Yunlong
    [J]. PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 173 - 184
  • [2] Reinforcement learning for decision making in sequential visual attention
    Paletta, Lucas
    Fritz, Gerald
    [J]. ATTENTION IN COGNITIVE SYSTEMS: THEORIES AND SYSTEMS FROM AN INTERDISCIPLINARY VIEWPOINT, 2007, 4840 : 293 - 306
  • [3] Negotiable Reinforcement Learning for Pareto Optimal Sequential Decision-Making
    Desai, Nishant
    Critch, Andrew
    Russell, Stuart
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [4] A Novel Heterogeneous Swarm Reinforcement Learning Method for Sequential Decision Making Problems
    Akbari, Zohreh
    Unland, Rainer
    [J]. MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (02): : 590 - 610
  • [5] Hierarchical reinforcement learning and decision making
    Botvinick, Matthew Michael
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 956 - 962
  • [6] Informing sequential clinical decision-making through reinforcement learning: an empirical study
    Shortreed, Susan M.
    Laber, Eric
    Lizotte, Daniel J.
    Stroup, T. Scott
    Pineau, Joelle
    Murphy, Susan A.
    [J]. MACHINE LEARNING, 2011, 84 (1-2) : 109 - 136
  • [7] Informing sequential clinical decision-making through reinforcement learning: an empirical study
    Susan M. Shortreed
    Eric Laber
    Daniel J. Lizotte
    T. Scott Stroup
    Joelle Pineau
    Susan A. Murphy
    [J]. Machine Learning, 2011, 84 : 109 - 136
  • [8] A Reinforcement Learning Approach for Sequential Decision-Making Process of Attacks in Smart Grid
    Ni, Zhen
    Paul, Shuva
    Zhong, Xiangnan
    Wei, Qinglai
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1622 - 1629
  • [9] Learning to Generalize for Sequential Decision Making
    Yin, Xusen
    Weischedel, Ralph
    May, Jonathan
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [10] Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring
    Wan, Runzhe
    Liu, Yu
    McQueen, James
    Hains, Doug
    Song, Rui
    [J]. PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5016 - 5027