Distributed reinforcement learning for sequential decision making

被引:0
|
作者
Rogova, G [1 ]
Scott, P [1 ]
Lolett, C [1 ]
机构
[1] Encompass Consulting, Ctr Multisource Informat Fus, Honeoye Falls, NY USA
关键词
distributed systems; reinforcement learning; neural network; evidence theory; pignistic likelihood ratios test; profit sharing strategy;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper addresses a problem of reinforcement learning in a homogeneous non-communicating multi-agent system for sequential decision making. We introduce a particular reinforcement learning model composed of evidential reinforcement neural networks representing agents, a fusion center, and a decision maker. The fusion center combines beliefs in each hypothesis under consideration generated by the agents and produces pignistic probabilities of the hypotheses under consideration. These pignistic probabilities are used by a decision maker in a sequential pignistic probability ratio test to choose one of two actions: "defer decision" or "decide hypothesis k "The test is shaped to encourage early decisions and incorporates a finite decision deadline. Upon each decision, a non-binary reinforcement signal is computed by the environment, and is then fed back to the agents, which utilize it to learn an optimizing belief function. The learning algorithm adapts the "profit sharing strategy" to the sequential decision making setting.
引用
收藏
页码:1263 / 1268
页数:6
相关论文
共 50 条
  • [31] Decision making and learning while taking sequential risks
    Pleskac, Timothy J.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2008, 34 (01) : 167 - 185
  • [32] Social learning with heterogeneous agents and sequential decision making
    Wang, Yunlong
    Djuric, Petar M.
    DIGITAL SIGNAL PROCESSING, 2015, 47 : 17 - 24
  • [33] Structure Learning in Human Sequential Decision-Making
    Acuna, Daniel E.
    Schrater, Paul
    PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
  • [34] adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential Decision Making Human-in-the-Loop Systems
    Taherisadr, Mojtaba
    Stavroulakis, Stelios Andrew
    Elmalaki, Salma
    PROCEEDINGS 8TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2023, 2023, : 262 - 274
  • [35] Decision Making Based on Reinforcement Learning and Emotion Learning for Social Behavior
    Matsuda, Atsushi
    Misawa, Hideaki
    Horio, Keiichi
    IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2714 - 2719
  • [36] Distributed reinforcement learning in multi-agent decision systems
    Giráldez, JI
    Borrajo, D
    PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
  • [37] Learning, optimizing, and distributed decision making based on experience
    Ho, YC
    42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 4818 - 4819
  • [38] Reinforcement learning and decision making in monkeys during a competitive game
    Lee, D
    Conroy, ML
    McGreevy, BP
    Barraclough, DJ
    COGNITIVE BRAIN RESEARCH, 2004, 22 (01): : 45 - 58
  • [39] Quantum reinforcement learning during human decision-making
    Ji-An Li
    Daoyi Dong
    Zhengde Wei
    Ying Liu
    Yu Pan
    Franco Nori
    Xiaochu Zhang
    Nature Human Behaviour, 2020, 4 : 294 - 307
  • [40] Quantum reinforcement learning during human decision-making
    Li, Ji-An
    Dong, Daoyi
    Wei, Zhengde
    Liu, Ying
    Pan, Yu
    Nori, Franco
    Zhang, Xiaochu
    NATURE HUMAN BEHAVIOUR, 2020, 4 (03) : 294 - 307