Distributed reinforcement learning for sequential decision making

被引：0

作者：

Rogova, G ^{[1
]}

Scott, P ^{[1
]}

Lolett, C ^{[1
]}

机构：

[1] Encompass Consulting, Ctr Multisource Informat Fus, Honeoye Falls, NY USA

来源：

PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II | 2002年

关键词：

distributed systems; reinforcement learning; neural network; evidence theory; pignistic likelihood ratios test; profit sharing strategy;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper addresses a problem of reinforcement learning in a homogeneous non-communicating multi-agent system for sequential decision making. We introduce a particular reinforcement learning model composed of evidential reinforcement neural networks representing agents, a fusion center, and a decision maker. The fusion center combines beliefs in each hypothesis under consideration generated by the agents and produces pignistic probabilities of the hypotheses under consideration. These pignistic probabilities are used by a decision maker in a sequential pignistic probability ratio test to choose one of two actions: "defer decision" or "decide hypothesis k "The test is shaped to encourage early decisions and incorporates a finite decision deadline. Upon each decision, a non-binary reinforcement signal is computed by the environment, and is then fed back to the agents, which utilize it to learn an optimizing belief function. The learning algorithm adapts the "profit sharing strategy" to the sequential decision making setting.

引用

页码：1263 / 1268

页数：6

共 50 条

[31] Decision making and learning while taking sequential risks
Pleskac, Timothy J.
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 2008, 34 (01) : 167 - 185
[32] Social learning with heterogeneous agents and sequential decision making
Wang, Yunlong
Djuric, Petar M.
DIGITAL SIGNAL PROCESSING, 2015, 47 : 17 - 24
[33] Structure Learning in Human Sequential Decision-Making
Acuna, Daniel E.
Schrater, Paul
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
[34] adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential Decision Making Human-in-the-Loop Systems
Taherisadr, Mojtaba
Stavroulakis, Stelios Andrew
Elmalaki, Salma
PROCEEDINGS 8TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2023, 2023, : 262 - 274
[35] Decision Making Based on Reinforcement Learning and Emotion Learning for Social Behavior
Matsuda, Atsushi
Misawa, Hideaki
Horio, Keiichi
IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 2714 - 2719
[36] Distributed reinforcement learning in multi-agent decision systems
Giráldez, JI
Borrajo, D
PROGRESS IN ARTIFICIAL INTELLIGENCE-IBERAMIA 98, 1998, 1484 : 148 - 159
[37] Learning, optimizing, and distributed decision making based on experience
Ho, YC
42ND IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-6, PROCEEDINGS, 2003, : 4818 - 4819
[38] Reinforcement learning and decision making in monkeys during a competitive game
Lee, D
Conroy, ML
McGreevy, BP
Barraclough, DJ
COGNITIVE BRAIN RESEARCH, 2004, 22 (01): : 45 - 58
[39] Quantum reinforcement learning during human decision-making
Ji-An Li
Daoyi Dong
Zhengde Wei
Ying Liu
Yu Pan
Franco Nori
Xiaochu Zhang
Nature Human Behaviour, 2020, 4 : 294 - 307
[40] Quantum reinforcement learning during human decision-making
Li, Ji-An
Dong, Daoyi
Wei, Zhengde
Liu, Ying
Pan, Yu
Nori, Franco
Zhang, Xiaochu
NATURE HUMAN BEHAVIOUR, 2020, 4 (03) : 294 - 307

← 1 2 3 4 5 →