A Satisficing Strategy with Variable Reference in the Multi-armed Bandit Problems

被引：0

作者：

Kohno, Yu ^{[1
]}

Takahashi, Tatsuji ^{[2
]}

机构：

[1] Tokyo Denki Univ, Grad Sch Adv Sci & Technol, Hiki, Saitama 3500394, Japan

[2] Tokyo Denki Univ, Hiki, Saitama 3500394, Japan

来源：

PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF NUMERICAL ANALYSIS AND APPLIED MATHEMATICS 2014 (ICNAAM-2014) | 2015年 / 1648卷

关键词：

Symmetric reasoning; decision-making; N armed bandit problem; speed-accuracy trade-off;

D O I：

10.1063/1.4912815

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

The loosely symmetric model (LS) is as a subjective probability model that came from human beings' cognitive characteristics. To suggest a value to apply human beings' cognitive characteristics, we developed a value function "loosely symmetric model with variable reference" (LS-aVR) that expanded LS in the decision-amaking. It is important how get a reference value having an agent from environment to determine whether an algorithm using LS-aVR explores in comparison with a reference value. In this study, we proposed using statistical knowledge in an online method to acquire a reference value. Therefore we succeeded in making the result that new method exceeded a superior existing model in the multi-aarmed banded problem that is a kind of decision-amaking problems.

引用

页数：4

共 50 条

[41] CCN Interest Forwarding Strategy as Multi-Armed Bandit Model with Delays
Avrachenkov, Konstantin
Jacko, Peter
2012 6TH INTERNATIONAL CONFERENCE ON NETWORK GAMES, CONTROL AND OPTIMIZATION (NETGCOOP), 2012, : 38 - 43
[42] Maximal Expectation as Upper Confidence Bound for Multi-armed Bandit Problems
Kao, Kuo-Yuan
Chen, I-Hao
2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 325 - 329
[43] Mean-Variance and Value at Risk in Multi-Armed Bandit Problems
Vakili, Sattar
Zhao, Qing
2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2015, : 1330 - 1335
[44] Thompson Sampling Based Mechanisms for Stochastic Multi-Armed Bandit Problems
Ghalme, Ganesh
Jain, Shweta
Gujar, Sujit
Narahari, Y.
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 87 - 95
[45] Empirical Gittins index strategies with ?-explorations for multi-armed bandit problems
Li, Xiao
Li, Yuqiang
Wu, Xianyi
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
[46] Modeling Choice Variation in Search Strategies with Multi-armed Bandit Problems
Sharma, Neha
Dutt, Varun
2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 91 - 97
[47] Solving multi-armed bandit problems using a chaotic microresonator comb
Cuevas, Jonathan
Iwami, Ryugo
Uchida, Atsushi
Minoshima, Kaoru
Kuse, Naoya
APL PHOTONICS, 2024, 9 (03)
[48] ON MULTI-ARMED BANDIT PROBLEM WITH NUISANCE PARAMETER
孙嘉阳
Science China Mathematics, 1986, (05) : 464 - 475
[49] Multi-armed bandit algorithms and empirical evaluation
Vermorel, J
Mohri, M
MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 437 - 448
[50] Sustainable Cooperative Coevolution with a Multi-Armed Bandit
De Rainville, Francois-Michel
Sebag, Michele
Gagne, Christian
Schoenauer, Marc
Laurendeau, Denis
GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1517 - 1524

← 1 2 3 4 5 →