A Satisficing Strategy with Variable Reference in the Multi-armed Bandit Problems

被引:0
|
作者
Kohno, Yu [1 ]
Takahashi, Tatsuji [2 ]
机构
[1] Tokyo Denki Univ, Grad Sch Adv Sci & Technol, Hiki, Saitama 3500394, Japan
[2] Tokyo Denki Univ, Hiki, Saitama 3500394, Japan
关键词
Symmetric reasoning; decision-making; N armed bandit problem; speed-accuracy trade-off;
D O I
10.1063/1.4912815
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The loosely symmetric model (LS) is as a subjective probability model that came from human beings' cognitive characteristics. To suggest a value to apply human beings' cognitive characteristics, we developed a value function "loosely symmetric model with variable reference" (LS-aVR) that expanded LS in the decision-amaking. It is important how get a reference value having an agent from environment to determine whether an algorithm using LS-aVR explores in comparison with a reference value. In this study, we proposed using statistical knowledge in an online method to acquire a reference value. Therefore we succeeded in making the result that new method exceeded a superior existing model in the multi-aarmed banded problem that is a kind of decision-amaking problems.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] CCN Interest Forwarding Strategy as Multi-Armed Bandit Model with Delays
    Avrachenkov, Konstantin
    Jacko, Peter
    2012 6TH INTERNATIONAL CONFERENCE ON NETWORK GAMES, CONTROL AND OPTIMIZATION (NETGCOOP), 2012, : 38 - 43
  • [42] Maximal Expectation as Upper Confidence Bound for Multi-armed Bandit Problems
    Kao, Kuo-Yuan
    Chen, I-Hao
    2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 325 - 329
  • [43] Mean-Variance and Value at Risk in Multi-Armed Bandit Problems
    Vakili, Sattar
    Zhao, Qing
    2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2015, : 1330 - 1335
  • [44] Thompson Sampling Based Mechanisms for Stochastic Multi-Armed Bandit Problems
    Ghalme, Ganesh
    Jain, Shweta
    Gujar, Sujit
    Narahari, Y.
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 87 - 95
  • [45] Empirical Gittins index strategies with ?-explorations for multi-armed bandit problems
    Li, Xiao
    Li, Yuqiang
    Wu, Xianyi
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
  • [46] Modeling Choice Variation in Search Strategies with Multi-armed Bandit Problems
    Sharma, Neha
    Dutt, Varun
    2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 91 - 97
  • [47] Solving multi-armed bandit problems using a chaotic microresonator comb
    Cuevas, Jonathan
    Iwami, Ryugo
    Uchida, Atsushi
    Minoshima, Kaoru
    Kuse, Naoya
    APL PHOTONICS, 2024, 9 (03)
  • [48] ON MULTI-ARMED BANDIT PROBLEM WITH NUISANCE PARAMETER
    孙嘉阳
    Science China Mathematics, 1986, (05) : 464 - 475
  • [49] Multi-armed bandit algorithms and empirical evaluation
    Vermorel, J
    Mohri, M
    MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 437 - 448
  • [50] Sustainable Cooperative Coevolution with a Multi-Armed Bandit
    De Rainville, Francois-Michel
    Sebag, Michele
    Gagne, Christian
    Schoenauer, Marc
    Laurendeau, Denis
    GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1517 - 1524