Bandit Algorithm Driven by a Classical Random Walk and a Quantum Walk

被引:0
|
作者
Yamagami, Tomoki [1 ]
Segawa, Etsuo [2 ]
Mihana, Takatomo [1 ]
Rohm, Andre [1 ]
Horisaki, Ryoichi [1 ]
Naruse, Makoto [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat Phys & Comp, 7-3-1 Hongo, Bunkyo, Tokyo 1138656, Japan
[2] Yokohama Natl Univ, Grad Sch Environm & Informat Sci, 79-1 Tokiwadai, Hodogaya, Yokohama, Kanagawa 2408501, Japan
基金
日本科学技术振兴机构; 日本学术振兴会;
关键词
random walk; quantum walk; bandit algorithm; exploration-exploitation trade-off; decision-making;
D O I
10.3390/e25060843
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Quantum walks (QWs) have a property that classical random walks (RWs) do not possess-the coexistence of linear spreading and localization-and this property is utilized to implement various kinds of applications. This paper proposes RW- and QW-based algorithms for multi-armed-bandit (MAB) problems. We show that, under some settings, the QW-based model realizes higher performance than the corresponding RW-based one by associating the two operations that make MAB problems difficult-exploration and exploitation-with these two behaviors of QWs.
引用
收藏
页数:20
相关论文
共 50 条