Quantum greedy algorithms for multi-armed bandits

被引：0

作者：

Ohno, Hiroshi ^{[1
]}

机构：

[1] Toyota Cent Res & Dev Labs Inc, 41-1 Yokomichi, Nagakute, Aichi 4801192, Japan

来源：

QUANTUM INFORMATION PROCESSING | 2023年 / 22卷 / 02期

关键词：

Multi-armed bandits; -greedy algorithm; MovieLens dataset; Quantum amplitude amplification; Regret analysis;

D O I：

10.1007/s11128-023-03844-2

中图分类号：

O4 [物理学];

学科分类号：

0702 ;

摘要：

Multi-armed bandits are widely used in machine learning applications such as recommendation systems. Here, we implement two quantum versions of the e-greedy algorithm, a popular algorithm for multi-armed bandits. One of the quantum greedy algorithms uses a quantum maximization algorithm and the other is a simple algorithm that uses an amplitude encoding method as a quantum subroutine instead of the argmax operation in the e-greedy algorithm. For the former algorithm, given a quantum oracle, the query complexity is on the order root K (O(root K)) in each round, where K is the number of arms. For the latter algorithm, quantum parallelism is achieved by the quantum superposition of the arms and the run-time complexity is on the order O(K)/O(log K) in each round. Bernoulli reward distributions and the MovieLens dataset are used to evaluate the algorithms with their classical counterparts. The experimental results show that for best arm identification, the performance of the quantum greedy algorithm is comparable with that of the classical counterparts.

引用

页数：20

共 50 条

[1] Quantum greedy algorithms for multi-armed bandits
Hiroshi Ohno
Quantum Information Processing, 22
[2] Quantum Exploration Algorithms for Multi-Armed Bandits
Wang, Daochen
You, Xuchen
Li, Tongyang
Childs, Andrew M.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10102 - 10110
[3] Algorithms for Differentially Private Multi-Armed Bandits
Tossou, Aristide C. Y.
Dimitrakakis, Christos
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2087 - 2093
[4] Optimal Algorithms for Multiplayer Multi-Armed Bandits
Wang, Po-An
Proutiere, Alexandre
Ariu, Kaito
Jedra, Yassir
Russo, Alessio
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
[5] Optimal Streaming Algorithms for Multi-Armed Bandits
Jin, Tianyuan
Huang, Keke
Tang, Jing
Xiao, Xiaokui
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[6] Quantum Reinforcement Learning for Multi-Armed Bandits
Liu, Yi-Pei
Li, Kuo
Cao, Xi
Jia, Qing-Shan
Wang, Xu
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 5675 - 5680
[7] Multi-Armed Bandits and Quantum Channel Oracles
Buchholz, Simon
Kuebler, Jonas M.
Schoelkopf, Bernhard
QUANTUM, 2025, 9
[8] Generic Asymptotically Optimal Algorithms for Multi-Armed Bandits
Combes, Richard
Magureanu, Stefan
Proutiere, Alexandre
2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 152 - 152
[9] Fair Algorithms for Multi-Agent Multi-Armed Bandits
Hossain, Safwan
Micha, Evi
Shah, Nisarg
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[10] Anytime optimal algorithms in stochastic multi-armed bandits
Degenne, Remy
Perchet, Vianney
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48

← 1 2 3 4 5 →