An α-No-Regret Algorithm For Graphical Bilinear Bandits

被引:0
|
作者
Rizk, Geovani [1 ]
Colin, Igor [2 ]
Thomas, Albert [2 ]
Laraki, Rida [1 ]
Chevaleyre, Yann [1 ]
机构
[1] Univ Paris 09, PSL, CNRS, LAMSADE, Paris, France
[2] Huawei Noahs Ark Lab, Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where n agents in a graph play a stochastic bilinear bandit game with each of their neighbors. This setting reveals a combinatorial NP-hard problem that prevents the use of any existing regret-based algorithm in the (bi-)linear bandit literature. In this paper, we fill this gap and present the first regret-based algorithm for graphical bilinear bandits using the principle of optimism in the face of uncertainty. Theoretical analysis of this new method yields an upper bound of (O) over tilde(root T) on the.-regret and evidences the impact of the graph structure on the rate of convergence. Finally, we show through various experiments the validity of our approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Regret Analysis of a Markov Policy Gradient Algorithm for Multiarm Bandits
    Walton, Neil
    Denisov, Denis
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (03) : 1553 - 1588
  • [22] Regret of Queueing Bandits
    Krishnasamy, Subhashini
    Sen, Rajat
    Johari, Ramesh
    Shakkottai, Sanjay
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [23] Strategizing against No-regret Learners
    Deng, Yuan
    Schneider, Jon
    Sivan, Balasubramanian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [24] On Fixed Convex Combinations of No-Regret Learners
    Calliess, Jan-P.
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 494 - 504
  • [25] Environmental protection: A no-regret strategy for India
    Khanna, M
    Zilberman, D
    [J]. ENERGY AND ECONOMIC GROWTH: IS SUSTAINABLE GROWTH POSSIBLE?, VOLS 1-3, 1997, : 787 - 799
  • [26] Limits and limitations of no-regret learning in games
    Monnot, Barnabe
    Piliouras, Georgios
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2017, 32
  • [27] No-Regret Learning Supports Voters' Competence
    Spelda, Petr
    Stritecky, Vit
    Symons, John
    [J]. SOCIAL EPISTEMOLOGY, 2024, 38 (05) : 543 - 559
  • [28] Opportunistic Approachability and Generalized No-Regret Problems
    Bernstein, Andrey
    Mannor, Shie
    Shimkin, Nahum
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 2014, 39 (04) : 1057 - 1083
  • [29] Optimistic No-regret Algorithms for Discrete Caching
    Mhaisen N.
    Sinha A.
    Paschos G.
    Iosifidis G.
    [J]. Performance Evaluation Review, 2023, 51 (01): : 69 - 70
  • [30] No-regret Algorithms for Fair Resource Allocation
    Sinha, Abhishek
    Joshi, Ativ
    Bhattacharjee, Rajarshi
    Musco, Cameron
    Hajiesmaili, Mohammad
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,