Beyond the hazard rate: More perturbation algorithms for adversarial multi-armed bandits

被引:0
|
作者
机构
[1] Li, Zifan
[2] Tewari, Ambuj
关键词
Follow the perturbed leader - Gradient based algorithm - Multi armed bandit - Online learning - Regret;
D O I
暂无
中图分类号
学科分类号
摘要
22
引用
收藏
相关论文
共 50 条
  • [21] Optimal Algorithms for Range Searching over Multi-Armed Bandits
    Barman, Siddharth
    Krishnamurthy, Ramakrishnan
    Rahul, Saladi
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2177 - 2183
  • [22] Best Arm Identification for Both Stochastic and Adversarial Multi-armed Bandits
    Zhang, Hantao
    Shen, Cong
    2018 IEEE INFORMATION THEORY WORKSHOP (ITW), 2018, : 385 - 389
  • [23] Self-Unaware Adversarial Multi-Armed Bandits With Switching Costs
    Alipour-Fanid, Amir
    Dabaghchian, Monireh
    Zeng, Kai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (06) : 2908 - 2922
  • [24] On No-Sensing Adversarial Multi-Player Multi-Armed Bandits with Collision Communications
    Shi C.
    Shen C.
    IEEE Journal on Selected Areas in Information Theory, 2021, 2 (02): : 515 - 533
  • [25] An Attackability Perspective on No-Sensing Adversarial Multi-player Multi-armed Bandits
    Shi, Chengshuai
    Shen, Cong
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 533 - 538
  • [26] Aggregation of Multi-Armed Bandits Learning Algorithms for Opportunistic Spectrum Access
    Besson, Lilian
    Kaufmann, Emilie
    Moy, Christophe
    2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2018,
  • [27] Multi-armed bandits for performance marketing
    Gigli, Marco
    Stella, Fabio
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [28] Lenient Regret for Multi-Armed Bandits
    Merlis, Nadav
    Mannor, Shie
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8950 - 8957
  • [29] Finding structure in multi-armed bandits
    Schulz, Eric
    Franklin, Nicholas T.
    Gershman, Samuel J.
    COGNITIVE PSYCHOLOGY, 2020, 119
  • [30] ON MULTI-ARMED BANDITS AND DEBT COLLECTION
    Czekaj, Lukasz
    Biegus, Tomasz
    Kitlowski, Robert
    Tomasik, Pawel
    36TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE, ESM 2022, 2022, : 137 - 141