Beyond the hazard rate: More perturbation algorithms for adversarial multi-armed bandits

被引:0
|
作者
机构
[1] Li, Zifan
[2] Tewari, Ambuj
关键词
Follow the perturbed leader - Gradient based algorithm - Multi armed bandit - Online learning - Regret;
D O I
暂无
中图分类号
学科分类号
摘要
22
引用
收藏
相关论文
共 50 条
  • [1] Beyond the Hazard Rate: More Perturbation Algorithms for Adversarial Multi-armed Bandits
    Li, Zifan
    Tewari, Ambuj
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [2] Quantum greedy algorithms for multi-armed bandits
    Hiroshi Ohno
    Quantum Information Processing, 22
  • [3] Algorithms for Differentially Private Multi-Armed Bandits
    Tossou, Aristide C. Y.
    Dimitrakakis, Christos
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2087 - 2093
  • [4] Optimal Algorithms for Multiplayer Multi-Armed Bandits
    Wang, Po-An
    Proutiere, Alexandre
    Ariu, Kaito
    Jedra, Yassir
    Russo, Alessio
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [5] Optimal Streaming Algorithms for Multi-Armed Bandits
    Jin, Tianyuan
    Huang, Keke
    Tang, Jing
    Xiao, Xiaokui
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [6] Quantum Exploration Algorithms for Multi-Armed Bandits
    Wang, Daochen
    You, Xuchen
    Li, Tongyang
    Childs, Andrew M.
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10102 - 10110
  • [7] Quantum greedy algorithms for multi-armed bandits
    Ohno, Hiroshi
    QUANTUM INFORMATION PROCESSING, 2023, 22 (02)
  • [8] Adapting to Delays and Data in Adversarial Multi-Armed Bandits
    Gyorgy, Andras
    Joulani, Pooria
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [9] Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits
    Wang, Zhiwei
    Wang, Huazheng
    Wang, Hongning
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024, : 15770 - 15777
  • [10] Generic Asymptotically Optimal Algorithms for Multi-Armed Bandits
    Combes, Richard
    Magureanu, Stefan
    Proutiere, Alexandre
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 152 - 152