Achieving fairness in the stochastic multi-armed bandit problem

被引:0
|
作者
Patil, Vishakha [1 ]
Ghalme, Ganesh [2 ]
Nair, Vineet [3 ]
Narahari, Y. [4 ]
机构
[1] Patil, Vishakha
[2] Ghalme, Ganesh
[3] Nair, Vineet
[4] Narahari, Y.
来源
| 1600年 / Microtome Publishing卷 / 22期
关键词
Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:1 / 31
相关论文
共 50 条
  • [21] Adversarial multi-armed bandit approach to stochastic optimization
    Chang, Hyeong Soo
    Fu, Michael C.
    Marcus, Steven I.
    PROCEEDINGS OF THE 45TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2006, : 5684 - +
  • [22] Achieving Regular and Fair Learning in Combinatorial Multi-Armed Bandit
    Wu, Xiaoyi
    Li, Bin
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 361 - 370
  • [23] Adaptive Active Learning as a Multi-armed Bandit Problem
    Czarnecki, Wojciech M.
    Podolak, Igor T.
    21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014), 2014, 263 : 989 - 990
  • [24] On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards
    Gai, Yi
    Krishnamachari, Bhaskar
    Liu, Mingyan
    2011 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE (GLOBECOM 2011), 2011,
  • [25] Possibilistic reward methods for the multi-armed bandit problem
    Martin, Miguel
    Jimenez-Martin, Antonio
    Mateos, Alfonso
    NEUROCOMPUTING, 2018, 310 : 201 - 212
  • [26] Scalable Discrete Sampling as a Multi-Armed Bandit Problem
    Chen, Yutian
    Ghahramani, Zoubin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [27] The sample complexity of exploration in the multi-armed bandit problem
    Mannor, S
    Tsitsiklis, JN
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 623 - 648
  • [28] Interface Design Optimization as a Multi-Armed Bandit Problem
    Lomas, J. Derek
    Forlizzi, Jodi
    Poonwala, Nikhil
    Patel, Nirmal
    Shodhan, Sharan
    Patel, Kishan
    Koedinger, Ken
    Brunskill, Emma
    34TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2016, 2016, : 4142 - 4153
  • [29] Online Optimization Algorithms for Multi-Armed Bandit Problem
    Kamalov, Mikhail
    Dobrynin, Vladimir
    Balykina, Yulia
    2017 CONSTRUCTIVE NONSMOOTH ANALYSIS AND RELATED TOPICS (DEDICATED TO THE MEMORY OF V.F. DEMYANOV) (CNSA), 2017, : 141 - 143
  • [30] THE MULTI-ARMED BANDIT PROBLEM: AN EFFICIENT NONPARAMETRIC SOLUTION
    Chan, Hock Peng
    ANNALS OF STATISTICS, 2020, 48 (01): : 346 - 373