Achieving fairness in the stochastic multi-armed bandit problem

被引:0
|
作者
Patil, Vishakha [1 ]
Ghalme, Ganesh [2 ]
Nair, Vineet [3 ]
Narahari, Y. [4 ]
机构
[1] Patil, Vishakha
[2] Ghalme, Ganesh
[3] Nair, Vineet
[4] Narahari, Y.
来源
| 1600年 / Microtome Publishing卷 / 22期
关键词
Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:1 / 31
相关论文
共 50 条
  • [41] Tug-of-War Model for Multi-armed Bandit Problem
    Kim, Song-Ju
    Aono, Masashi
    Hara, Masahiko
    UNCONVENTIONAL COMPUTATION, PROCEEDINGS, 2010, 6079 : 69 - +
  • [42] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
    Komiyama, Junpei
    Honda, Junya
    Nakagawa, Hiroshi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1152 - 1161
  • [43] Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem
    Salomon, Antoine
    Audibert, Jean-Yves
    El Alaoui, Issam
    JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 187 - 207
  • [44] DYNAMIC ALLOCATION INDEX FOR THE DISCOUNTED MULTI-ARMED BANDIT PROBLEM
    GITTINS, JC
    JONES, DM
    BIOMETRIKA, 1979, 66 (03) : 561 - 565
  • [45] Dynamic Multi-Armed Bandit with Covariates
    Pavlidis, Nicos G.
    Tasoulis, Dimitris K.
    Adams, Niall M.
    Hand, David J.
    ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
  • [46] Scaling Multi-Armed Bandit Algorithms
    Fouche, Edouard
    Komiyama, Junpei
    Boehm, Klemens
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1449 - 1459
  • [47] A stochastic multi-armed bandit approach to nonparametric H∞-norm estimation
    Mueller, Matias I.
    Valenzuela, Patricio E.
    Proutiere, Alexandre
    Rojas, Cristian R.
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [48] Satisficing in Multi-Armed Bandit Problems
    Reverdy, Paul
    Srivastava, Vaibhav
    Leonard, Naomi Ehrich
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3788 - 3803
  • [49] Multi-armed Bandit with Additional Observations
    Yun, Donggyu
    Proutiere, Alexandre
    Ahn, Sumyeong
    Shin, Jinwoo
    Yi, Yung
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2018, 2 (01)
  • [50] IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT
    POHLENZ, S
    MARKOV PROCESS AND CONTROL THEORY, 1989, 54 : 158 - 163