Achieving fairness in the stochastic multi-armed bandit problem

Cited by: 0

Authors:

Patil, Vishakha ^{[1]}

Ghalme, Ganesh ^{[2]}

Nair, Vineet ^{[3]}

Narahari, Y. ^{[4]}

Affiliations:

[1] Patil, Vishakha

[2] Ghalme, Ganesh

[3] Nair, Vineet

[4] Narahari, Y.

Source:

Year: 1600 / Volume: Microtome Publishing / Issue: 22

Keywords:

Reinforcement learning;

DOI:

Not available

Citations

Pages: 1 / 31

50 entries in total

[41] Tug-of-War Model for Multi-armed Bandit Problem
Kim, Song-Ju
Aono, Masashi
Hara, Masahiko
UNCONVENTIONAL COMPUTATION, PROCEEDINGS, 2010, 6079 : 69 - +
[42] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
Komiyama, Junpei
Honda, Junya
Nakagawa, Hiroshi
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1152 - 1161
[43] Lower Bounds and Selectivity of Weak-Consistent Policies in Stochastic Multi-Armed Bandit Problem
Salomon, Antoine
Audibert, Jean-Yves
El Alaoui, Issam
JOURNAL OF MACHINE LEARNING RESEARCH, 2013, 14 : 187 - 207
[44] DYNAMIC ALLOCATION INDEX FOR THE DISCOUNTED MULTI-ARMED BANDIT PROBLEM
GITTINS, JC
JONES, DM
BIOMETRIKA, 1979, 66 (03) : 561 - 565
[45] Dynamic Multi-Armed Bandit with Covariates
Pavlidis, Nicos G.
Tasoulis, Dimitris K.
Adams, Niall M.
Hand, David J.
ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
[46] Scaling Multi-Armed Bandit Algorithms
Fouche, Edouard
Komiyama, Junpei
Boehm, Klemens
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1449 - 1459
[47] A stochastic multi-armed bandit approach to nonparametric H∞-norm estimation
Mueller, Matias I.
Valenzuela, Patricio E.
Proutiere, Alexandre
Rojas, Cristian R.
2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
[48] Satisficing in Multi-Armed Bandit Problems
Reverdy, Paul
Srivastava, Vaibhav
Leonard, Naomi Ehrich
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3788 - 3803
[49] Multi-armed Bandit with Additional Observations
Yun, Donggyu
Proutiere, Alexandre
Ahn, Sumyeong
Shin, Jinwoo
Yi, Yung
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2018, 2 (01)
[50] IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT
POHLENZ, S
MARKOV PROCESS AND CONTROL THEORY, 1989, 54 : 158 - 163
