CCN Interest Forwarding Strategy as Multi-Armed Bandit Model with Delays

被引：0

作者：

Avrachenkov, Konstantin ^{[1
]}

Jacko, Peter ^{[2
]}

机构：

[1] INRIA Sophia Antipolis, Biot, France

[2] BCAM, Bilbao, Spain

来源：

2012 6TH INTERNATIONAL CONFERENCE ON NETWORK GAMES, CONTROL AND OPTIMIZATION (NETGCOOP) | 2012年

关键词：

PROBABILITY-INEQUALITIES;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

We consider Content Centric Network (CCN) interest forwarding problem as a Multi-Armed Bandit (MAB) problem with delays. We investigate the transient behaviour of the epsilon-greedy, tuned epsilon-greedy and Upper Confidence Bound (UCB) interest forwarding policies. Surprisingly, for all the three policies very short initial exploratory phase is needed. We demonstrate that the tuned epsilon-greedy algorithm is nearly as good as the UCB algorithm, commonly reported as the best currently available algorithm. We prove the uniform logarithmic bound for the tuned epsilon-greedy algorithm in the presence of delays. In addition to its immediate application to CCN interest forwarding, the new theoretical results for MAB problem with delays represent significant theoretical advances in machine learning discipline.

引用

页码：38 / 43

页数：6

共 50 条

[41] An Adaptive Algorithm in Multi-Armed Bandit Problem
Zhang X.
Zhou Q.
Liang B.
Xu J.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (03): : 643 - 654
[42] Multi-Armed Recommender System Bandit Ensembles
Canamares, Rocio
Redondo, Marcos
Castells, Pablo
RECSYS 2019: 13TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2019, : 432 - 436
[43] Noise Free Multi-armed Bandit Game
Nakamura, Atsuyoshi
Helmbold, David P.
Warmuth, Manfred K.
LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS, LATA 2016, 2016, 9618 : 412 - 423
[44] Ambiguity aversion in multi-armed bandit problems
Anderson, Christopher M.
THEORY AND DECISION, 2012, 72 (01) : 15 - 33
[45] Robust control of the multi-armed bandit problem
Felipe Caro
Aparupa Das Gupta
Annals of Operations Research, 2022, 317 : 461 - 480
[46] CHARACTERIZING TRUTHFUL MULTI-ARMED BANDIT MECHANISMS
Babaioff, Moshe
Sharma, Yogeshwer
Slivkins, Aleksandrs
SIAM JOURNAL ON COMPUTING, 2014, 43 (01) : 194 - 230
[47] Multi-armed Bandit Problems with Strategic Arms
Braverman, Mark
Mao, Jieming
Schneider, Jon
Weinberg, S. Matthew
CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
[48] Ambiguity aversion in multi-armed bandit problems
Christopher M. Anderson
Theory and Decision, 2012, 72 : 15 - 33
[49] Multi-armed bandit problem with known trend
Bouneffouf, Djallel
Feraud, Raphael
NEUROCOMPUTING, 2016, 205 : 16 - 21
[50] A Multi-Armed Bandit Hyper-Heuristic
Ferreira, Alexandre Silvestre
Goncalves, Richard Aderbal
Ramirez Pozo, Aurora Trinidad
2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 13 - 18

← 1 2 3 4 5 →