DYNAMIC SPECTRUM ACCESS WITH NON-STATIONARY MULTI-ARMED BANDIT

被引：8

作者：

Alaya-Feki, Afef Ben Hadj ^{[1
]}

Moulines, Eric ^{[1
]}

LeCornec, Alain ^{[1
]}

机构：

[1] Telecom ParisTech, Orange Labs, Paris, France

来源：

2008 IEEE 9TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, VOLS 1 AND 2 | 2008年

关键词：

Multi armed bandit; cognitive radio; opportunistic spectrum access;

D O I：

10.1109/SPAWC.2008.4641641

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.

引用

页码：416 / 420

页数：5

共 50 条

[1] The non-stationary stochastic multi-armed bandit problem
Allesiardo R.
Féraud R.
Maillard O.-A.
Allesiardo, Robin (robin.allesiardo@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (03): : 267 - 283
[2] Contextual Multi-Armed Bandit With Costly Feature Observation in Non-Stationary Environments
Ghoorchian, Saeed
Kortukov, Evgenii
Maghsudi, Setareh
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 820 - 830
[3] Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
Koulouriotis, D. E.
Xanthopoulos, A.
APPLIED MATHEMATICS AND COMPUTATION, 2008, 196 (02) : 913 - 922
[4] LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments
de Curto, J.
de Zarza, I.
Roig, Gemma
Cano, Juan Carlos
Manzoni, Pietro
Calafate, Carlos T.
ELECTRONICS, 2023, 12 (13)
[5] Non-stationary stochastic multi-armed bandit problems with external information on stationarity
Namba H.
Transactions of the Japanese Society for Artificial Intelligence, 2021, 36 (03) : D - K84_1
[6] Multi-Armed Bandit Learning in IoT Networks: Learning Helps Even in Non-stationary Settings
Bonnefoi, Remi
Besson, Lilian
Moy, Christophe
Kaufmann, Emilie
Palicot, Jacques
COGNITIVE RADIO ORIENTED WIRELESS NETWORKS, 2018, 228 : 173 - 185
[7] A Multi-armed Bandit Algorithm Available in Stationary or Non-stationary Environments Using Self-organizing Maps
Manome, Nobuhito
Shinohara, Shuji
Suzuki, Kouta
Tomonaga, Kosuke
Mitsuyoshi, Shunji
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 529 - 540
[8] Dynamic Multi-Armed Bandit with Covariates
Pavlidis, Nicos G.
Tasoulis, Dimitris K.
Adams, Niall M.
Hand, David J.
ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
[9] Opportunistic Spectrum Access Based on a Constrained Multi-Armed Bandit Formulation
Ai, Jing
Abouzeid, Alhussein A.
JOURNAL OF COMMUNICATIONS AND NETWORKS, 2009, 11 (02) : 134 - 147
[10] DBA: Dynamic Multi-Armed Bandit Algorithm
Nobari, Sadegh
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9869 - 9870

← 1 2 3 4 5 →