DYNAMIC SPECTRUM ACCESS WITH NON-STATIONARY MULTI-ARMED BANDIT

被引:8
|
作者
Alaya-Feki, Afef Ben Hadj [1 ]
Moulines, Eric [1 ]
LeCornec, Alain [1 ]
机构
[1] Telecom ParisTech, Orange Labs, Paris, France
来源
2008 IEEE 9TH WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, VOLS 1 AND 2 | 2008年
关键词
Multi armed bandit; cognitive radio; opportunistic spectrum access;
D O I
10.1109/SPAWC.2008.4641641
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Dynamic spectrum access (DSA) is an emerging notion in cognitive radio, aiming to improve the spectrum usage with reliable secondary access to the spectral resources. The main challenge in DSA is the detection of spectral opportunities and their efficient utilization without causing interference to the primary users. For this goal, we propose to make use of a reinforcement learning approach: the Multi Armed Bandit (MAB). The MAB approach provides the secondary users with the rules and policies necessary to achieve a tradeoff between exploitation and exploration in DSA. Different MAB strategies are tested on an IEEE802.11 medium access model and evaluated in dynamic environment. Our study shows that the MAB constitute a viable solution for the DSA. Adding to that, the performances of the MAB algorithms can be improved with a finite tuning of the internal parameters.
引用
收藏
页码:416 / 420
页数:5
相关论文
共 50 条
  • [1] The non-stationary stochastic multi-armed bandit problem
    Allesiardo R.
    Féraud R.
    Maillard O.-A.
    Allesiardo, Robin (robin.allesiardo@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (03): : 267 - 283
  • [2] Contextual Multi-Armed Bandit With Costly Feature Observation in Non-Stationary Environments
    Ghoorchian, Saeed
    Kortukov, Evgenii
    Maghsudi, Setareh
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 820 - 830
  • [3] Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
    Koulouriotis, D. E.
    Xanthopoulos, A.
    APPLIED MATHEMATICS AND COMPUTATION, 2008, 196 (02) : 913 - 922
  • [4] LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments
    de Curto, J.
    de Zarza, I.
    Roig, Gemma
    Cano, Juan Carlos
    Manzoni, Pietro
    Calafate, Carlos T.
    ELECTRONICS, 2023, 12 (13)
  • [6] Multi-Armed Bandit Learning in IoT Networks: Learning Helps Even in Non-stationary Settings
    Bonnefoi, Remi
    Besson, Lilian
    Moy, Christophe
    Kaufmann, Emilie
    Palicot, Jacques
    COGNITIVE RADIO ORIENTED WIRELESS NETWORKS, 2018, 228 : 173 - 185
  • [7] A Multi-armed Bandit Algorithm Available in Stationary or Non-stationary Environments Using Self-organizing Maps
    Manome, Nobuhito
    Shinohara, Shuji
    Suzuki, Kouta
    Tomonaga, Kosuke
    Mitsuyoshi, Shunji
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: THEORETICAL NEURAL COMPUTATION, PT I, 2019, 11727 : 529 - 540
  • [8] Dynamic Multi-Armed Bandit with Covariates
    Pavlidis, Nicos G.
    Tasoulis, Dimitris K.
    Adams, Niall M.
    Hand, David J.
    ECAI 2008, PROCEEDINGS, 2008, 178 : 777 - +
  • [9] Opportunistic Spectrum Access Based on a Constrained Multi-Armed Bandit Formulation
    Ai, Jing
    Abouzeid, Alhussein A.
    JOURNAL OF COMMUNICATIONS AND NETWORKS, 2009, 11 (02) : 134 - 147
  • [10] DBA: Dynamic Multi-Armed Bandit Algorithm
    Nobari, Sadegh
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9869 - 9870