Non-Stationary Bandit Strategy for Rate Adaptation with Delayed Feedback

Cited by: 0

Authors:

Zhao, Yapeng [1,2]

Qian, Hua [2]

Kang, Kai [2]

Jin, Yanliang [1]

Affiliations:

[1] School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China

[2] Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai 201210, China

Source:

IEEE Access | 2020 / Vol. 8

Keywords:

Time division multiplexing;

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

Rate adaptation is an efficient mechanism for utilizing channel capacity by adjusting the modulation and coding scheme in a dynamic wireless environment. Channel feedback, such as acknowledgment/negative acknowledgment (ACK/NACK) messages, or channel measurements, such as the received signal strength indicator (RSSI), can be used to drive rate adaptation. Existing rate adaptation algorithms are mainly driven by heuristics and cannot achieve satisfactory transmission rates in a time-varying environment. In this paper, we focus on the rate adaptation problem in a time-division duplex (TDD) system. A multi-armed bandit (MAB) strategy is applied to learn changes in the channel condition from both RSSI and ACK/NACK signals, and a discounted upper confidence bound based rate adaptation (DUCB-RA) algorithm is proposed. We prove mathematically that the performance of the proposed algorithm converges to the optimum. Simulation results demonstrate that the proposed algorithm adapts to the time-varying channel and achieves higher transmission throughput than existing rate adaptation algorithms. © 2013 IEEE.
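The abstract does not give the details of DUCB-RA, so the following is only a minimal sketch of the generic discounted UCB idea it builds on, not the authors' algorithm. Each candidate rate is treated as an arm, the reward is the normalized rate when an ACK arrives and zero on a NACK, and older observations are geometrically down-weighted so the estimates can follow a changing channel. The class name `DiscountedUCB`, the helper `simulate`, the rate set, the ACK probabilities, and the values of `gamma` and `xi` are all hypothetical choices made for illustration. RSSI input and delayed feedback, which the paper addresses, are deliberately left out.

```python
import math
import random


class DiscountedUCB:
    """Generic discounted UCB over a finite set of arms (assumed: one arm per rate)."""

    def __init__(self, n_arms, gamma=0.995, xi=0.6, bound=1.0):
        self.gamma = gamma              # discount factor in (0, 1); assumed value
        self.xi = xi                    # exploration constant; assumed value
        self.bound = bound              # upper bound on the reward
        self.counts = [0.0] * n_arms    # discounted number of pulls per arm
        self.sums = [0.0] * n_arms      # discounted sum of rewards per arm

    def select(self):
        # Play every arm once before trusting the index.
        for arm, n in enumerate(self.counts):
            if n == 0.0:
                return arm
        total = sum(self.counts)

        def index(arm):
            mean = self.sums[arm] / self.counts[arm]
            log_term = max(0.0, math.log(total))
            bonus = 2.0 * self.bound * math.sqrt(self.xi * log_term / self.counts[arm])
            return mean + bonus

        return max(range(len(self.counts)), key=index)

    def update(self, arm, reward):
        # Discount all past observations, then add the new one.
        for i in range(len(self.counts)):
            self.counts[i] *= self.gamma
            self.sums[i] *= self.gamma
        self.counts[arm] += 1.0
        self.sums[arm] += reward


def simulate(steps_per_phase=2000, seed=0):
    """Toy two-phase channel: the highest rate works at first, then mostly fails."""
    rng = random.Random(seed)
    rates = [1.0, 2.0, 4.0]                 # hypothetical rates, arbitrary units
    phases = [[0.95, 0.90, 0.80],           # ACK probability per rate, good channel
              [0.95, 0.90, 0.05]]           # ACK probability per rate, degraded channel
    bandit = DiscountedUCB(len(rates))
    picks = []
    for ack_prob in phases:
        for _ in range(steps_per_phase):
            arm = bandit.select()
            ack = rng.random() < ack_prob[arm]
            reward = rates[arm] / max(rates) if ack else 0.0
            bandit.update(arm, reward)
            picks.append(arm)
    return picks


if __name__ == "__main__":
    picks = simulate()
    print("share of top rate, end of phase 1:", picks[1500:2000].count(2) / 500)
    print("share of top rate, end of phase 2:", picks[3500:4000].count(2) / 500)
```

In this toy setting the discount factor sets the effective memory (roughly 1 / (1 - gamma) recent transmissions), which is the trade-off a non-stationary bandit has to make: a shorter memory reacts faster to channel changes but explores more in steady state.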

Citation

Pages: 75503-75511
