Reinforcement Learning with Budget-Constrained Nonparametric Function Approximation for Opportunistic Spectrum Access

被引:0
|
作者
Tsiligkaridis, Theodoros [1 ]
Romero, David [1 ]
机构
[1] MIT, Lincoln Lab, Cambridge, MA 02139 USA
关键词
Reinforcement Learning; Kernel Method; Opportunistic Spectrum Access;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Opportunistic spectrum access is one of the emerging techniques for maximizing throughput in congested bands and is enabled by predicting idle slots in spectrum. We propose a kernel-based reinforcement learning approach coupled with a novel budget-constrained sparsification technique that efficiently captures the environment to find the best channel access actions. This approach allows learning and planning over the intrinsic state-action space and extends well to large state spaces. We apply our methods to evaluate coexistence of a reinforcement learning-based radio with a multi-channel adversarial radio and a single-channel carrier-sense multiple-access with collision avoidance (CSMA-CA) radio. Numerical experiments show the performance gains over carrier-sense systems.
引用
收藏
页码:579 / 583
页数:5
相关论文
共 50 条
  • [21] Opportunistic spectrum access for energy-constrained cognitive radios
    Hoang, Anh Tuan
    Liang, Ying-Chang
    Wong, David Tung Chong
    Zhang, Rui
    Zeng, Yonghong
    [J]. 2008 IEEE 67TH VEHICULAR TECHNOLOGY CONFERENCE-SPRING, VOLS 1-7, 2008, : 1559 - 1563
  • [22] Bursty traffic in energy-constrained opportunistic spectrum access
    Chen, Yunxia
    Zhao, Qing
    Swami, Ananthram
    [J]. GLOBECOM 2007: 2007 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-11, 2007, : 4641 - +
  • [23] GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs
    Manchanda, Sahil
    Mittal, Akash
    Dhawan, Anuj
    Medya, Sourav
    Ranu, Sayan
    Singh, Ambuj
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [24] Evolutionary function approximation for reinforcement learning
    Whiteson, Shimon
    Stone, Peter
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2006, 7 : 877 - 917
  • [25] Distributed cognitive MAC for energy-constrained opportunistic spectrum access
    Chen, Yunxia
    Zhao, Qing
    Swami, Ananthram
    [J]. MILCOM 2006, VOLS 1-7, 2006, : 1904 - +
  • [26] Approximately Optimal Adaptive Learning in Opportunistic Spectrum Access
    Tekin, Cem
    Liu, Mingyan
    [J]. 2012 PROCEEDINGS IEEE INFOCOM, 2012, : 1548 - 1556
  • [27] Decentralized Online Learning Algorithms for Opportunistic Spectrum Access
    Gai, Yi
    Krishnamachari, Bhaskar
    [J]. 2011 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE (GLOBECOM 2011), 2011,
  • [28] Dynamic Spectrum Anti-Jamming With Reinforcement Learning Based on Value Function Approximation
    Zhu, Xinyu
    Huang, Yang
    Wang, Shaoyu
    Wu, Qihui
    Ge, Xiaohu
    Liu, Yuan
    Gao, Zhen
    [J]. IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (02) : 386 - 390
  • [29] Online budget-feasible mobile crowdsensing with constrained reinforcement learning
    Zhang, Bolei
    Wu, Lifa
    [J]. Journal of Supercomputing, 2025, 81 (01):
  • [30] Multiagent reinforcement learning using function approximation
    Abul, O
    Polat, F
    Alhajj, R
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2000, 30 (04): : 485 - 497