Meta-Scheduling for the Wireless Downlink Through Learning With Bandit Feedback

被引:3
|
作者
Song, Jianhan [1 ]
de Veciana, Gustavo [1 ]
Shakkottai, Sanjay [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Cockrell Sch Engn, Austin, TX 78712 USA
关键词
Wireless communication; Measurement; Throughput; Optimal scheduling; Heuristic algorithms; Job shop scheduling; Dynamic scheduling; Online learning; bandit algorithms; upper confidence bound; wireless networks; scheduling; TIME;
D O I
10.1109/TNET.2021.3117783
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study learning-assisted multi-user scheduling for the wireless downlink. There have been many scheduling algorithms developed that optimize for a plethora of performance metrics; however a systematic approach across diverse performance metrics and deployment scenarios is still lacking. We address this by developing a meta-scheduler - given a diverse collection of schedulers, we develop a learning-based overlay algorithm (meta-scheduler) that selects that ``best'' scheduler from amongst these for each deployment scenario. More formally, we develop a multi-armed bandit (MAB) framework for meta-scheduling that assigns and adapts a score for each scheduler to maximize reward (e.g., mean delay, timely throughput etc.). The meta-scheduler is based on a variant of the Upper Confidence Bound algorithm (UCB), but adapted to interrupt the queuing dynamics at the base-station so as to filter out schedulers that might render the system unstable. We show that the algorithm has a poly-logarithmic regret in the expected reward with respect to a genie that chooses the optimal scheduler for each scenario. Finally through simulation, we show that the meta-scheduler learns the choice of the scheduler to best adapt to the deployment scenario (e.g. load conditions, performance metrics).
引用
收藏
页码:487 / 500
页数:14
相关论文
共 50 条
  • [21] PRACTICAL EXPERIENCES WITH TORQUE META-SCHEDULING IN THE CZECH NATIONAL GRID
    Toth, Simon
    Ruda, Miroslav
    COMPUTER SCIENCE-AGH, 2012, 13 (02): : 33 - 45
  • [22] Bandit Learning with Biased Human Feedback
    Tang, Wei
    Ho, Chien-Ju
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1324 - 1332
  • [23] Learning with Bandit Feedback in Potential Games
    Cohen, Johanne
    Heliou, Amelie
    Mertikopoulos, Panayotis
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [24] Learning in Congestion Games with Bandit Feedback
    Cui, Qiwen
    Xiong, Zhihan
    Fazel, Maryam
    Du, Simon S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [25] The Blinded Bandit: Learning with Adaptive Feedback
    Dekel, Ofer
    Hazan, Elad
    Koren, Tomer
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [26] Learning from eXtreme Bandit Feedback
    Lopez, Romain
    Dhillon, Inderjit S.
    Jordan, Michael, I
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8732 - 8740
  • [27] Performance investigation of weighted meta-scheduling algorithm for scientific grid
    Song, J
    Koh, CK
    See, S
    Leng, GK
    GRID AND COOPERATIVE COMPUTING - GCC 2005, PROCEEDINGS, 2005, 3795 : 1021 - 1030
  • [28] Dynamic Meta-Scheduling Architecture based on Monitoring in Distributed Systems
    Pop, Florin
    Dobre, Ciprian
    Stratan, Corina
    Costan, Alexandru
    Cristea, Valentin
    CISIS: 2009 INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, VOLS 1 AND 2, 2009, : 388 - 395
  • [29] Batch Learning from Logged Bandit Feedback through Counterfactual Risk Minimization
    Swaminathan, Adith
    Joachims, Thorsten
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 1731 - 1755
  • [30] ARiA: A Protocol for Dynamic Fully Distributed Grid Meta-Scheduling
    Brocco, Amos
    Malatras, Apostolos
    Huang, Ye
    Hirsbrunner, Beat
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,