共 50 条
- [1] Towards Q-learning the Whittle Index for Restless Bandits [J]. 2019 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2019, : 249 - 254
- [2] On Learning Whittle Index Policy for Restless Bandits With Scalable Regret [J]. IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (03): : 1190 - 1202
- [3] Optimistic Whittle Index Policy: Online Learning for Restless Bandits [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10131 - 10139
- [5] Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [6] On the Whittle index of Markov modulated restless bandits [J]. QUEUEING SYSTEMS, 2022, 102 (3-4) : 373 - 430
- [7] On the Whittle index of Markov modulated restless bandits [J]. Queueing Systems, 2022, 102 : 373 - 430
- [9] On the computation of Whittle’s index for Markovian restless bandits [J]. Mathematical Methods of Operations Research, 2021, 93 : 179 - 208