50 entries in total
- [21] Time-Decaying Bandits for Non-stationary Systems. WEB AND INTERNET ECONOMICS, 2014, 8877: 460-466
- [23] Non-Stationary Representation Learning in Sequential Linear Bandits. IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2022, 1: 41-56
- [24] Randomized Exploration for Non-Stationary Stochastic Linear Bandits. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124: 71-80
- [25] Non-stationary Dueling Bandits for Online Learning to Rank. WEB AND BIG DATA, PT II, APWEB-WAIM 2022, 2023, 13422: 166-174
- [26] Reward Attack on Stochastic Bandits with Non-stationary Rewards. FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023: 1387-1393
- [27] Non-stationary Projection-Free Online Learning with Dynamic and Adaptive Regret Guarantees. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 14, 2024: 15671-15679
- [28] Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023: 7405-7413
- [29] Non-Stationary Bandits with Auto-Regressive Temporal Dependency. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023