共 50 条
- [41] Non-cooperative Target Assignment using Regret Matching 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 787 - 792
- [42] Neural Regret-Matching for Distributed Constraint Optimization Problems PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 146 - 153
- [44] Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [46] Constant or Logarithmic Regret in Asynchronous Multiplayer Bandits with Limited Communication INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [47] Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [48] Dynamic Regret of Online Markov Decision Processes INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [49] Distributed Estimation of Dynamic Parameters : Regret Analysis 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 1066 - 1071
- [50] Unconstrained Dynamic Regret via Sparse Coding ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,