共 50 条
- [41] My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
- [43] Tight last-iterate convergence rates for no-regret learning in multi-player games ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [44] Policy Iteration Adaptive Dynamic Programming for Optimal Control of Multi-Player Stackelberg-Nash Games 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2393 - 2397
- [45] My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [47] EVENT-TRIGGERED ADAPTIVE CONTROL FOR NONLINEAR MULTI-PLAYER GAMES USING NEURAL CRITIC LEARNING INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2024, 20 (05): : 1257 - 1275
- [50] Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning IEEE ACCESS, 2020, 8 : 208938 - 208951