共 50 条
- [41] Q-learning algorithm for optimal multilevel thresholding [J]. IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 335 - 340
- [43] An Efficient Hardware Implementation of Reinforcement Learning: The Q-Learning Algorithm [J]. IEEE ACCESS, 2019, 7 : 186340 - 186351
- [44] Q-learning with Logarithmic Regret [J]. 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [45] Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 2145 - 2153
- [46] Double Gumbel Q-Learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [47] Q-Learning: Theory and Applications [J]. ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 7, 2020, 2020, 7 : 279 - 301
- [48] Adaptive Bases for Q-learning [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 4587 - 4593
- [50] Q-Learning With Kalman Filters [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2939 - 2947