共 50 条
- [1] A,Multiagent approach to Q-learning for daily stock trading [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2007, 37 (06): : 864 - 877
- [3] Q-learning with Experience Replay in a Dynamic Environment [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
- [4] Improved Fuzzy Q-Learning with Replay Memory [J]. FUZZY INFORMATION PROCESSING 2020, 2022, 1337 : 13 - 23
- [6] Comparing Multi-Armed Bandit Algorithms and Q-learning for Multiagent Action Selection: a Case Study in Route Choice [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
- [7] Convergence of optimistic and incremental Q-learning [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1499 - 1506
- [8] Multiagent Q-learning with Sub-Team Coordination [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,