共 50 条
- [1] LEARNING TO PLAY K-ARMED BANDIT PROBLEMS ICAART: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2012, : 74 - 81
- [3] Contextual Q-Learning ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2927 - 2928
- [4] Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 475 - 483
- [5] Fuzzy Q-Learning for generalization of reinforcement learning FUZZ-IEEE '96 - PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 1996, : 2208 - 2214
- [6] Deep Reinforcement Learning with Double Q-Learning THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2094 - 2100
- [7] Reinforcement learning guidance law of Q-learning Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (02): : 414 - 419
- [8] Choosing optimal seller based on off-line learning negotiation history and K-armed bandit problem Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 155 - 160
- [9] Multi-Agent Reinforcement Learning - An Exploration Using Q-Learning RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXVI: INCORPORATING APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XVII, 2010, : 293 - 298