共 50 条
- [2] Inverted pendulum control of double q-learning reinforcement learning algorithm based on neural network UPB Scientific Bulletin, Series D: Mechanical Engineering, 2020, 82 (02): : 15 - 26
- [9] An Optimized Q-Learning Algorithm Based on the Thinking of Tabu Search PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN, VOL 1, 2008, : 533 - 536