共 10 条
- [2] GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces ARTRIFICIAL GENERAL INTELLIGENCE, AGI 2010, 2010, 10 : 91 - 96
- [4] IMPROVING REINFORCEMENT LEARNING USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009 EUROCON 2009: INTERNATIONAL IEEE CONFERENCE DEVOTED TO THE 150 ANNIVERSARY OF ALEXANDER S. POPOV, VOLS 1- 4, PROCEEDINGS, 2009, : 1716 - 1722
- [7] A temporal-difference learning method using gaussian state representation for continuous state space problems 1600, Japanese Society for Artificial Intelligence (29):