50 items in total
- [1] Trajectory-Based Off-Policy Deep Reinforcement Learning. International Conference on Machine Learning, vol. 97, 2019.
- [2] Trajectory-Based Modified Policy Iteration. Proceedings of World Academy of Science, Engineering and Technology, vol. 12, 2006, pp. 103-+.
- [5] Learning CPG-based biped locomotion with a policy gradient method. 2005 5th IEEE-RAS International Conference on Humanoid Robots, 2005, pp. 208-213.
- [6] Learning CPG-based biped locomotion with a policy gradient method. Matsubara, T. (takam-m@atr.jp), (Inst. of Elec. and Elec. Eng. Computer Society, 445 Hoes Lane - P.O.Box 1331, Piscataway, NJ 08855-1331, United States):
- [8] Trajectory-based Split Hindsight Reverse Curriculum Learning. 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 3971-3978.
- [9] Policy gradient reinforcement learning for fast quadrupedal locomotion. 2004 IEEE International Conference on Robotics and Automation, vols. 1-5, Proceedings, 2004, pp. 2619-2624.