共 50 条
- [41] Greedy exploration policy of Q-learning based on state balance TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 2556 - +
- [42] Exploration Among and Within Plateaus in Greedy Best-First Search TWENTY-SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2017, : 11 - 19
- [43] Dynamic mutation enhanced greedy strategy for wavefront shaping OPTICS AND LASER TECHNOLOGY, 2024, 169
- [46] Phased Exploration with Greedy Exploitation in Stochastic Combinatorial Partial Monitoring Games ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29