共 13 条
- [3] Sample-Efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9840 - 9848
- [5] Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01): : 196 - 209
- [7] The Cross-Entropy Method for Continuous Multi-Extremal Optimization Methodology and Computing in Applied Probability, 2006, 8 : 383 - 407