共 50 条
- [5] Reinforcement learning for POMDPs based on action values and stochastic optimization [J]. EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 199 - 204
- [6] Controller Optimization for Multirate Systems Based on Reinforcement Learning [J]. International Journal of Automation and Computing, 2020, 17 : 417 - 427
- [9] Selective maintenance optimization with stochastic break duration based on reinforcement learning [J]. EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2022, 24 (04): : 771 - 784
- [10] Variational quantum reinforcement learning via evolutionary optimization [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):