共 50 条
- [3] Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 2372 - 2378
- [4] A Q-learning-based Multi-timescale Resilience Enhancement Approach for Power Grids with High Renewables 2024 IEEE 2ND INTERNATIONAL CONFERENCE ON POWER SCIENCE AND TECHNOLOGY, ICPST 2024, 2024, : 1919 - 1924
- [5] Design of cognitive radar jamming based on Q-learning algorithm Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2015, 35 (11): : 1194 - 1199
- [6] Q-learning intelligent jamming decision algorithm based on efficient upper confidence bound variance Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2022, 54 (05): : 162 - 170
- [7] A Multi-Parameter Intelligent Communication Anti-Jamming Method Based on Three-Dimensional Q-Learning 2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 205 - 210
- [8] Optimal method for the generation of the attack path based on the Q-learning decision Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2021, 48 (01): : 160 - 167
- [9] Q-Learning with probability based action policy 2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 210 - +
- [10] Cooperative Q-Learning Based on Maturity of the Policy 2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 1352 - 1356