21 entries in total
- [1] EPSILON-OPTIMAL DISCRETIZED PURSUIT LEARNING AUTOMATA. 1989 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-3: CONFERENCE PROCEEDINGS, 1989: 6-12
- [3] AntNet with Reward-Penalty Reinforcement Learning. 2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN), 2010: 17-21
- [4] THE ASYMPTOTIC OPTIMALITY OF DISCRETIZED LINEAR REWARD INACTION LEARNING AUTOMATA. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1984, 14 (03): 542-545
- [5] 2 EPSILON-OPTIMAL NONLINEAR REINFORCEMENT SCHEMES FOR STOCHASTIC AUTOMATA. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1974, SMC4 (01): 126-131
- [6] EPSILON-OPTIMAL STUBBORN LEARNING-MECHANISMS. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1990, 20 (05): 1209-1216
- [7] Reward-penalty reinforcement learning scheme for planning and reactive behavior. 1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998: 1485-1490
- [9] Learning Non-Unique Segmentation with Reward-Penalty Dice Loss. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020