共 50 条
- [1] Reward Redistribution for Reinforcement Learning of Dynamic Nonprehensile Manipulation [J]. 2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2021, : 326 - 331
- [2] Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (02): : 1014 - 1025
- [3] Generation of Roles in Reinforcement Learning Considering Redistribution of Reward between Agents [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2259 - +
- [7] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective [J]. Synthese, 2021, 198 : 6435 - 6467
- [8] A survey on interpretable reinforcement learning [J]. MACHINE LEARNING, 2024, 113 (08) : 5847 - 5890
- [9] Interpretable Control by Reinforcement Learning [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 8082 - 8089
- [10] Programmatically Interpretable Reinforcement Learning [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80