共 50 条
- [41] Misleading Inference Generation via Proximal Policy Optimization ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 497 - 509
- [42] DNA: Proximal Policy Optimization with a Dual Network Architecture ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [43] Trust Region-Guided Proximal Policy Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [44] Supply Chain Capacity Competition and Optimization Based on Online Transaction Advantage of Backwardness 2016 2ND INTERNATIONAL CONFERENCE ON MODERN EDUCATION AND SOCIAL SCIENCE (MESS 2016), 2016, : 976 - 984
- [45] Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [46] Reactive Power Optimization Based on Proximal Policy Optimization of Deep Reinforcement Learning Dianwang Jishu/Power System Technology, 2023, 47 (02): : 562 - 570
- [47] Proximal policy optimization-based join order optimization with spark SQL Lee, Kyu-Chul (kclee@cnu.ac.kr), 1600, Institute of Electronics Engineers of Korea (10): : 227 - 232
- [48] GREEN SIMULATION BASED POLICY OPTIMIZATION WITH PARTIAL HISTORICAL TRAJECTORY REUSE 2022 WINTER SIMULATION CONFERENCE (WSC), 2022, : 168 - 179
- [49] Soft policy optimization using dual-track advantage estimator 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1064 - 1069