共 50 条
- [1] Towards Robust Off-Policy Learning for Runtime Uncertainty THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 10101 - 10109
- [2] Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [3] VALUE-AWARE IMPORTANCE WEIGHTING FOR OFF-POLICY REINFORCEMENT LEARNING CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 745 - 763
- [4] Boosted Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [5] Uncertainty-Aware Policy Sampling and Mixing for Safe Interactive Imitation Learning 2021 18TH CONFERENCE ON ROBOTS AND VISION (CRV 2021), 2021, : 72 - 78
- [6] GaussianMask: Uncertainty-aware Instance Segmentation based on Gaussian Modeling 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3851 - 3857
- [8] Learning with Options that Terminate Off-Policy THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3173 - 3182
- [9] An off-policy least square algorithms with eligibility trace based on importance reweighting Cluster Computing, 2017, 20 : 3475 - 3487
- [10] An off-policy least square algorithms with eligibility trace based on importance reweighting CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3475 - 3487