An N-step Look Ahead Algorithm Using Mixed (On and Off) Policy Reinforcement Learning

被引：0

作者：

Kuchibhotla, Vivek ^{[1
]}

Harshitha, P. ^{[1
]}

Goyal, Shobhit ^{[1
]}

机构：

[1] Bangalore Institute of Technology, Bangalore, India

来源：

Proceedings of the 3rd International Conference on Intelligent Sustainable Systems, ICISS 2020 | 2020年

关键词：

Compendex;

D O I：

9315959

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

页码：677 / 681

共 50 条

[1] Reinforcement learning control with n-step information for wastewater treatment systems
Li, Xin
Wang, Ding
Zhao, Mingming
Qiao, Junfei
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[2] Efficient Reinforcement Learning With the Novel N-Step Method and V-Network
Zhang, Miaomiao
Zhang, Shuo
Wu, Xinying
Shi, Zhiyi
Deng, Xiangyang
Wu, Edmond Q.
Xu, Xin
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024,
[3] A self-sustained EV charging framework with N-step deep reinforcement learning
Sykiotis, Stavros
Menos-Aikateriniadis, Christoforos
Doulamis, Anastasios
Doulamis, Nikolaos
Georgilakis, Pavlos S.
[J]. SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 35
[4] Mixed experience sampling for off-policy reinforcement learning
Yu, Jiayu
Li, Jingyao
Lu, Shuai
Han, Shuai
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
[5] Reinforcement Learning based Multi-Step Look-Ahead Bayesian Optimization
Cheon, Mujin
Byun, Haeun
Lee, Jay H.
[J]. IFAC PAPERSONLINE, 2022, 55 (07): : 100 - 105
[6] Why Not Look One Step Ahead in Reinforcement Learning Based Knowledge Graph Reasoning?
Wang, Hao
Song, Dandan
Wu, Zhijing
Tian, YuHang
Xu, Jing
[J]. SSRN,
[7] An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning
Al-Dabooni, Seaar
Wunsch, Donald C., II
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1155 - 1169
[8] Synchronous n-Step Method for Independent Q-Learning in Multi-Agent Deep Reinforcement Learning
Gong, Xudong
Ding, Bo
Xu, Jie
Wang, Huaimin
Zhou, Xing
Jia, Hongda
[J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 460 - 467
[9] Development of Variable Look-ahead Distance Tuning Algorithm for Autonomous Tractor Using a Reinforcement Learning Approach
Park S.-J.
Jeon C.-W.
Kim H.-J.
[J]. Journal of Institute of Control, Robotics and Systems, 2022, 28 (11) : 964 - 972
[10] One-step look-ahead policy for active learning reliability analysis
Pei, Pei
Zhou, Tong
[J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 236

← 1 2 3 4 5 →