An N-step Look Ahead Algorithm Using Mixed (On and Off) Policy Reinforcement Learning

被引:0
|
作者
Kuchibhotla, Vivek [1 ]
Harshitha, P. [1 ]
Goyal, Shobhit [1 ]
机构
[1] Bangalore Institute of Technology, Bangalore, India
关键词
Compendex;
D O I
9315959
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
页码:677 / 681
相关论文
共 50 条
  • [1] Reinforcement learning control with n-step information for wastewater treatment systems
    Li, Xin
    Wang, Ding
    Zhao, Mingming
    Qiao, Junfei
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [2] Efficient Reinforcement Learning With the Novel N-Step Method and V-Network
    Zhang, Miaomiao
    Zhang, Shuo
    Wu, Xinying
    Shi, Zhiyi
    Deng, Xiangyang
    Wu, Edmond Q.
    Xu, Xin
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024,
  • [3] A self-sustained EV charging framework with N-step deep reinforcement learning
    Sykiotis, Stavros
    Menos-Aikateriniadis, Christoforos
    Doulamis, Anastasios
    Doulamis, Nikolaos
    Georgilakis, Pavlos S.
    [J]. SUSTAINABLE ENERGY GRIDS & NETWORKS, 2023, 35
  • [4] Mixed experience sampling for off-policy reinforcement learning
    Yu, Jiayu
    Li, Jingyao
    Lu, Shuai
    Han, Shuai
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [5] Reinforcement Learning based Multi-Step Look-Ahead Bayesian Optimization
    Cheon, Mujin
    Byun, Haeun
    Lee, Jay H.
    [J]. IFAC PAPERSONLINE, 2022, 55 (07): : 100 - 105
  • [6] Why Not Look One Step Ahead in Reinforcement Learning Based Knowledge Graph Reasoning?
    Wang, Hao
    Song, Dandan
    Wu, Zhijing
    Tian, YuHang
    Xu, Jing
    [J]. SSRN,
  • [7] An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning
    Al-Dabooni, Seaar
    Wunsch, Donald C., II
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1155 - 1169
  • [8] Synchronous n-Step Method for Independent Q-Learning in Multi-Agent Deep Reinforcement Learning
    Gong, Xudong
    Ding, Bo
    Xu, Jie
    Wang, Huaimin
    Zhou, Xing
    Jia, Hongda
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 460 - 467
  • [9] Development of Variable Look-ahead Distance Tuning Algorithm for Autonomous Tractor Using a Reinforcement Learning Approach
    Park S.-J.
    Jeon C.-W.
    Kim H.-J.
    [J]. Journal of Institute of Control, Robotics and Systems, 2022, 28 (11) : 964 - 972
  • [10] One-step look-ahead policy for active learning reliability analysis
    Pei, Pei
    Zhou, Tong
    [J]. RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 236