Dynamic fusion for ensemble of deep Q-network

被引:0
|
作者
Patrick P. K. Chan
Meng Xiao
Xinran Qin
Natasha Kees
机构
[1] South China University of Technology,School of Computer Science and Engineering
关键词
Ensemble; Deep reinforcement learning; Dynamic fusion; Deep Q-network;
D O I
暂无
中图分类号
学科分类号
摘要
Ensemble reinforcement learning, which combines the decisions of a set of base agents, is proposed to enhance the decision making process and speed up training time. Many studies indicate that an ensemble model may achieve better results than a single agent because of the complement of base agents, in which the error of an agent may be corrected by others. However, the fusion method is a fundamental issue in ensemble. Currently, existing studies mainly focus on static fusion which either assumes all agents have the same ability or ignores the ones with poor average performance. This assumption causes current static fusion methods to overlook base agents with poor overall performance, but excellent results in select scenarios, which results in the ability of some agents not being fully utilized. This study aims to propose a dynamic fusion method which utilizes each base agent according to its local competence on test states. The performance of a base agent on the validation set is measured in terms of the rewards achieved by the agent in next n steps. The similarity between a validation state and a new state is quantified by Euclidian distance in the latent space and the weights of each base agent are updated according to its performance on validation states and their similarity to a new state. The experimental studies confirm that the proposed dynamic fusion method outperforms its base agents and also the static fusion methods. This is the first dynamic fusion method proposed for deep reinforcement learning, which extends the study on dynamic fusion from classification to reinforcement learning.
引用
收藏
页码:1031 / 1040
页数:9
相关论文
共 50 条
  • [21] Social Attentive Deep Q-network for Recommendation
    Lei, Yu
    Wang, Zhitao
    Li, Wenjie
    Pei, Hongbin
    PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 1189 - 1192
  • [22] Learning to schedule dynamic distributed reconfigurable workshops using expected deep Q-network
    Yang, Shengluo
    Wang, Junyi
    Xu, Zhigang
    ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [23] AGV Path Planning with Dynamic Obstacles Based on Deep Q-Network and Distributed Training
    Xie, Tingbo
    Yao, Xifan
    Jiang, Zhenhong
    Meng, Junting
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING-GREEN TECHNOLOGY, 2025,
  • [24] Discrete Dynamic Berth Allocation Optimization in Container Terminal Based on Deep Q-Network
    Wang, Peng
    Li, Jie
    Cao, Xiaohua
    MATHEMATICS, 2024, 12 (23)
  • [25] Deep Attention Q-Network for Personalized Treatment Recommendation
    Ma, Simin
    Lee, Junghwan
    Serban, Nicoleta
    Yang, Shihao
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 329 - 337
  • [26] Query Join Order Optimization Method Based on Dynamic Double Deep Q-Network
    Ji, Lixia
    Zhao, Runzhe
    Dang, Yiping
    Liu, Junxiu
    Zhang, Han
    ELECTRONICS, 2023, 12 (06)
  • [27] Double Deep Q-Network Based Dynamic Framing Offloading in Vehicular Edge Computing
    Tang, Huijun
    Wu, Huaming
    Qu, Guanjin
    Li, Ruidong
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (03): : 1297 - 1310
  • [28] Multi-Dimensional Double Deep Dynamic Q-Network with Aligned Q-Fusion for Dual-Ring Barrier Traffic Signal Control
    Zheng, Qiming
    Xu, Hongfeng
    Chen, Jingyun
    Zhang, Kun
    APPLIED SCIENCES-BASEL, 2025, 15 (03):
  • [29] Accurate Price Prediction by Double Deep Q-Network
    Feizi-Derakhshi, Mohammad-Reza
    Lotfimanesh, Bahram
    Amani, Omid
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAN JOURNAL OF ARTIFICIAL INTELLIGENCE, 2024, 27 (74): : 12 - 21
  • [30] Train Scheduling with Deep Q-Network: A Feasibility Test
    Gong, Intaek
    Oh, Sukmun
    Min, Yunhong
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 14