Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Authors
Min Yang [1]
Guanjun Liu [2, 1]
Ziyuan Zhou [1]
Jiacun Wang [2, 3]
Affiliations
[1] the Department of Computer Science, Tongji University
[2] IEEE
[3] the Computer Science and Software Engineering Department, Monmouth University, West Long
Abstract
Deep reinforcement learning (DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management. However, because of the model's inherent uncertainty, rigorous validation is required before it is applied to real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the "black-box" nature of DRL makes model behavior difficult to test. We propose a novel performance improvement framework based on probabilistic automata, which aims to proactively identify and correct critical vulnerabilities of DRL systems so that the performance of DRL models in real tasks can be improved with minimal model modifications. First, a probabilistic automaton is constructed from the historical trajectories of the DRL system by abstracting states into probabilistic decision-making units (PDMUs), and a reverse breadth-first search (BFS) method is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under the key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, the undesirable action, and the new action are encapsulated as monitors that guide the DRL system toward more favorable results through real-time monitoring and correction. Evaluations in two standard reinforcement learning environments and three real job scheduling scenarios confirm the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.
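The pipeline described in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the grid-based state abstraction, the scoring rule (estimated probability of reaching the outcome multiplied by visit count), the fixed number of update sweeps, and all function names (`abstract_state`, `build_automaton`, `reverse_bfs_order`, `pair_scores`, `make_monitor`) are assumptions made for illustration. Like the abstract's method, it uses only the state-action sequence and final result of each trajectory.

```python
from collections import defaultdict, deque

BAD, GOOD = "BAD", "GOOD"  # terminal nodes standing for a trajectory's final result


def abstract_state(state, grid=1.0):
    """Assumed abstraction: bucket each state dimension into cells of width `grid`."""
    return tuple(int(x // grid) for x in state)


def build_automaton(trajectories, grid=1.0):
    """Count (PDMU, action) -> next-node transitions from (steps, outcome) pairs."""
    counts = defaultdict(lambda: defaultdict(int))
    for steps, outcome in trajectories:
        pdmus = [abstract_state(s, grid) for s, _ in steps]
        for i, (_, action) in enumerate(steps):
            nxt = pdmus[i + 1] if i + 1 < len(steps) else outcome
            counts[(pdmus[i], action)][nxt] += 1
    return counts


def reverse_bfs_order(counts, target):
    """PDMUs that can reach `target`, nearest first (BFS over reversed edges)."""
    preds = defaultdict(set)
    for (pdmu, _), nexts in counts.items():
        for nxt in nexts:
            preds[nxt].add(pdmu)
    order, seen, queue = [], {target}, deque([target])
    while queue:
        node = queue.popleft()
        for p in sorted(preds[node]):
            if p not in seen:
                seen.add(p)
                order.append(p)
                queue.append(p)
    return order


def pair_scores(counts, target, sweeps=50):
    """Estimate P(reach target | PDMU, action) from empirical transition counts."""
    order = reverse_bfs_order(counts, target)
    by_pdmu = defaultdict(list)
    for pair in counts:
        by_pdmu[pair[0]].append(pair)
    value = defaultdict(float, {target: 1.0})  # P(reach target | node), 0 if unknown
    score = {}
    for _ in range(sweeps):  # repeated sweeps handle cycles approximately
        for pdmu in order:
            visits, weighted = 0, 0.0
            for pair in by_pdmu[pdmu]:
                n = sum(counts[pair].values())
                score[pair] = sum(c * value[nxt] for nxt, c in counts[pair].items()) / n
                visits += n
                weighted += n * score[pair]
            value[pdmu] = weighted / visits
    return score


def make_monitor(counts, grid=1.0):
    """Pick the key PDMU-action pair and a replacement action; wrap them as a monitor."""
    bad, good = pair_scores(counts, BAD), pair_scores(counts, GOOD)
    visits = {pair: sum(n.values()) for pair, n in counts.items()}
    # Assumed impact measure: probability of the bad outcome times visit count.
    key_pdmu, bad_action = max(bad, key=lambda p: bad[p] * visits[p])
    options = [p for p in good if p[0] == key_pdmu and p[1] != bad_action]
    new_action = max(options, key=good.get)[1] if options else bad_action

    def monitor(state, proposed_action):
        # Override the policy only in the key PDMU and only for the undesirable action.
        if abstract_state(state, grid) == key_pdmu and proposed_action == bad_action:
            return new_action
        return proposed_action

    return (key_pdmu, bad_action, new_action), monitor


# Toy trajectories (made up): one-dimensional states, string actions, final result.
demo = [
    ([((0.2,), "a"), ((1.5,), "x")], BAD),
    ([((0.2,), "a"), ((1.5,), "x")], BAD),
    ([((1.2,), "x")], BAD),
    ([((0.2,), "a"), ((1.5,), "y")], GOOD),
    ([((0.7,), "b"), ((2.1,), "y")], GOOD),
]
key, monitor = make_monitor(build_automaton(demo))
```

At run time, a wrapper around the pre-trained policy would call `monitor(state, policy_action)` before every environment step, so the DRL model itself stays unmodified and only the flagged action in the flagged PDMU is replaced.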
Pages: 2327-2339
Number of pages: 13