Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Authors
Min Yang [1]
Guanjun Liu [2, 1]
Ziyuan Zhou [1]
Jiacun Wang [2, 3]
Affiliations
[1] the Department of Computer Science, Tongji University
[2] IEEE
[3] the Computer Science and Software Engineering Department, Monmouth University, West Long
DOI: not available
Abstract
Deep reinforcement learning (DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management. However, because of the model's inherent uncertainty, rigorous validation is required before it is applied to real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the "black-box" nature of DRL makes model behavior difficult to test. We propose a novel performance improvement framework based on probabilistic automata, which aims to proactively identify and correct critical vulnerabilities of DRL systems so that the performance of DRL models in real tasks can be improved with minimal model modifications. First, a probabilistic automaton is constructed from the historical trajectories of the DRL system by abstracting states into probabilistic decision-making units (PDMUs), and a reverse breadth-first search (BFS) is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under each key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, the undesirable action, and the new action are encapsulated as monitors that guide the DRL system toward more favorable results through real-time monitoring and correction. Evaluations in two standard reinforcement learning environments and three real job scheduling scenarios confirm the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.
Pages: 2327-2339
Page count: 13
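
The pipeline summarized in the abstract (abstract states into units, build a probabilistic automaton from trajectories, search backwards from adverse outcomes, then wrap the result in a monitor) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the grid-based `abstract_state`, the `FAIL`/`OK` terminal nodes, the empirical failure-rate score, and every function name below are assumptions made for illustration. The sketch also assumes at least one failed trajectory is present.

```python
from collections import defaultdict, deque

FAIL, OK = "FAIL", "OK"  # hypothetical terminal nodes for the final result

def abstract_state(state, grid=1.0):
    # Assumed abstraction: snap every feature to a grid cell to form a unit.
    return tuple(round(x / grid) for x in state)

def build_automaton(trajectories, grid=1.0):
    # trans[(unit, action)][next_node] = observed transition count
    trans = defaultdict(lambda: defaultdict(int))
    # stats[(unit, action)] = [visits in failed trajectories, total visits]
    stats = defaultdict(lambda: [0, 0])
    for steps, failed in trajectories:
        units = [abstract_state(s, grid) for s, _ in steps]
        for i, (_, action) in enumerate(steps):
            pair = (units[i], action)
            nxt = units[i + 1] if i + 1 < len(units) else (FAIL if failed else OK)
            trans[pair][nxt] += 1
            stats[pair][0] += int(failed)
            stats[pair][1] += 1
    return trans, stats

def transition_probs(trans):
    # Normalise counts into P(next_node | unit, action).
    return {pair: {n: c / sum(nxt.values()) for n, c in nxt.items()}
            for pair, nxt in trans.items()}

def reverse_bfs(trans):
    # Walk backwards from FAIL; depth = number of steps to the adverse outcome.
    preds = defaultdict(set)
    for pair, nxt in trans.items():
        for node in nxt:
            preds[node].add(pair)
    depth, seen, queue = {}, {FAIL}, deque([(FAIL, 0)])
    while queue:
        node, d = queue.popleft()
        for pair in preds[node]:
            depth.setdefault(pair, d + 1)
            if pair[0] not in seen:
                seen.add(pair[0])
                queue.append((pair[0], d + 1))
    return depth

def failure_rate(stats, pair):
    fails, visits = stats[pair]
    return fails / visits

def make_monitor(trajectories, grid=1.0):
    trans, stats = build_automaton(trajectories, grid)
    depth = reverse_bfs(trans)
    # Key pair: highest failure rate, ties broken by proximity to FAIL.
    key_unit, bad_action = max(depth, key=lambda p: (failure_rate(stats, p), -depth[p]))
    # Replacement: the action with the lowest failure rate in the same unit.
    candidates = [p for p in stats if p[0] == key_unit]
    new_action = min(candidates, key=lambda p: failure_rate(stats, p))[1]

    def monitor(state, action):
        # Override only the undesirable action inside the key unit.
        if abstract_state(state, grid) == key_unit and action == bad_action:
            return new_action
        return action

    return monitor, (key_unit, bad_action, new_action)

# Toy data: each trajectory is ([(state, action), ...], failed).
trajs = [
    ([((0.0,), "a"), ((1.0,), "b")], True),
    ([((0.0,), "a"), ((1.0,), "b")], True),
    ([((0.0,), "a"), ((1.0,), "c")], False),
    ([((0.0,), "a"), ((1.0,), "c")], False),
]
monitor, info = make_monitor(trajs)
print(info)                  # the key unit, the undesirable action, the new action
print(monitor((1.2,), "b"))  # the action the monitor lets through
```

In this toy setting the monitor touches the policy's output only when the abstracted state and the proposed action match the recorded pair, which mirrors the abstract's goal of improving results with minimal model modifications. A real system would need a domain-specific abstraction and a more careful impact measure than the raw failure rate used here.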