Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

被引:0
|
作者
Min Yang [1 ]
Guanjun Liu [2 ,1 ]
Ziyuan Zhou [1 ]
Jiacun Wang [2 ,3 ]
机构
[1] the Department of Computer Science, Tongji University
[2] IEEE
[3] the Computer Science and Software Engineering Department, Monmouth University, West Long
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Deep reinforcement learning(DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management.However, due to the model's inherent uncertainty, rigorous validation is requisite for its application in real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the “black-box” nature of DRL poses a challenge for testing model behavior. We propose a novel performance improvement framework based on probabilistic automata,which aims to proactively identify and correct critical vulnerabilities of DRL systems, so that the performance of DRL models in real tasks can be improved with minimal model modifications.First, a probabilistic automaton is constructed from the historical trajectory of the DRL system by abstracting the state to generate probabilistic decision-making units(PDMUs), and a reverse breadth-first search(BFS) method is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under the key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, undesirable action and new action are encapsulated as monitors to guide the DRL system to obtain more favorable results through real-time monitoring and correction mechanisms. Evaluations in two standard reinforcement learning environments and three actual job scheduling scenarios confirmed the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.
引用
收藏
页码:2327 / 2339
页数:13
相关论文
共 50 条
  • [1] Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems
    Yang, Min
    Liu, Guanjun
    Zhou, Ziyuan
    Wang, Jiacun
    IEEE/CAA Journal of Automatica Sinica, 2024, 11 (11) : 2327 - 2339
  • [2] Limitations of learning in automata-based systems
    Oliveira, Fernando S.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 203 (03) : 684 - 691
  • [3] Learning Automata-Based Multiagent Reinforcement Learning for Optimization of Cooperative Tasks
    Zhang, Zhen
    Wang, Dongqing
    Gao, Junwei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4639 - 4652
  • [4] ARED: automata-based runtime estimation for distributed systems using deep learning
    Hyunjoon Cheon
    Jinseung Ryu
    Jaecheol Ryou
    Chan Yeol Park
    Yo-Sub Han
    Cluster Computing, 2023, 26 : 2629 - 2641
  • [5] ARED: automata-based runtime estimation for distributed systems using deep learning
    Cheon, Hyunjoon
    Ryu, Jinseung
    Ryou, Jaecheol
    Park, Chan Yeol
    Han, Yo-Sub
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (05): : 2629 - 2641
  • [6] A cellular automata-based learning method for classification
    Wongthanavasu, Sartra
    Ponkaew, Jetsada
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 49 : 99 - 111
  • [7] A New Learning Automata-Based Pruning Method to Train Deep Neural Networks
    Guo, Haonan
    Li, Shenghong
    Li, Bin
    Ma, Yinghua
    Ren, Xudie
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (05): : 3263 - 3269
  • [8] A Probabilistic Finite State Automata-based Fault Detection Method for Traction Motor
    Peng, Tao
    Dai, Liuxiang
    Chen, Zhiwen
    Ye, ChengLei
    Peng, Xia
    2020 IEEE 29TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2020, : 1199 - 1204
  • [9] A learning automata-based memetic algorithm
    Mirsaleh, M. Rezapoor
    Meybodi, M. R.
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2015, 16 (04) : 399 - 453
  • [10] A learning automata-based memetic algorithm
    M. Rezapoor Mirsaleh
    M. R. Meybodi
    Genetic Programming and Evolvable Machines, 2015, 16 : 399 - 453