Kullback-Leibler Control in Boolean Control Networks

被引:0
|
作者
Toyoda, Mitsuru [1 ]
Wu, Yuhu [2 ,3 ]
机构
[1] Tokyo Metropolitan Univ, Dept Mech Syst Engn, Tokyo 1910065, Japan
[2] Dalian Univ Technol, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China
[3] Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China
基金
中国国家自然科学基金; 日本学术振兴会;
关键词
Boolean control networks (BCNs); convergence analysis; gene regulatory networks; Kullback-Leibler (KL) control; optimal control; semi-tensor product (STP) of matrices; STABILIZATION; ALGORITHM; DYNAMICS;
D O I
10.1109/TCYB.2023.3292819
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article addresses the Kullback-Leibler (KL) control problem in Boolean control networks. In the considered problem, an extended stage cost function depending on the control inputs is introduced; in contrast to a stage cost of the conventional KL control problems in the Markov decision process cannot take into consideration the control inputs. An associated Bellman equation and a matrix-based iteration algorithm are presented. The theoretical analysis shows that the proposed KL control results in an approximated form of conventional dynamic programming (DP). Furthermore, the convergence analysis is presented, with the weight parameter converging to zero and diverging to infinity. In practical application examples, a comparison of the conventional DP and proposed KL control is illustrated.
引用
收藏
页码:4429 / 4442
页数:14
相关论文
共 50 条
  • [1] Fundamental Performance Limitations with Kullback-Leibler Control Cost
    Sun, Yu
    Mehta, Prashant G.
    [J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 7063 - 7068
  • [2] Nonparametric Infinite Horizon Kullback-Leibler Stochastic Control
    Pan, Yunpeng
    Theodorou, Evangelos A.
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 63 - 70
  • [3] Computation of Kullback-Leibler Divergence in Bayesian Networks
    Moral, Serafin
    Cano, Andres
    Gomez-Olmedo, Manuel
    [J]. ENTROPY, 2021, 23 (09)
  • [4] An Efficient Kullback-Leibler Optimization Algorithm for Probabilistic Control Design
    Barao, Miguel
    Lemos, Joao M.
    [J]. 2008 MEDITERRANEAN CONFERENCE ON CONTROL AUTOMATION, VOLS 1-4, 2008, : 802 - +
  • [5] The Kullback-Leibler autodependogram
    Bagnato, L.
    De Capitani, L.
    Punzo, A.
    [J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (14) : 2574 - 2594
  • [6] Online Markov Decision Processes With Kullback-Leibler Control Cost
    Guan, Peng
    Raginsky, Maxim
    Willett, Rebecca M.
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (06) : 1423 - 1438
  • [7] A Kullback-Leibler information control chart for linear profiles monitoring
    Chang, Yu-Ching
    Chen, Chang-Ming
    [J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2020, 36 (07) : 2225 - 2248
  • [8] THE KULLBACK-LEIBLER DISTANCE
    KULLBACK, S
    [J]. AMERICAN STATISTICIAN, 1987, 41 (04): : 340 - 340
  • [9] Online Markov Decision Processes with Kullback-Leibler Control Cost
    Guan, Peng
    Raginsky, Maxim
    Willett, Rebecca
    [J]. 2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 1388 - 1393
  • [10] Kullback-Leibler Boosting
    Liu, C
    Shum, HY
    [J]. 2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 587 - 594