Kullback-Leibler Control in Boolean Control Networks

被引：0

作者：

Toyoda, Mitsuru ^{[1
]}

Wu, Yuhu ^{[2
,3
]}

机构：

[1] Tokyo Metropolitan Univ, Dept Mech Syst Engn, Tokyo 1910065, Japan

[2] Dalian Univ Technol, Key Lab Intelligent Control & Optimizat Ind Equipm, Minist Educ, Dalian 116024, Peoples R China

[3] Dalian Univ Technol, Sch Control Sci & Engn, Dalian 116024, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2024年 / 54卷 / 08期

基金：

中国国家自然科学基金; 日本学术振兴会;

关键词：

Boolean control networks (BCNs); convergence analysis; gene regulatory networks; Kullback-Leibler (KL) control; optimal control; semi-tensor product (STP) of matrices; STABILIZATION; ALGORITHM; DYNAMICS;

D O I：

10.1109/TCYB.2023.3292819

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article addresses the Kullback-Leibler (KL) control problem in Boolean control networks. In the considered problem, an extended stage cost function depending on the control inputs is introduced; in contrast to a stage cost of the conventional KL control problems in the Markov decision process cannot take into consideration the control inputs. An associated Bellman equation and a matrix-based iteration algorithm are presented. The theoretical analysis shows that the proposed KL control results in an approximated form of conventional dynamic programming (DP). Furthermore, the convergence analysis is presented, with the weight parameter converging to zero and diverging to infinity. In practical application examples, a comparison of the conventional DP and proposed KL control is illustrated.

引用

页码：4429 / 4442

页数：14

共 50 条

[1] Fundamental Performance Limitations with Kullback-Leibler Control Cost
Sun, Yu
Mehta, Prashant G.
[J]. 49TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2010, : 7063 - 7068
[2] Nonparametric Infinite Horizon Kullback-Leibler Stochastic Control
Pan, Yunpeng
Theodorou, Evangelos A.
[J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 63 - 70
[3] Computation of Kullback-Leibler Divergence in Bayesian Networks
Moral, Serafin
Cano, Andres
Gomez-Olmedo, Manuel
[J]. ENTROPY, 2021, 23 (09)
[4] An Efficient Kullback-Leibler Optimization Algorithm for Probabilistic Control Design
Barao, Miguel
Lemos, Joao M.
[J]. 2008 MEDITERRANEAN CONFERENCE ON CONTROL AUTOMATION, VOLS 1-4, 2008, : 802 - +
[5] The Kullback-Leibler autodependogram
Bagnato, L.
De Capitani, L.
Punzo, A.
[J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (14) : 2574 - 2594
[6] Online Markov Decision Processes With Kullback-Leibler Control Cost
Guan, Peng
Raginsky, Maxim
Willett, Rebecca M.
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2014, 59 (06) : 1423 - 1438
[7] A Kullback-Leibler information control chart for linear profiles monitoring
Chang, Yu-Ching
Chen, Chang-Ming
[J]. QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2020, 36 (07) : 2225 - 2248
[8] THE KULLBACK-LEIBLER DISTANCE
KULLBACK, S
[J]. AMERICAN STATISTICIAN, 1987, 41 (04): : 340 - 340
[9] Online Markov Decision Processes with Kullback-Leibler Control Cost
Guan, Peng
Raginsky, Maxim
Willett, Rebecca
[J]. 2012 AMERICAN CONTROL CONFERENCE (ACC), 2012, : 1388 - 1393
[10] Kullback-Leibler Boosting
Liu, C
Shum, HY
[J]. 2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 587 - 594

← 1 2 3 4 5 →