Stochastic learning and optimization-A sensitivity-based approach

被引:39
|
作者
Cao, Xi-Ren [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Hong Kong, Peoples R China
关键词
Perturbation analysis; Markov decision processes; Reinforcement learning; Stochastic control; Performance potentials; Event-based optimization; POLICY-GRADIENT ESTIMATION/; GLOBAL OPTIMIZATION; INFINITE-HORIZON; ALGORITHMS;
D O I
10.1016/j.arcontrol.2009.03.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a sensitivity-based view to the area of learning and optimization of stochastic dynamic systems. We show that this sensitivity-based view provides a unified framework for many different disciplines in this area, including perturbation analysis, Markov decision processes, reinforcement learning, identification and adaptive control, and singular stochastic control; and that this unified framework applies to both the discrete event dynamic systems and continuous-time continuous-state systems. Many results in these disciplines can be simply derived and intuitively explained by using two performance sensitivity formulas. In addition, we show that this sensitivity-based view leads to new results and opens up new directions for future research. For example, the n th bias optimality of Markov processes has been established and the event-based optimization may be developed; this approach has computational and other advantages over the state-based approaches. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:11 / 24
页数:14
相关论文
共 50 条
  • [1] Stochastic Learning and Optimization: A Sensitivity-Based Approach
    Zilinskas, Antanas
    [J]. INTERFACES, 2009, 39 (02) : 172 - 174
  • [2] Sensitivity-based optimization approach for portfolio optimization with proportional transaction costs
    [J]. Huang, Y.-H. (huangyonghao@tsinghua.org.cn), 1600, Northeast University (29):
  • [3] Bidirectional machine learning–assisted sensitivity-based stochastic searching approach for groundwater DNAPL source characterization
    Hou, Zeyu
    Lin, Yingzi
    Liu, Tongzhe
    Lu, Wenxi
    [J]. Environmental Science and Pollution Research, 2024, 31 (23) : 33591 - 33609
  • [4] Sensitivity-Based Optimization of Disk Architecture
    Sankar, Sriram
    Zhang, Yan
    Gurumurthi, Sudhanva
    Stan, Mircea R.
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (01) : 69 - 81
  • [5] Sensitivity-Based Inverse Reinforcement Learning
    Tao, Zhaorong
    Chen, Zhichao
    Li, Yanjie
    [J]. 2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2856 - 2861
  • [6] A SENSITIVITY-BASED APPROACH TO TOLERANCE ASSIGNMENT
    ILUMOKA, A
    SPENCE, R
    [J]. IEE PROCEEDINGS-G CIRCUITS DEVICES AND SYSTEMS, 1982, 129 (04): : 139 - 149
  • [7] Problem-independent machine learning (PIML)-based topology optimization-A universal approach
    Huang, Mengcheng
    Du, Zongliang
    Liu, Chang
    Zheng, Yonggang
    Cui, Tianchen
    Mei, Yue
    Li, Xiao
    Zhang, Xiaoyu
    Guo, Xu
    [J]. EXTREME MECHANICS LETTERS, 2022, 56
  • [8] Discrete sensitivity-based evolutionary design optimization
    Steven, GP
    Li, Q
    [J]. COMPUTATIONAL FLUID AND SOLID MECHANICS 2003, VOLS 1 AND 2, PROCEEDINGS, 2003, : 2373 - 2377
  • [9] A sensitivity-based wiresizing approach to interconnect optimization of lossy transmission line topologies
    Xue, TX
    Kuh, ES
    Yu, QJ
    [J]. 1996 IEEE MULTI-CHIP MODULE CONFERENCE, PROCEEDINGS, 1996, : 117 - 122
  • [10] Tuning LQR Controllers: A Sensitivity-Based Approach
    Masti, Daniele
    Zanon, Mario
    Bemporad, Alberto
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 932 - 937