Sample-Based Potentials Estimation for the Optimal Control of Stochastic System

被引:0
|
作者
Cheng Kang [1 ]
Zhang Kanjian [1 ]
Fei Shumin [1 ]
Liu Xiao-Mei [1 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
关键词
Stochastic System; Potentials; Markov Decision Processes; Policy Iteration; Basis Function; ALGORITHMS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimization method based on perturbation analysis is applied to stochastic system. A policy iteration approach is designed by the performance sensitivity formula which is constructed with potentials. For estimating the potentials, the Poisson equation is viewed as a system of linear equation, then a least squares policy evaluation method is adopted, and the selection of basis function is also discussed for getting a better performance of approximation. The simulation shows the effectiveness of the policy iteration and the approximation approach.
引用
收藏
页码:2031 / 2035
页数:5
相关论文
共 50 条
  • [1] Sample-Based Information-Theoretic Stochastic Optimal Control
    Lioutikov, Rudolf
    Paraschos, Alexandros
    Peters, Jan
    Neumann, Gerhard
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 3896 - 3902
  • [2] Stochastic Optimal Control using Local Sample-based Value Function Approximation
    Dolgov, Maxim
    Kurz, Gerhard
    Grimm, Daniela
    Rosenthal, Florian
    Hanebeck, Uwe D.
    [J]. 2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 2145 - 2150
  • [3] Optimal Sample-Based Fusion for Distributed State Estimation
    Steinbring, Jannik
    Noack, Benjamin
    Reinhardt, Marc
    Hanebeck, Uwe D.
    [J]. 2016 19TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2016, : 1600 - 1607
  • [4] Sample-Based Optimal Pricing
    Allouah, Amine
    Besbes, Omar
    [J]. ACM EC '19: PROCEEDINGS OF THE 2019 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2019, : 391 - 391
  • [5] Sample-Based Optimal Transport and Barycenter Problems
    Kuang, Max
    Tabak, Esteban G.
    [J]. COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2019, 72 (08) : 1581 - 1630
  • [6] A feature selection method for a sample-based stochastic policy
    Yamanaka, Jumpei
    Nakamura, Yutaka
    Ishiguro, Hiroshi
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2014, 19 (03) : 251 - 257
  • [7] Provably Near-Optimal Approximation Schemes for Implicit Stochastic and Sample-Based Dynamic Programs
    Halman, Nir
    [J]. INFORMS JOURNAL ON COMPUTING, 2020, 32 (04) : 1157 - 1181
  • [8] An Online Sample-Based Method for Mode Estimation Using ODE Analysis of Stochastic Approximation Algorithms
    Kamanchi, Chandramouli
    Diddigi, Raghuram Bharadwaj
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    [J]. IEEE CONTROL SYSTEMS LETTERS, 2019, 3 (03): : 697 - 702
  • [9] Sample coverage estimation, rarefaction, and extrapolation based on sample-based abundance data
    Chiu, Chun-Huo
    [J]. ECOLOGY, 2023, 104 (08)
  • [10] Sample-based maximum likelihood estimation of the autologistic model
    Magnussen, S.
    Reeves, R.
    [J]. JOURNAL OF APPLIED STATISTICS, 2007, 34 (05) : 547 - 561