VALUE FUNCTION ESTIMATION BASED ON AN ERROR GAUSSIAN MIXTURE MODEL

Authors
Cui, Delong [1]
Peng, Zhiping [1]
Li, Qirui [1]
He, Jieguang [1]
Li, Kaibin [1]
Hung, Shangchao [2,3]
Affiliations
[1] Guangdong Univ Petrochem Technol, Coll Comp & Elect Informat, Maoming 525000, Guangdong, Peoples R China
[2] Fuzhou Univ, Fuzhou Polytech, Fuzhou 350108, Fujian, Peoples R China
[3] Intelligent Technol Res Ctr, Fuzhou 350108, Fujian, Peoples R China
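Funding
National Natural Science Foundation of China
Keywords
Value function estimation; error Gaussian mixture model; Gaussian process regression; reinforcement learning
DOI
Not available
Chinese Library Classification
O29 [Applied Mathematics]
Subject classification code
070104
Abstract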
In reinforcement learning, balancing exploration and exploitation in an agent's action selection has always been a key problem. An agent should not only make full use of the action with the maximum estimated value but also explore potentially optimal actions. Inspired by this trade-off in action selection, this paper proposes a novel value function estimation algorithm based on an error Gaussian mixture model (EGMM). First, appropriate variables are chosen from the error data, and the number of Gaussian components is obtained by optimizing a Bayesian information criterion via the EGMM. Then, the EGMM is used to fit the error data and compute the conditional error mean, which compensates the output and thus yields more accurate results. We test the performance of the designed algorithm on a virtual experimental platform in a cloud computing environment. Experiments demonstrate that the proposed algorithm eliminates the influence of non-Gaussian noise on model prediction performance.
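The two steps named in the abstract (choosing the number of components by a Bayesian information criterion, then compensating the output with a conditional error mean) can be illustrated with a minimal sketch. This is not the authors' implementation, since the abstract gives no formulas. It assumes a joint Gaussian mixture over a scalar input x and a scalar error e, that the error is defined as target minus prediction, and that the mixture has already been fitted (for example by EM). All function names, dictionary keys, and parameter values are hypothetical.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a univariate Gaussian N(mean, var) at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def bic(log_likelihood, n_components, n_samples):
    """BIC = p * ln(n) - 2 * logL for a 2-D full-covariance mixture.

    Free parameters (assumed model): (K - 1) weights + 2K means + 3K covariance terms.
    """
    n_params = 6 * n_components - 1
    return n_params * math.log(n_samples) - 2.0 * log_likelihood


def select_n_components(log_likelihoods, n_samples):
    """Pick the component count K with the lowest BIC.

    log_likelihoods maps K to the maximized log-likelihood of a K-component fit.
    """
    return min(log_likelihoods, key=lambda k: bic(log_likelihoods[k], k, n_samples))


def conditional_error_mean(x, components):
    """E[e | x] under a joint Gaussian mixture over (x, e).

    Each component is a dict with keys weight, mean_x, mean_e, var_x, cov_xe.
    No underflow guard: x far from every component would make the total zero.
    """
    # Unnormalized responsibility of each component given the observed x.
    weighted = [c["weight"] * gaussian_pdf(x, c["mean_x"], c["var_x"]) for c in components]
    total = sum(weighted)
    cond = 0.0
    for w, c in zip(weighted, components):
        # Conditional mean of e given x for one Gaussian component.
        slope = c["cov_xe"] / c["var_x"]
        cond += (w / total) * (c["mean_e"] + slope * (x - c["mean_x"]))
    return cond


def compensate(prediction, x, components):
    """Add the expected error back to the base model's prediction."""
    return prediction + conditional_error_mean(x, components)
```

Under these assumptions, a base regressor's prediction at input x would be corrected by calling `compensate(prediction, x, components)`, with `components` taken from whichever fit `select_n_components` favours. A multivariate input would replace the scalar variance and covariance with matrices, which this sketch leaves out.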
Pages: 1687-1702
Number of pages: 16