Similarity-based transfer learning of decision policies

Authors
Zugarova, Eliska [1]
Guy, Tatiana V. [1]
Affiliations
[1] Czech Acad Sci, Inst Informat Theory & Automat, Dept Adapt Syst, Prague, Czech Republic
Keywords
probabilistic model; transfer learning; closed-loop behavior; fully probabilistic design; Bayesian estimation; sequential decision making
DOI
10.1109/smc42975.2020.9283093
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
We consider the problem of learning a decision policy from available past experience. Using the Fully Probabilistic Design (FPD) formalism, we propose a new general approach for finding a stochastic policy from past data. The proposed approach assigns a degree of similarity to each past closed-loop behavior. The degree of similarity expresses how close the current decision-making task is to a past task. Bayesian estimation then uses it to learn an approximately optimal policy that incorporates the best past experience. The approach learns the decision policy directly from the data, without interacting with any supervisor or expert and without using any reinforcement signal. The past experience may reflect a decision objective different from the current one. Moreover, the past decision policy need not be optimal with respect to the past objective. We demonstrate our approach on simulated examples and show that the learned policy achieves better performance than the optimal FPD policy whenever mismodeling is present.
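The idea in the abstract, weighting past closed-loop data by a degree of similarity and feeding the weighted data into Bayesian estimation of a policy, can be illustrated with a toy sketch. This is not the authors' algorithm: the discrete state and action spaces, the similarity weight exp(-KL) against an ideal closed-loop distribution, the Dirichlet pseudo-count update, and all function names and data below are assumptions made for illustration only.

```python
import numpy as np

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal shape."""
    p = np.asarray(p, dtype=float).ravel()
    q = np.maximum(np.asarray(q, dtype=float).ravel(), eps)  # avoid division by zero
    mask = p > 0  # terms with p == 0 contribute nothing
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def empirical_joint(records, n_states, n_actions):
    """Empirical joint distribution of (state, action) pairs from one past task."""
    counts = np.zeros((n_states, n_actions))
    for s, a in records:
        counts[s, a] += 1.0
    return counts / counts.sum()

def learn_policy(past_tasks, ideal_joint, prior=1.0):
    """Similarity-weighted Dirichlet estimate of a stochastic policy p(a | s).

    Assumption: similarity of a past task is exp(-KL(past behavior || ideal)),
    and each past record adds that weight as a fractional pseudo-count.
    """
    n_states, n_actions = ideal_joint.shape
    pseudo_counts = np.full((n_states, n_actions), prior)
    weights = []
    for records in past_tasks:
        joint = empirical_joint(records, n_states, n_actions)
        w = float(np.exp(-kl_divergence(joint, ideal_joint)))
        weights.append(w)
        for s, a in records:
            pseudo_counts[s, a] += w
    # Posterior-mean policy: normalize pseudo-counts over actions for each state.
    policy = pseudo_counts / pseudo_counts.sum(axis=1, keepdims=True)
    return policy, weights

# Hypothetical toy data: the current objective prefers action 0 in state 0
# and action 1 in state 1.
ideal_joint = np.array([[0.45, 0.05],
                        [0.05, 0.45]])
task_a = [(0, 0)] * 9 + [(0, 1)] + [(1, 1)] * 9 + [(1, 0)]  # close to the ideal
task_b = [(0, 1)] * 9 + [(0, 0)] + [(1, 0)] * 9 + [(1, 1)]  # far from the ideal

policy, weights = learn_policy([task_a, task_b], ideal_joint)
print("similarity weights:", weights)
print("learned policy p(a | s):")
print(policy)
```

In this sketch the task whose recorded behavior is closer to the ideal distribution receives the larger weight, so its data dominates the learned policy even though no reward signal or expert is consulted.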
Pages: 37-44
Number of pages: 8
Related papers
50 in total
  • [1] Transfer of learning in young children: Magic digital or similarity-based?
    Mombo, Wilfried T.
    Clerc, Jerome
    ANNEE PSYCHOLOGIQUE, 2022, 122 (03): : 471 - 512
  • [2] Similarity-Based Chained Transfer Learning for Energy Forecasting With Big Data
    Tian, Yifang
    Sehovac, Ljubisa
    Grolinger, Katarina
    IEEE ACCESS, 2019, 7 : 139895 - 139908
  • [3] NES-TL: Network Embedding Similarity-Based Transfer Learning
    Fu, Chenbo
    Zheng, Yongli
    Liu, Yi
    Xuan, Qi
    Chen, Guanrong
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (03): : 1607 - 1618
  • [4] A Similarity-Based Decision Process for Decisions' Implementation
    Averkyna, Maryna
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND APPLIED COGNITIVE COMPUTING, 2021, : 571 - 581
  • [5] Similarity-based active learning methods
    Sui, Qun
    Ghosh, Sujit K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 251
  • [6] Similarity-based classifier combination for decision making
    Guo, GD
    Neagu, D
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 176 - 181
  • [7] A similarity-based approach to relevance learning
    Cöster, R
    Asker, L
    ECAI 2000: 14TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, 54 : 276 - 280
  • [8] Dynamic simulation of gas turbines via feature similarity-based transfer learning
    Dengji Zhou
    Jiarui Hao
    Dawen Huang
    Xingyun Jia
    Huisheng Zhang
    Frontiers in Energy, 2020, 14 : 817 - 835
  • [9] Similarity-Based Unsupervised Deep Transfer Learning for Remote Sensing Image Retrieval
    Liu, Yishu
    Ding, Liwang
    Chen, Conghui
    Liu, Yingbin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (11): : 7872 - 7889
  • [10] Dynamic simulation of gas turbines via feature similarity-based transfer learning
    Zhou, Dengji
    Hao, Jiarui
    Huang, Dawen
    Jia, Xingyun
    Zhang, Huisheng
    FRONTIERS IN ENERGY, 2020, 14 (04) : 817 - 835