Action-dependent bidirectional contrastive predictive coding for neural belief representations

被引:0
|
作者
Liu, Jianfeng [1 ]
Sun, Lifan [1 ,2 ]
Pu, Jiexin [1 ]
Yan, Yongyi [1 ]
机构
[1] Henan Univ Sci & Technol, Sch Informat Engn, Luoyang 471023, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural belief representation; POMDP; Contrastive predictive coding; Self-supervised learning; Interpretability analysis;
D O I
10.1016/j.neucom.2022.02.066
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key to solving the complex, partially observable Markov decision process (POMDP) with high dimensional observations lies in explicit belief representations. However, the existing methods generally adopted the black-box model for shaping beliefs, which is inefficient and lacks interpretability. Due to this reason, the action-dependent bidirectional contrastive predictive coding (BCPC|Action) is proposed in this paper, in which the observation features are extracted efficiently through self-supervised contrastive learning. Owing to the bottleneck belief constraints in the bidirectional model, the upper bound of prediction errors is effectively reduced. Besides, the forward prediction is optimized by the guidance of an easier trainable backward prediction; thus, the bidirectional match regularization (BMR) could be derived for stabilizing the training process. More importantly, the interpretability of the learned belief representation is thoroughly explored based on the gradient truncation. Simulation results verify the effectiveness of the presented method; apart from achieving highly accurate belief tracking, the state uncertainties could be characterized reasonably, which provides a guarantee for solving the POMDP optimal policy for downstream tasks.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:284 / 298
页数:15
相关论文
共 15 条
  • [1] Action-dependent bidirectional contrastive predictive coding for neural belief representations
    Liu, Jianfeng
    Sun, Lifan
    Pu, Jiexin
    Yan, Yongyi
    [J]. Neurocomputing, 2022, 488 : 284 - 298
  • [2] Action-dependent plasticity in peripersonal space representations
    Ladavas, Elisabetta
    Serino, Andrea
    [J]. COGNITIVE NEUROPSYCHOLOGY, 2008, 25 (7-8) : 1099 - 1113
  • [3] Secure Source Coding with Action-dependent Side Information
    Kittichokechai, Kittipong
    Oechtering, Tobias J.
    Skoglund, Mikael
    [J]. 2011 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2011, : 1678 - 1682
  • [4] Coding With Action-Dependent Side Information and Additional Reconstruction Requirements
    Kittichokechai, Kittipong
    Oechtering, Tobias J.
    Skoglund, Mikael
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (11) : 6355 - 6367
  • [5] Secure Source Coding With Action-Dependent Side Information
    Kittichokechai, Kittipong
    Oechtering, Tobias J.
    Skoglund, Mikael
    Chia, Yeow-Khiang
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (12) : 6444 - 6464
  • [6] Multiterminal Source Coding With Action-Dependent Side Information
    Chia, Yeow-Khiang
    Asnani, Himanshu
    Weissman, Tsachy
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (06) : 3653 - 3667
  • [7] Source Coding With Common Reconstruction and Action-dependent Side Information
    Kittichokechai, Kittipong
    Oechtering, Tobias J.
    Skoglund, Mikael
    [J]. 2010 IEEE INFORMATION THEORY WORKSHOP (ITW), 2010,
  • [8] On Secure One-Helper Source Coding With Action-Dependent Side Information
    Lu, Jian
    Xu, Yinfei
    Zhang, Ping
    Wang, Qiao
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (01) : 95 - 110
  • [9] Inferring action-dependent outcome representations depends on anterior but not posterior medial orbitofrontal cortex
    Bradfield, Laura A.
    Hart, Genevra
    Balleine, Bernard W.
    [J]. NEUROBIOLOGY OF LEARNING AND MEMORY, 2018, 155 : 463 - 473
  • [10] Random-Coding Exponential Error Bounds for Channels with Action-Dependent States
    Matsuta, Tetsunao
    Uyematsu, Tomohiko
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (12): : 2324 - 2331