Latent Context Based Soft Actor-Critic

Cited by: 0
Authors
Pu, Yuan [1]
Wang, Shaochen [1]
Yao, Xin [1]
Li, Bin [1]
Affiliations
[1] University of Science and Technology of China, Hefei, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1109/ijcnn48605.2020.9207008
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The performance of deep reinforcement learning methods tends to degrade when they are applied to tasks that require relatively long-horizon memory or that have highly variable dynamics. In this paper, we use probabilistic latent context variables, motivated by recent Meta-RL work, and propose the Latent Context based Soft Actor-Critic (LC-SAC) approach to address these issues. The latent context can encode information about both the agent's previous behaviors and the dynamics of the current environment, which is empirically believed to benefit efficient policy optimization. Experimental results demonstrate that LC-SAC achieves performance comparable to SAC on a collection of continuous control benchmarks and outperforms SAC on particular tasks with the above two characteristics. Moreover, we introduce a simple but general procedure for integrating LC-SAC with demonstrations of diverse quality, enabling efficient reuse of human prior knowledge and ultimately achieving competitive performance with a comparatively small number of environment interactions.
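The abstract does not specify how the latent context is computed, so the following is only a minimal sketch of the general idea: a probabilistic latent variable is inferred from recent transitions and appended to the state before it reaches the policy. The product-of-Gaussians aggregation is borrowed from common Meta-RL practice and is an assumption here, not a confirmed detail of LC-SAC. All names (`encode_context`, `sample_latent`, `policy_input`, the linear weights `w_mu` and `w_logvar`) are hypothetical.

```python
import numpy as np

def encode_context(transitions, w_mu, w_logvar):
    """Fuse per-transition Gaussian factors into one latent posterior.

    Assumption: a linear encoder and a product-of-Gaussians fusion,
    used purely for illustration.
    """
    mu = transitions @ w_mu                  # (N, latent_dim) factor means
    var = np.exp(transitions @ w_logvar)     # (N, latent_dim) factor variances
    precision = 1.0 / var
    post_var = 1.0 / precision.sum(axis=0)   # fused variance
    post_mu = post_var * (precision * mu).sum(axis=0)  # precision-weighted mean
    return post_mu, post_var

def sample_latent(mu, var, rng):
    """Reparameterized sample z = mu + sigma * eps."""
    return mu + np.sqrt(var) * rng.standard_normal(mu.shape)

def policy_input(state, z):
    """Condition the policy on the latent by concatenating it to the state."""
    return np.concatenate([state, z])

rng = np.random.default_rng(0)
state_dim, action_dim, latent_dim = 3, 1, 2
trans_dim = state_dim + action_dim + 1 + state_dim   # (s, a, r, s')

# Hypothetical history of 5 recent transitions and random encoder weights.
transitions = rng.standard_normal((5, trans_dim))
w_mu = 0.1 * rng.standard_normal((trans_dim, latent_dim))
w_logvar = 0.1 * rng.standard_normal((trans_dim, latent_dim))

mu, var = encode_context(transitions, w_mu, w_logvar)
z = sample_latent(mu, var, rng)
x = policy_input(rng.standard_normal(state_dim), z)
```

In this sketch the fused variance shrinks as more transitions are observed, which is one way a latent could summarize both past behavior and the current dynamics; an actor-critic such as SAC would then take `x` in place of the raw state.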
Pages: 7