Dictionary Learning-Based Reinforcement Learning with Non-convex Sparsity Regularizer

被引:4
|
作者
Zhao, Haoli [1 ,2 ]
Wang, Junkui [1 ,2 ]
Huang, Xingming [3 ]
Li, Zhenini [4 ,5 ]
Xie, Shengli [6 ,7 ]
机构
[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
[2] 111 Ctr Intelligent Batch Mfg Based IoT Technol G, Guangzhou 510006, Peoples R China
[3] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[4] Guangdong Prov Key Lab IoT Informat Technol GDUT, Guangzhou 510006, Peoples R China
[5] Guangdong HongKong Macao Joint Lab Smart Discrete, Guangzhou 510006, Peoples R China
[6] Minist Educ, Key Lab Intelligent Detect & Internet Things Mfg, Guangzhou 510006, Peoples R China
[7] Minist Educ, Key Lab Intelligent Informat Proc & Syst Integrat, Guangzhou 510006, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Dictionary learning; Non-convex sparsity regularizer;
D O I
10.1007/978-3-031-20503-3_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spare representations can help improve value prediction and control performances in Reinforcement Learning (RL), by capturing most essential features from states and ignoring unnecessary ones to avoid interference. However, existing sparse coding-based RL methods for control problems are optimized in the neural network methodology, which can not guarantee convergence. To this end, we propose a dictionary learning-based RL with the non-convex sparsity regularizer for RL control. To avoid the black-box optimization with the SGD, we employ the dictionary learning model in RL control, guaranteeing efficient convergence in control experiments. To obtain accurate representations in RL, we employ the non-convex l(p) norm (0 < p < 1) beyond the convex l(1) norm as the sparsity regularizer in dictionary learning-based RL, for capturing more essential features from states. To obtain solutions efficiently, we employ the proximal splitting method to update the multivariate optimization problem. Hence, the non-convex sparsity regularized dictionary learning-based RL is developed and validated in different benchmark RL environments. The proposed algorithm can obtain the best control performances among compared sparse coding-based RL methods with around 10% increases in reward. Moreover, the proposed method can obtain higher sparsity in representations in different environments.
引用
收藏
页码:81 / 93
页数:13
相关论文
共 50 条
  • [1] Dictionary Learning-Structured Reinforcement Learning With Adaptive-Sparsity Regularizer
    Li, Zhenni
    Tang, Jianhao
    Zhao, Haoli
    Chen, Ci
    Xie, Shengli
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 1753 - 1769
  • [2] Group non-convex sparsity regularized partially shared dictionary learning for multi-view learning
    Zhao, Haoli
    Zhong, Peng
    Chen, Haiqin
    Li, Zhenni
    Chen, Wuhui
    Zheng, Zibin
    KNOWLEDGE-BASED SYSTEMS, 2022, 242
  • [3] Multiple kernel learning with NOn-conVex group spArsity
    Lu, W. (luwm@zju.edu.cn), 1616, Academic Press Inc. (25):
  • [4] Multiple kernel learning with NOn-conVex group spArsity
    Yuan, Ying
    Lu, Weiming
    Wu, Fei
    Zhuang, Yueting
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (07) : 1616 - 1624
  • [5] Non-Convex Metric Learning-Based Trajectory Clustering Algorithm
    Lei, Xiaoyan
    Wang, Hongyan
    MATHEMATICS, 2025, 13 (03)
  • [6] A Non-convex Relaxation Approach to Sparse Dictionary Learning
    Shi, Jianping
    Ren, Xiang
    Dai, Guang
    Wang, Jingdong
    Zhang, Zhihua
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1809 - 1816
  • [7] Dictionary Learning with l1/2 regularizer for Sparsity Based on Proximal Operator
    Li, Zhenni
    Ding, Shuxue
    Li, Yujie
    Chen, Wuhui
    2015 IEEE 7TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE & TECHNOLOGY (ICAST), 2015, : 105 - 110
  • [8] A CONSENSUS-BASED DECENTRALIZED ALGORITHM FOR NON-CONVEX OPTIMIZATION WITH APPLICATION TO DICTIONARY LEARNING
    Wai, Hoi-To
    Chang, Tsung-Hui
    Scaglione, Anna
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3546 - 3550
  • [9] An Efficient Algorithm for Dictionary Learning with a Mixed-norm Regularizer for Sparsity Based on Proximal Operator
    Li, Zhenni
    Ding, Shuxue
    Hayashi, Takafumi
    Chen, Wuhui
    Li, Yujie
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
  • [10] DIFFUSION LEARNING IN NON-CONVEX ENVIRONMENTS
    Vlaski, Stefan
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5262 - 5266