Dictionary Learning-Based Reinforcement Learning with Non-convex Sparsity Regularizer

Cited by: 4

Authors:

Zhao, Haoli [1,2]

Wang, Junkui [1,2]

Huang, Xingming [3]

Li, Zhenni [4,5]

Xie, Shengli [6,7]

Affiliations:

[1] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

[2] 111 Ctr Intelligent Batch Mfg Based IoT Technol G, Guangzhou 510006, Peoples R China

[3] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

[4] Guangdong Prov Key Lab IoT Informat Technol GDUT, Guangzhou 510006, Peoples R China

[5] Guangdong HongKong Macao Joint Lab Smart Discrete, Guangzhou 510006, Peoples R China

[6] Minist Educ, Key Lab Intelligent Detect & Internet Things Mfg, Guangzhou 510006, Peoples R China

[7] Minist Educ, Key Lab Intelligent Informat Proc & Syst Integrat, Guangzhou 510006, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III | 2022 / Vol. 13606

Funding:

National Natural Science Foundation of China;

Keywords:

Reinforcement learning; Dictionary learning; Non-convex sparsity regularizer;

DOI:

10.1007/978-3-031-20503-3_7

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Sparse representations can help improve value prediction and control performance in Reinforcement Learning (RL) by capturing the most essential features of states and ignoring unnecessary ones to avoid interference. However, existing sparse coding-based RL methods for control problems are optimized with neural-network methods, which cannot guarantee convergence. To this end, we propose a dictionary learning-based RL method with a non-convex sparsity regularizer for RL control. To avoid black-box optimization with SGD, we employ a dictionary learning model in RL control, guaranteeing efficient convergence in control experiments. To obtain accurate representations in RL, we employ the non-convex l_p norm (0 < p < 1), rather than the convex l_1 norm, as the sparsity regularizer in dictionary learning-based RL, so as to capture more essential features of states. To obtain solutions efficiently, we employ the proximal splitting method to solve the multivariate optimization problem. The resulting non-convex sparsity regularized dictionary learning-based RL algorithm is validated in different benchmark RL environments. It obtains the best control performance among the compared sparse coding-based RL methods, with around a 10% increase in reward. Moreover, it obtains higher sparsity in representations across different environments.
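
To illustrate the kind of computation the abstract refers to, the following is a minimal sketch of proximal-gradient sparse coding with an l_p penalty (0 < p < 1) for a fixed dictionary. It is not the paper's algorithm: the function names `prox_lp_scalar` and `sparse_code`, the default values of `lam` and `p`, and the bisection-based evaluation of the proximal operator are all assumptions made for illustration, and the dictionary update and the RL value-learning loop are omitted.

```python
import numpy as np

def prox_lp_scalar(v, lam, p, iters=60):
    """Return argmin_z 0.5*(z - v)**2 + lam*|z|**p for a scalar v, 0 < p < 1.

    Hypothetical helper: evaluates the proximal operator numerically.
    """
    a = abs(v)
    if a == 0.0 or lam == 0.0:
        return float(v)
    # For z > 0 the objective is concave on (0, z0) and convex on (z0, inf).
    z0 = (lam * p * (1.0 - p)) ** (1.0 / (2.0 - p))

    def grad(z):
        return z - a + lam * p * z ** (p - 1.0)

    if grad(z0) > 0.0:
        return 0.0  # no nonzero local minimum exists, so z = 0 is optimal
    # The derivative is increasing on [z0, a]; locate its root by bisection.
    lo, hi = z0, a
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if grad(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    z = 0.5 * (lo + hi)
    # Keep the nonzero candidate only if it beats the objective at z = 0.
    if 0.5 * (z - a) ** 2 + lam * z ** p < 0.5 * a ** 2:
        return float(np.sign(v) * z)
    return 0.0

def sparse_code(x, D, lam=0.1, p=0.5, steps=200):
    """Approximately minimize 0.5*||x - D z||^2 + lam*sum(|z_i|**p) over z."""
    L = np.linalg.norm(D, 2) ** 2  # Lipschitz constant of the smooth term
    z = np.zeros(D.shape[1])
    for _ in range(steps):
        u = z - D.T @ (D @ z - x) / L  # gradient step on the data-fit term
        z = np.array([prox_lp_scalar(ui, lam / L, p) for ui in u])
    return z
```

In this sketch, small coefficients are set exactly to zero while large ones are shrunk only slightly, which is the qualitative behavior that motivates using an l_p penalty in place of the l_1 norm.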

Pages: 81-93

Page count: 13

50 records in total

[1] Dictionary Learning-Structured Reinforcement Learning With Adaptive-Sparsity Regularizer
Li, Zhenni
Tang, Jianhao
Zhao, Haoli
Chen, Ci
Xie, Shengli
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (02) : 1753 - 1769
[2] Group non-convex sparsity regularized partially shared dictionary learning for multi-view learning
Zhao, Haoli
Zhong, Peng
Chen, Haiqin
Li, Zhenni
Chen, Wuhui
Zheng, Zibin
KNOWLEDGE-BASED SYSTEMS, 2022, 242
[3] Multiple kernel learning with NOn-conVex group spArsity
Lu, W. (luwm@zju.edu.cn), 1616, Academic Press Inc. (25):
[4] Multiple kernel learning with NOn-conVex group spArsity
Yuan, Ying
Lu, Weiming
Wu, Fei
Zhuang, Yueting
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2014, 25 (07) : 1616 - 1624
[5] Non-Convex Metric Learning-Based Trajectory Clustering Algorithm
Lei, Xiaoyan
Wang, Hongyan
MATHEMATICS, 2025, 13 (03)
[6] A Non-convex Relaxation Approach to Sparse Dictionary Learning
Shi, Jianping
Ren, Xiang
Dai, Guang
Wang, Jingdong
Zhang, Zhihua
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1809 - 1816
[7] Dictionary Learning with l1/2 regularizer for Sparsity Based on Proximal Operator
Li, Zhenni
Ding, Shuxue
Li, Yujie
Chen, Wuhui
2015 IEEE 7TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE & TECHNOLOGY (ICAST), 2015, : 105 - 110
[8] A CONSENSUS-BASED DECENTRALIZED ALGORITHM FOR NON-CONVEX OPTIMIZATION WITH APPLICATION TO DICTIONARY LEARNING
Wai, Hoi-To
Chang, Tsung-Hui
Scaglione, Anna
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3546 - 3550
[9] An Efficient Algorithm for Dictionary Learning with a Mixed-norm Regularizer for Sparsity Based on Proximal Operator
Li, Zhenni
Ding, Shuxue
Hayashi, Takafumi
Chen, Wuhui
Li, Yujie
2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,
[10] DIFFUSION LEARNING IN NON-CONVEX ENVIRONMENTS
Vlaski, Stefan
Sayed, Ali H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5262 - 5266