Task-Agnostic Safety for Reinforcement Learning

Cited by: 1

Authors:

Rahman, Md Asifur [1]

Alqahtani, Sarra [1]

Affiliation:

[1] Wake Forest Univ, Winston Salem, NC 27101 USA

Source:

PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023 | 2023

Funding:

U.S. National Science Foundation

Keywords:

Reinforcement Learning; safety; attacks; robustness;

DOI:

10.1145/3605764.3623913

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Reinforcement learning (RL) is an attractive approach to designing autonomous systems because it learns by exploration. However, this learning process makes RL inherently vulnerable and thus unsuitable for applications where safety is a top priority. To address this issue, researchers have either jointly optimized task and safety objectives or imposed constraints to restrict exploration. This paper takes a different approach, using exploration as an adaptive means to learn robust and safe behavior. To this end, we propose the Task-Agnostic Safety for Reinforcement Learning (TAS-RL) framework, which ensures safety in RL by learning a representation of unsafe behaviors and excluding them from the agent's policy. TAS-RL is task-agnostic and can be integrated with any RL task policy in the same environment, providing a self-protection layer for the system. To evaluate the robustness of TAS-RL, we present a novel study in which TAS-RL and 7 safe RL baselines are tested in constrained Markov decision process (CMDP) environments under white-box action space perturbations and changes in the environment dynamics. The results show that TAS-RL outperforms all baselines, achieving consistent near-zero safety constraint violations in continuous action spaces with 10 times more variation in the testing environment dynamics.

Citation

Pages: 139-148

Page count: 10

