Task-Agnostic Safety for Reinforcement Learning

被引:1
|
作者
Rahman, Md Asifur [1 ]
Alqahtani, Sarra [1 ]
机构
[1] Wake Forest Univ, Winston Salem, NC 27101 USA
基金
美国国家科学基金会;
关键词
Reinforcement Learning; safety; attacks; robustness;
D O I
10.1145/3605764.3623913
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has been an attractive potential for designing autonomous systems due to its learning-by-exploration approach. However, this learning process makes RL inherently vulnerable and thus unsuitable for applications where safety is a top priority. To address this issue, researchers have either jointly optimized task and safety or imposed constraints to restrict exploration. This paper takes a different approach, by utilizing exploration as an adaptive means to learn a robust and safe behavior. To this end, we propose Task-Agnostic Safety for Reinforcement Learning (TAS-RL) framework to ensure safety in RL by learning a representation of unsafe behaviors and excluding them from the agent's policy. TAS-RL is task-agnostic and can be integrated with any RL task policy in the same environment, providing a self-protection layer for the system. To evaluate the robustness of TAS-RL, we present a novel study where TAS-RL and 7 safe RL baselines are tested in constrained Markov decision processes (CMDP) environments under white-box action space perturbations and changes in the environment dynamics. The results show that TAS-RL outperforms all baselines by achieving consistent near-zero safety constraint violations in continuous action space with 10 times more variations in the testing environment dynamics.
引用
收藏
页码:139 / 148
页数:10
相关论文
共 50 条
  • [31] TAFA: A Task-Agnostic Fingerprinting Algorithm for Neural Networks
    Pan, Xudong
    Zhang, Mi
    Lu, Yifan
    Yang, Min
    COMPUTER SECURITY - ESORICS 2021, PT I, 2021, 12972 : 542 - 562
  • [32] Task-Agnostic Adaptive Activation Scaling Network for LLMs
    Jia, Ni
    Liu, Tong
    Chen, Jiadi
    Zhang, Ying
    Han, Song
    IEEE ACCESS, 2025, 13 : 31774 - 31784
  • [33] Task-Agnostic Structured Pruning of Speech Representation Models
    Wang, Haoyu
    Wang, Siyuan
    Zhang, Wei-Qiang
    Suo, Hongbin
    Wan, Yulong
    INTERSPEECH 2023, 2023, : 231 - 235
  • [34] Task-Agnostic Evolution of Diverse Repertoires of Swarm Behaviours
    Gomes, Jorge
    Christensen, Anders Lyhne
    SWARM INTELLIGENCE (ANTS 2018), 2018, 11172 : 225 - 238
  • [35] VariGrow: Variational Architecture Growing for Task-Agnostic Continual Learning based on Bayesian Novelty
    Ardywibowo, Randy
    Huo, Zepeng
    Wang, Zhangyang
    Mortazavi, Bobak
    Huang, Shuai
    Qian, Xiaoning
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 865 - 877
  • [36] DexBERT: Effective, Task-Agnostic and Fine-Grained Representation Learning of Android Bytecode
    Sun T.
    Allix K.
    Kim K.
    Zhou X.
    Kim D.
    Lo D.
    Bissyande T.F.
    Klein J.
    IEEE Transactions on Software Engineering, 2023, 49 (10) : 4691 - 4706
  • [37] LEARNING DIVERSE SUB-POLICIES VIA A TASK-AGNOSTIC REGULARIZATION ON ACTION DISTRIBUTIONS
    Huo, Liangyu
    Wang, Zulin
    Xu, Mai
    Song, Yuhang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3932 - 3936
  • [38] TAPE: Task-Agnostic Prior Embedding for Image Restoration
    Liu, Lin
    Xie, Lingxi
    Zhang, Xiaopeng
    Yuan, Shanxin
    Chen, Xiangyu
    Zhou, Wengang
    Li, Houqiang
    Tian, Qi
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 447 - 464
  • [39] Task-Agnostic Amortized Inference of Gaussian Process Hyperparameters
    Liu, Sulin
    Sun, Xingyuan
    Ramadge, Peter J.
    Adams, Ryan P.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [40] Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training
    Liu, Yuanxin
    Meng, Fandong
    Lin, Zheng
    Fu, Peng
    Cao, Yanan
    Wang, Weiping
    Zhou, Jie
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5840 - 5857