Task-Agnostic Safety for Reinforcement Learning

被引:1
|
作者
Rahman, Md Asifur [1 ]
Alqahtani, Sarra [1 ]
机构
[1] Wake Forest Univ, Winston Salem, NC 27101 USA
基金
美国国家科学基金会;
关键词
Reinforcement Learning; safety; attacks; robustness;
D O I
10.1145/3605764.3623913
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) has been an attractive potential for designing autonomous systems due to its learning-by-exploration approach. However, this learning process makes RL inherently vulnerable and thus unsuitable for applications where safety is a top priority. To address this issue, researchers have either jointly optimized task and safety or imposed constraints to restrict exploration. This paper takes a different approach, by utilizing exploration as an adaptive means to learn a robust and safe behavior. To this end, we propose Task-Agnostic Safety for Reinforcement Learning (TAS-RL) framework to ensure safety in RL by learning a representation of unsafe behaviors and excluding them from the agent's policy. TAS-RL is task-agnostic and can be integrated with any RL task policy in the same environment, providing a self-protection layer for the system. To evaluate the robustness of TAS-RL, we present a novel study where TAS-RL and 7 safe RL baselines are tested in constrained Markov decision processes (CMDP) environments under white-box action space perturbations and changes in the environment dynamics. The results show that TAS-RL outperforms all baselines by achieving consistent near-zero safety constraint violations in continuous action space with 10 times more variations in the testing environment dynamics.
引用
收藏
页码:139 / 148
页数:10
相关论文
共 50 条
  • [41] TADA: Efficient Task-Agnostic Domain Adaptation for Transformers
    Hung, Chia-Chien
    Lange, Lukas
    Stroetgen, Jannik
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 487 - 503
  • [42] COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation
    Darrin, Maxime
    Formont, Philippe
    CilEuNG, Jackie Chi Kit
    Piantanida, Pablo
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12696 - 12717
  • [43] Towards Learning Generalizable Code Embeddings Using Task-agnostic Graph Convolutional Networks
    Ding, Zishuo
    Li, Heng
    Shang, Weiyi
    Chen, Tse-Hsun
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (02)
  • [44] CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration
    Yang, Qisong
    Spaan, Matthijs T. J.
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10798 - 10806
  • [45] Task-Agnostic Privacy-Preserving Representation Learning for Federated Learning against Attribute Inference Attacks
    Arevalo, Caridad Arroyo
    Noorbakhsh, Sayedeh Leila
    Dong, Yun
    Hong, Yuan
    Wang, Binghui
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 10909 - 10917
  • [46] Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning
    Shin, Kyuyong
    Kwak, Hanock
    Kim, Wonjae
    Jeong, Jisu
    Jung, Seungjae
    Kim, Kyung-Min
    Ha, Jung-Woo
    Lee, Sang-Woo
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1146 - 1161
  • [47] Task-Agnostic Continual Hippocampus Segmentation for Smooth Population Shifts
    Gonzalez, Camila
    Ranem, Amin
    Othman, Ahmed
    Mukhopadhyay, Anirban
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER (DART 2022), 2022, 13542 : 108 - 118
  • [48] FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling
    Lu, Hao
    Liu, Wenze
    Fu, Hongtao
    Cao, Zhiguo
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 231 - 247
  • [49] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
    Ye, Seonghyeon
    Hwang, Hyeonbin
    Yang, Sohee
    Yun, Hyeongu
    Kim, Yireun
    Seo, Minjoon
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19386 - 19394
  • [50] Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion
    Trong Nghia Hoang
    Chi Thanh Lam
    Low, Bryan Kian Hsiang
    Jaillet, Patrick
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119