Task-Agnostic Safety for Reinforcement Learning

被引：1

作者：

Rahman, Md Asifur ^{[1
]}

Alqahtani, Sarra ^{[1
]}

机构：

[1] Wake Forest Univ, Winston Salem, NC 27101 USA

来源：

PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023 | 2023年

基金：

美国国家科学基金会;

关键词：

Reinforcement Learning; safety; attacks; robustness;

D O I：

10.1145/3605764.3623913

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) has been an attractive potential for designing autonomous systems due to its learning-by-exploration approach. However, this learning process makes RL inherently vulnerable and thus unsuitable for applications where safety is a top priority. To address this issue, researchers have either jointly optimized task and safety or imposed constraints to restrict exploration. This paper takes a different approach, by utilizing exploration as an adaptive means to learn a robust and safe behavior. To this end, we propose Task-Agnostic Safety for Reinforcement Learning (TAS-RL) framework to ensure safety in RL by learning a representation of unsafe behaviors and excluding them from the agent's policy. TAS-RL is task-agnostic and can be integrated with any RL task policy in the same environment, providing a self-protection layer for the system. To evaluate the robustness of TAS-RL, we present a novel study where TAS-RL and 7 safe RL baselines are tested in constrained Markov decision processes (CMDP) environments under white-box action space perturbations and changes in the environment dynamics. The results show that TAS-RL outperforms all baselines by achieving consistent near-zero safety constraint violations in continuous action space with 10 times more variations in the testing environment dynamics.

引用

页码：139 / 148

页数：10

共 50 条

[41] TADA: Efficient Task-Agnostic Domain Adaptation for Transformers
Hung, Chia-Chien
Lange, Lukas
Stroetgen, Jannik
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 487 - 503
[42] COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation
Darrin, Maxime
Formont, Philippe
CilEuNG, Jackie Chi Kit
Piantanida, Pablo
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12696 - 12717
[43] Towards Learning Generalizable Code Embeddings Using Task-agnostic Graph Convolutional Networks
Ding, Zishuo
Li, Heng
Shang, Weiyi
Chen, Tse-Hsun
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2023, 32 (02)
[44] CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration
Yang, Qisong
Spaan, Matthijs T. J.
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10798 - 10806
[45] Task-Agnostic Privacy-Preserving Representation Learning for Federated Learning against Attribute Inference Attacks
Arevalo, Caridad Arroyo
Noorbakhsh, Sayedeh Leila
Dong, Yun
Hong, Yuan
Wang, Binghui
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 10909 - 10917
[46] Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning
Shin, Kyuyong
Kwak, Hanock
Kim, Wonjae
Jeong, Jisu
Jung, Seungjae
Kim, Kyung-Min
Ha, Jung-Woo
Lee, Sang-Woo
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1146 - 1161
[47] Task-Agnostic Continual Hippocampus Segmentation for Smooth Population Shifts
Gonzalez, Camila
Ranem, Amin
Othman, Ahmed
Mukhopadhyay, Anirban
DOMAIN ADAPTATION AND REPRESENTATION TRANSFER (DART 2022), 2022, 13542 : 108 - 118
[48] FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling
Lu, Hao
Liu, Wenze
Fu, Hongtao
Cao, Zhiguo
COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 231 - 247
[49] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Ye, Seonghyeon
Hwang, Hyeonbin
Yang, Sohee
Yun, Hyeongu
Kim, Yireun
Seo, Minjoon
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19386 - 19394
[50] Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion
Trong Nghia Hoang
Chi Thanh Lam
Low, Bryan Kian Hsiang
Jaillet, Patrick
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119

← 1 2 3 4 5 →