Automated Model Hardening with Reinforcement Learning for On-Orbit Object Detectors with Convolutional Neural Networks

被引：3

作者：

Shi, Qi ^{[1
,2
]}

Li, Lu ^{[1
,2
]}

Feng, Jiaqi ^{[1
,2
]}

Chen, Wen ^{[1
,2
]}

Yu, Jinpei ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100039, Peoples R China

来源：

AEROSPACE | 2023年 / 10卷 / 01期

关键词：

on-orbit object detection; fault tolerance analysis; selective hardening; reinforcement learning;

D O I：

10.3390/aerospace10010088

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

On-orbit object detection has received extensive attention in the field of artificial intelligence (AI) in space research. Deep-learning-based object-detection algorithms are often computationally intensive and rely on high-performance devices to run. However, those devices usually lack space-qualified versions, and they can hardly meet the reliability requirement if directly deployed on a satellite platform, due to software errors induced by the space environment. In this paper, we evaluated the impact of space-environment-induced software errors on object-detection algorithms through large-scale fault injection tests. Aside from silent data corruption (SDC), we propose an extended criterial SDC-0.1 to better quantify the effect of the transient faults on the object-detection algorithms. Considering that a bit-flip error could cause severe detection result corruption in many cases, we propose a novel automated model hardening with reinforcement learning (AMHR) framework to solve this problem. AMHR searches for error-sensitive kernels in a convolutional neural network (CNN) through trial and error with a deep deterministic policy gradient (DDPG) agent and has fine-grained modular-level redundancy to increase the fault tolerance of the CNN-based object detectors. Compared to other selective hardening methods, AMHR achieved the lowest SDC-0.1 rates for various detectors and could tremendously improve the mean average precision (mAP) of the SSD detector by 28.8 in the presence of multiple errors.

引用

页数：19

共 50 条

[1] Reinforcement Learning via Recurrent Convolutional Neural Networks
Shankar, Tanmay
Dwivedy, Santosha K.
Guha, Prithwijit
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2592 - 2597
[2] Lightweight model for On-Orbit optical object detection
Lyu X.
Xia Y.
Zhao J.
Qiao P.
National Remote Sensing Bulletin, 2024, 28 (04) : 1041 - 1051
[3] Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks
Iranfar, Arman
Zapater, Marina
Atienza, David
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (04) : 1034 - 1047
[4] A Generalist Reinforcement Learning Agent for Compressing Convolutional Neural Networks
Gonzalez-Sahagun, Gabriel
Conant-Pablos, Santiago Enrique
Ortiz-Bayliss, Jose Carlos
Cruz-Duarte, Jorge M.
IEEE ACCESS, 2024, 12 : 51100 - 51114
[5] Multiple Instance Learning Convolutional Neural Networks for Object Recognition
Sun, Miao
Han, Tony X.
Liu, Ming-Chang
Khodayari-Rostamabad, Ahmad
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3270 - 3275
[6] Coupled-learning convolutional neural networks for object recognition
Xu, Chunyan
Yang, Jian
Gao, Junbin
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (01) : 573 - 589
[7] Coupled-learning convolutional neural networks for object recognition
Chunyan Xu
Jian Yang
Junbin Gao
Multimedia Tools and Applications, 2019, 78 : 573 - 589
[8] Towards Automated Learning of Object Detectors
Ebner, Marc
APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT I, PROCEEDINGS, 2010, 6024 : 231 - 240
[9] On automated source selection for transfer learning in convolutional neural networks
Afridi, Muhammad Jamal
Ross, Arun
Shapiro, Erik M.
PATTERN RECOGNITION, 2018, 73 : 65 - 75
[10] Learning Abstract Snippet Detectors with Temporal Embedding in Convolutional Neural Networks
Liu, Jiajun
Zhao, Kun
Kusy, Brano
Wen, Ji-rong
Zheng, Kai
Jurdak, Raja
2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 895 - 905

← 1 2 3 4 5 →