Automated Model Hardening with Reinforcement Learning for On-Orbit Object Detectors with Convolutional Neural Networks

Cited by: 3

Authors:

Shi, Qi [1,2]

Li, Lu [1,2]

Feng, Jiaqi [1,2]

Chen, Wen [1,2]

Yu, Jinpei [1,2]

Affiliations:

[1] Chinese Academy of Sciences, Innovation Academy for Microsatellites, Shanghai 201306, People's Republic of China

[2] University of Chinese Academy of Sciences, Beijing 100039, People's Republic of China

Source:

Aerospace | 2023 / Vol. 10 / Issue 1

Keywords:

on-orbit object detection; fault tolerance analysis; selective hardening; reinforcement learning

DOI:

10.3390/aerospace10010088

Chinese Library Classification (CLC) number:

V [Aviation, Aerospace]

Discipline classification code:

08; 0825

Abstract:

On-orbit object detection has received extensive attention in the field of artificial intelligence (AI) in space research. Deep-learning-based object-detection algorithms are often computationally intensive and rely on high-performance devices to run. However, those devices usually lack space-qualified versions, and they can hardly meet reliability requirements if deployed directly on a satellite platform, owing to software errors induced by the space environment. In this paper, we evaluate the impact of space-environment-induced software errors on object-detection algorithms through large-scale fault injection tests. In addition to silent data corruption (SDC), we propose an extended criterion, SDC-0.1, to better quantify the effect of transient faults on object-detection algorithms. Because a bit-flip error can severely corrupt detection results in many cases, we propose a novel automated model hardening with reinforcement learning (AMHR) framework to address this problem. AMHR searches for error-sensitive kernels in a convolutional neural network (CNN) through trial and error with a deep deterministic policy gradient (DDPG) agent and applies fine-grained modular-level redundancy to increase the fault tolerance of CNN-based object detectors. Compared with other selective hardening methods, AMHR achieved the lowest SDC-0.1 rates for various detectors and improved the mean average precision (mAP) of the SSD detector by 28.8 in the presence of multiple errors.
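
The bit-flip errors mentioned in the abstract can be illustrated with a small example. The following is a minimal sketch, assuming weights are stored as IEEE 754 float32 values; `flip_bit` is a hypothetical helper written for illustration and is not the fault injector used by the authors. It shows why a single flipped bit can matter: depending on its position, the same fault may barely change a value or turn it into infinity.

```python
import struct


def flip_bit(value: float, bit: int) -> float:
    """Return `value`, encoded as IEEE 754 float32, with one bit inverted.

    bit 31 is the sign, bits 30..23 the exponent, bits 22..0 the mantissa.
    This is an illustrative model of a transient fault, not the paper's tool.
    """
    if not 0 <= bit < 32:
        raise ValueError("bit must be in the range [0, 31]")
    # Reinterpret the float32 as a 32-bit unsigned integer.
    (raw,) = struct.unpack("<I", struct.pack("<f", value))
    # Invert the selected bit and reinterpret the result as float32.
    (flipped,) = struct.unpack("<f", struct.pack("<I", raw ^ (1 << bit)))
    return flipped


if __name__ == "__main__":
    weight = 1.0
    for bit in (0, 23, 30, 31):
        print(f"bit {bit:2d}: {weight} -> {flip_bit(weight, bit)}")
```

In this sketch, flipping the lowest mantissa bit changes the weight only slightly, while flipping the highest exponent bit of 1.0 yields infinity, which would propagate through every later layer that consumes it.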

Pages: 19
