Automated Model Hardening with Reinforcement Learning for On-Orbit Object Detectors with Convolutional Neural Networks

Cited by: 3
Authors
Shi, Qi [1 ,2 ]
Li, Lu [1 ,2 ]
Feng, Jiaqi [1 ,2 ]
Chen, Wen [1 ,2 ]
Yu, Jinpei [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100039, Peoples R China
Keywords
on-orbit object detection; fault tolerance analysis; selective hardening; reinforcement learning
DOI
10.3390/aerospace10010088
Chinese Library Classification
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
On-orbit object detection has received extensive attention in space-oriented artificial intelligence (AI) research. Deep-learning-based object-detection algorithms are often computationally intensive and rely on high-performance devices to run. However, those devices usually lack space-qualified versions, and if deployed directly on a satellite platform they can hardly meet reliability requirements because of software errors induced by the space environment. In this paper, we evaluate the impact of space-environment-induced software errors on object-detection algorithms through large-scale fault injection tests. In addition to silent data corruption (SDC), we propose an extended criterion, SDC-0.1, to better quantify the effect of transient faults on object-detection algorithms. Because a single bit-flip error can severely corrupt detection results in many cases, we propose a novel automated model hardening with reinforcement learning (AMHR) framework to address this problem. AMHR searches for error-sensitive kernels in a convolutional neural network (CNN) through trial and error with a deep deterministic policy gradient (DDPG) agent, and applies fine-grained modular-level redundancy to increase the fault tolerance of CNN-based object detectors. Compared with other selective hardening methods, AMHR achieved the lowest SDC-0.1 rates for various detectors and improved the mean average precision (mAP) of the SSD detector by 28.8 in the presence of multiple errors.
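The abstract's point that a single bit flip can severely corrupt a result can be illustrated with a minimal sketch. It assumes weights are stored as IEEE 754 float32 values; the helper name `flip_bit` and the example weight are hypothetical, and this is not the authors' fault injection tool.

```python
import struct


def flip_bit(value: float, bit: int) -> float:
    """Flip one bit of `value` viewed as an IEEE 754 float32.

    Bit 0 is the least significant mantissa bit, bits 23-30 are the
    exponent, and bit 31 is the sign. (Hypothetical helper for illustration.)
    """
    if not 0 <= bit <= 31:
        raise ValueError("bit must be in [0, 31]")
    # Reinterpret the float32 bytes as an unsigned 32-bit integer.
    (as_int,) = struct.unpack("<I", struct.pack("<f", value))
    # XOR with a one-hot mask flips exactly one bit.
    (flipped,) = struct.unpack("<f", struct.pack("<I", as_int ^ (1 << bit)))
    return flipped


weight = 0.5  # assumed example weight, exactly representable in float32

low_mantissa = flip_bit(weight, 0)    # tiny perturbation
high_exponent = flip_bit(weight, 30)  # magnitude jumps to 2**127
sign = flip_bit(weight, 31)           # value becomes -0.5

print(low_mantissa, high_exponent, sign)
```

A flip in a low mantissa bit barely changes the weight, while a flip in the top exponent bit changes it by dozens of orders of magnitude, which is one reason some parts of a network may be far more error-sensitive than others.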
Pages: 19
Related papers
50 in total
  • [1] Reinforcement Learning via Recurrent Convolutional Neural Networks
    Shankar, Tanmay
    Dwivedy, Santosha K.
    Guha, Prithwijit
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2592 - 2597
  • [2] Lightweight model for On-Orbit optical object detection
    Lyu X.
    Xia Y.
    Zhao J.
    Qiao P.
    National Remote Sensing Bulletin, 2024, 28 (04) : 1041 - 1051
  • [3] Multiagent Reinforcement Learning for Hyperparameter Optimization of Convolutional Neural Networks
    Iranfar, Arman
    Zapater, Marina
    Atienza, David
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (04) : 1034 - 1047
  • [4] A Generalist Reinforcement Learning Agent for Compressing Convolutional Neural Networks
    Gonzalez-Sahagun, Gabriel
    Conant-Pablos, Santiago Enrique
    Ortiz-Bayliss, Jose Carlos
    Cruz-Duarte, Jorge M.
    IEEE ACCESS, 2024, 12 : 51100 - 51114
  • [5] Multiple Instance Learning Convolutional Neural Networks for Object Recognition
    Sun, Miao
    Han, Tony X.
    Liu, Ming-Chang
    Khodayari-Rostamabad, Ahmad
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3270 - 3275
  • [6] Coupled-learning convolutional neural networks for object recognition
    Xu, Chunyan
    Yang, Jian
    Gao, Junbin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (01) : 573 - 589
  • [7] Coupled-learning convolutional neural networks for object recognition
    Chunyan Xu
    Jian Yang
    Junbin Gao
    Multimedia Tools and Applications, 2019, 78 : 573 - 589
  • [8] Towards Automated Learning of Object Detectors
    Ebner, Marc
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT I, PROCEEDINGS, 2010, 6024 : 231 - 240
  • [9] On automated source selection for transfer learning in convolutional neural networks
    Afridi, Muhammad Jamal
    Ross, Arun
    Shapiro, Erik M.
    PATTERN RECOGNITION, 2018, 73 : 65 - 75
  • [10] Learning Abstract Snippet Detectors with Temporal Embedding in Convolutional Neural Networks
    Liu, Jiajun
    Zhao, Kun
    Kusy, Brano
    Wen, Ji-rong
    Zheng, Kai
    Jurdak, Raja
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 895 - 905