Automated Model Hardening with Reinforcement Learning for On-Orbit Object Detectors with Convolutional Neural Networks

被引:3
|
作者
Shi, Qi [1 ,2 ]
Li, Lu [1 ,2 ]
Feng, Jiaqi [1 ,2 ]
Chen, Wen [1 ,2 ]
Yu, Jinpei [1 ,2 ]
机构
[1] Chinese Acad Sci, Innovat Acad Microsatellites, Shanghai 201306, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100039, Peoples R China
关键词
on-orbit object detection; fault tolerance analysis; selective hardening; reinforcement learning;
D O I
10.3390/aerospace10010088
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
On-orbit object detection has received extensive attention in the field of artificial intelligence (AI) in space research. Deep-learning-based object-detection algorithms are often computationally intensive and rely on high-performance devices to run. However, those devices usually lack space-qualified versions, and they can hardly meet the reliability requirement if directly deployed on a satellite platform, due to software errors induced by the space environment. In this paper, we evaluated the impact of space-environment-induced software errors on object-detection algorithms through large-scale fault injection tests. Aside from silent data corruption (SDC), we propose an extended criterial SDC-0.1 to better quantify the effect of the transient faults on the object-detection algorithms. Considering that a bit-flip error could cause severe detection result corruption in many cases, we propose a novel automated model hardening with reinforcement learning (AMHR) framework to solve this problem. AMHR searches for error-sensitive kernels in a convolutional neural network (CNN) through trial and error with a deep deterministic policy gradient (DDPG) agent and has fine-grained modular-level redundancy to increase the fault tolerance of the CNN-based object detectors. Compared to other selective hardening methods, AMHR achieved the lowest SDC-0.1 rates for various detectors and could tremendously improve the mean average precision (mAP) of the SSD detector by 28.8 in the presence of multiple errors.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Automatic Hyperparameter Tuning in Deep Convolutional Neural Networks Using Asynchronous Reinforcement Learning
    Neary, Patrick L.
    2018 IEEE INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING (ICCC), 2018, : 73 - 77
  • [42] Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks
    Pardo, Fabio
    Levdik, Vitaly
    Kormushev, Petar
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5355 - 5362
  • [43] Learning joint features for color and depth images with Convolutional Neural Networks for object classification
    Santana, Eder
    Dockendorf, Karl
    Principe, Jose C.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1320 - 1323
  • [44] Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Xu, Dong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 265 - 278
  • [45] Learning Convolutional Neural Networks for Graphs
    Niepert, Mathias
    Ahmed, Mohamed
    Kutzkov, Konstantin
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [46] INCREMENTAL LEARNING OF CONVOLUTIONAL NEURAL NETWORKS
    Medera, Dusan
    Babinec, Stefan
    IJCCI 2009: PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2009, : 547 - +
  • [47] Real-Time Object Navigation With Deep Neural Networks and Hierarchical Reinforcement Learning
    Staroverov, Aleksey
    Yudin, Dmitry A.
    Belkin, Ilya
    Adeshkin, Vasily
    Solomentsev, Yaroslav K.
    Panov, Aleksandr I.
    IEEE ACCESS, 2020, 8 : 195608 - 195621
  • [48] Automated tea quality identification based on deep convolutional neural networks and transfer learning
    Zhang, Cheng
    Wang, Jin
    Lu, Guodong
    Fei, Shaomei
    Zheng, Tao
    Huang, Bincheng
    JOURNAL OF FOOD PROCESS ENGINEERING, 2023, 46 (04)
  • [49] Reinforcement Learning with Neural Networks: A Survey
    Modi, Bhumika
    Jethva, H. B.
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS: VOL 1, 2016, 50 : 467 - 475
  • [50] Enhanced Online Convolutional Neural Networks for Object Tracking
    Zhang, Dengzhuo
    Gao, Yun
    Zhou, Hao
    Li, Tianwen
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696