Satellite Component Semantic Segmentation: Video Dataset and Real-Time Pyramid Attention and Decoupled Attention Network

被引:3
|
作者
Shao, Yadong [1 ,2 ]
Wu, Aodi [2 ]
Li, Shengyang [3 ,4 ]
Shu, Leizheng [3 ,4 ]
Wan, Xue [3 ,4 ]
Shao, Yuanbin
Huo, Junyan [5 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
[2] Univ Chinese Acad Sci, Sch Aeronaut & Astronaut, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Technol & Engn Ctr Space Utilizat, Beijing 100094, Peoples R China
[4] Chinese Acad Sci, Key Lab Space Utilizat, Beijing 100094, Peoples R China
[5] Univ Warwick, Dept Comp Sci, Coventry CV4 8UW, England
关键词
Satellites; Semantic segmentation; Task analysis; Real-time systems; Streaming media; Solid modeling; Semantics;
D O I
10.1109/TAES.2023.3282608
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
High-accuracy and real-time satellite component semantic segmentation can locate the key satellite components, such as solar panels, to be operated in on-orbit services, which is of great significance for navigation and control. However, to accomplish the above aim, two main challenges remain unsolved. First, satellite component semantic segmentation algorithms require a large number of images for training; however, on-orbit satellite images are difficult to obtain, especially for a large-scale satellite component video dataset. In addition, high-accuracy semantic segmentation networks require relatively more computation resources, which are difficult to be fulfilled in on-orbit tasks. How to build a satellite component semantic segmentation network that meets the requirements of both high-accuracy and real-time on-orbit operation is the key aim to be accomplished in this article. In this article, a simulated satellite component dataset consisting of 98 video sequences of 13 satellites, with a complex background, various on-orbit illumination, and common satellite motion, is proposed, and it has 32 402 frames in total. To meet the requirements of both high-accuracy and real-time on-orbit operation, this article proposes an attention-based real-time network, Pyramid Attention and Decoupled Attention Network (PADAN), which contains an image-based version, PADAN-S, and a video-based version, PADAN-T. The PADAN-S, which mainly adopts pyramid attention calculation on three-layer pyramid features and then performs decoupled attention calculation by considering both row and column attention, is based on AttaNet. The PADAN-T uses a part of the PADAN-S to obtain temporal pyramid features from temporal frames, then performs decoupled attention calculations between the features of the output frame and the features at each layer in the temporal pyramid. The experimental results show that the PADAN-S and PADAN-T have superior performance compared to other real-time state-of-the-art algorithms in accuracy in both image-based and video-based satellite component semantic segmentation tasks on simulation datasets, and our dataset has a degree of simulating the real on-orbit environment. The PADAN-S can achieve a speed of 10.25 frames per second with an image solution of 1280 pixels x 720 pixels on the edge computing device Jetson Xavier, and the PADAN-T can obtain a speed of 7.18 frames per second.
引用
收藏
页码:7315 / 7333
页数:19
相关论文
共 50 条
  • [1] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Hu, Xuegang
    Jing, Liyuan
    Sehar, Uroosa
    [J]. APPLIED INTELLIGENCE, 2022, 52 (01) : 580 - 594
  • [2] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Xuegang Hu
    Liyuan Jing
    Uroosa Sehar
    [J]. Applied Intelligence, 2022, 52 : 580 - 594
  • [3] Contextual Attention Refinement Network for Real-Time Semantic Segmentation
    Hao, Shijie
    Zhou, Yuan
    Zhang, Youming
    Guo, Yanrong
    [J]. IEEE ACCESS, 2020, 8 : 55230 - 55240
  • [4] A lightweight network with attention decoder for real-time semantic segmentation
    Wang, Kang
    Yang, Jinfu
    Yuan, Shuai
    Li, Mingai
    [J]. VISUAL COMPUTER, 2022, 38 (07): : 2329 - 2339
  • [5] A lightweight network with attention decoder for real-time semantic segmentation
    Kang Wang
    Jinfu Yang
    Shuai Yuan
    Mingai Li
    [J]. The Visual Computer, 2022, 38 : 2329 - 2339
  • [6] Real-Time Semantic Segmentation With Fast Attention
    Hu, Ping
    Perazzi, Federico
    Heilbron, Fabian Caba
    Wang, Oliver
    Lin, Zhe
    Saenko, Kate
    Sclaroff, Stan
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (01) : 263 - 270
  • [7] Global Attention Pyramid Network for Semantic Segmentation
    Zhang, Na
    Li, Jun
    Li, Yongrui
    Du, Yang
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8728 - 8732
  • [8] Efficient Attention Pyramid Network for Semantic Segmentation
    Yang, Qirui
    Ku, Tao
    Hu, Kunyuan
    [J]. IEEE ACCESS, 2021, 9 : 18867 - 18875
  • [9] Attention based lightweight asymmetric network for real-time semantic segmentation
    Liu, Qian
    Wang, Cunbao
    Li, Zhensheng
    Qi, Youwei
    Fang, Jiongtao
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 130
  • [10] LSNet: Real-time attention semantic segmentation network with linear complexity
    Sheng, Pengpeng
    Shi, Yanli
    Liu, Xin
    Jin, Huan
    [J]. NEUROCOMPUTING, 2022, 509 : 94 - 101