Panoptic SwiftNet: Pyramidal Fusion for Real-Time Panoptic Segmentation

被引:3
|
作者
Saric, Josip [1 ]
Orsic, Marin [2 ]
Segvic, Sinisa [1 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Zagreb 10000, Croatia
[2] Microblink, Zagreb 10000, Croatia
关键词
panoptic segmentation; real-time processing; satellite imagery; deep learning; computer vision; SCENE;
D O I
10.3390/rs15081968
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Dense panoptic prediction is a key ingredient in many existing applications such as autonomous driving, automated warehouses, or remote sensing. Many of these applications require fast inference over large input resolutions on affordable or even embedded hardware. We proposed to achieve this goal by trading off backbone capacity for multi-scale feature extraction. In comparison with contemporaneous approaches to panoptic segmentation, the main novelties of our method are efficient scale-equivariant feature extraction, cross-scale upsampling through pyramidal fusion and boundary-aware learning of pixel-to-instance assignment. The proposed method is very well suited for remote sensing imagery due to the huge number of pixels in typical city-wide and region-wide datasets. We present panoptic experiments on Cityscapes, Vistas, COCO, and the BSB-Aerial dataset. Our models outperformed the state-of-the-art on the BSB-Aerial dataset while being able to process more than a hundred 1MPx images per second on an RTX3090 GPU with FP16 precision and TensorRT optimization.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Real-Time Panoptic Segmentation with Prototype Masks for Automated Driving
    Petrovai, Andra
    Nedevschi, Sergiu
    [J]. 2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1400 - 1406
  • [2] Center Focusing Network for Real-Time LiDAR Panoptic Segmentation
    Li, Xiaoyan
    Zhang, Gang
    Wang, Boyue
    Hu, Yongli
    Yin, Baocai
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13425 - 13434
  • [3] LiDAR-Based Real-Time Panoptic Segmentation via Spatiotemporal Sequential Data Fusion
    Wang, Weiqi
    You, Xiong
    Yang, Jian
    Su, Mingzhan
    Zhang, Lantian
    Yang, Zhenkai
    Kuang, Yingcai
    [J]. REMOTE SENSING, 2022, 14 (08)
  • [4] C-YOSO: Contrastive Query on Real-Time Panoptic Segmentation
    Plabplathong, Chananvich
    Rojviboonchai, Kultida
    Vateekul, Peerapon
    [J]. IEEE Access, 2024, 12 : 177355 - 177367
  • [5] You Only Segment Once: Towards Real-Time Panoptic Segmentation
    Hu, Jie
    Huang, Linyan
    Ren, Tianhe
    Zhang, Shengchuan
    Ji, Rongrong
    Cao, Liujuan
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17819 - 17829
  • [6] Panoptic-PHNet: Towards Real-Time and High-Precision LiDAR Panoptic Segmentation via Clustering Pseudo Heatmap
    Li, Jinke
    He, Xiao
    Wen, Yang
    Gao, Yuan
    Cheng, Xiaoqiang
    Zhang, Dan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11799 - 11808
  • [7] Real-time panoptic segmentation with relationship between adjacent pixels and boundary prediction
    Zhang, Xiaoliang
    Li, Hongliang
    Wang, Lanxiao
    Cheng, Haoyang
    Qiu, Heqian
    Hu, Wenzhe
    Meng, Fanman
    Wu, Qingbo
    [J]. NEUROCOMPUTING, 2022, 506 : 290 - 299
  • [8] Real-time panoptic segmentation with relationship between adjacent pixels and boundary prediction
    Zhang, Xiaoliang
    Li, Hongliang
    Wang, Lanxiao
    Cheng, Haoyang
    Qiu, Heqian
    Hu, Wenzhe
    Meng, Fanman
    Wu, Qingbo
    [J]. Neurocomputing, 2022, 506 : 290 - 299
  • [9] Panoptic Segmentation
    Kirillov, Alexander
    He, Kaiming
    Girshick, Ross
    Rother, Carsten
    Dollar, Piotr
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9396 - 9405
  • [10] SwiftNet: Real-time Video Object Segmentation
    Wang, Haochen
    Jiang, Xiaolong
    Ren, Haibing
    Hu, Yao
    Bai, Song
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1296 - 1305