Efficient semantic segmentation with pyramidal fusion

Cited by: 61
Authors
Orsic, Marin [1]
Segvic, Sinisa [1]
Affiliation
[1] Univ Zagreb, Fac Elect Engn & Comp, Unska 3, Zagreb 10000, Croatia
Keywords
Semantic segmentation; Real-time inference; Shared resolution pyramid; Computer vision; Deep learning; NETWORKS;
DOI
10.1016/j.patcog.2020.107611
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Emergence of large datasets and resilience of convolutional models have enabled successful training of very large semantic segmentation models. However, high capacity implies high computational complexity and therefore hinders real-time operation. We therefore study compact architectures which aim at high accuracy in spite of modest capacity. We propose a novel semantic segmentation approach based on shared pyramidal representation and fusion of heterogeneous features along the upsampling path. The proposed pyramidal fusion approach is especially effective for dense inference in images with large scale variance due to strong regularization effects induced by feature sharing across the resolution pyramid. Interpretation of the decision process suggests that our approach succeeds by acting as a large ensemble of relatively simple models, as well as due to large receptive range and strong gradient flow towards early layers. Our best model achieves 76.4% mIoU on Cityscapes test and runs in real time on low-power embedded devices. (c) 2020 Elsevier Ltd. All rights reserved.
Pages: 13
Related papers
  • [1] Cross-form efficient attention pyramidal network for semantic image segmentation
    Maurya, Anamika
    Chand, Satish
    [J]. AI COMMUNICATIONS, 2022, 35 (03) : 225 - 242
  • [2] Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video
    Jain, Samvit
    Wang, Xin
    Gonzalez, Joseph E.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8858 - 8867
  • [3] Efficient cross-information fusion decoder for semantic segmentation
    Zhang, Songyang
    Ren, Ge
    Zeng, Xiaoxi
    Zhang, Liang
    Du, Kailun
    Liu, Gege
    Lin, Hong
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 240
  • [4] Efficient Depth Fusion Transformer for Aerial Image Semantic Segmentation
    Yan, Li
    Huang, Jianming
    Xie, Hong
    Wei, Pengcheng
    Gao, Zhao
    [J]. REMOTE SENSING, 2022, 14 (05)
  • [5] Fully Convolutional Pyramidal Networks for Semantic Segmentation
    Li, Fengxiao
    Long, Zourong
    He, Peng
    Feng, Peng
    Guo, Xiaodong
    Ren, Xuezhi
    Wei, Biao
    Zhao, Mingfu
    Tang, Bin
    [J]. IEEE ACCESS, 2020, 8 : 229132 - 229140
  • [6] Pyramidal region context module for semantic segmentation
    Liang, Tingting
    Zhao, Qijie
    Wang, Zhuoying
    Shan, Kaiyu
    Zhang, Huan
    Wang, Yongtao
    Tang, Zhi
    [J]. ACM International Conference Proceeding Series, 2019
  • [7] Importance-Aware Semantic Segmentation with Efficient Pyramidal Context Network for Navigational Assistant Systems
    Xiang, Kaite
    Wang, Kaiwei
    Yang, Kailun
    [J]. 2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3412 - 3418
  • [8] Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation
    Hsiao, Chi-Wei
    Sun, Cheng
    Chen, Hwann-Tzong
    Sun, Min
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7117 - 7126
  • [9] An Efficient Approach to Semantic Segmentation
    Csurka, Gabriela
    Perronnin, Florent
    [J]. International Journal of Computer Vision, 2011, 95 : 198 - 212
  • [10] Efficient Transductive Semantic Segmentation
    Alvarez, Jose M.
    Salzmann, Mathieu
    Barnes, Nick
    [J]. 2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016