Adaptive Pyramid Context Network for Semantic Segmentation

被引:281
|
作者
He, Junjun [1 ,2 ]
Deng, Zhongying [1 ]
Zhou, Lei [1 ]
Wang, Yali [1 ]
Qiao, Yu [1 ,3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, SIAT SenseTime Joint Lab, Beijing, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR.2019.00770
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies witnessed that context features can significantly improve the performance of deep semantic segmentation networks. Current context based segmentation methods differ with each other in how to construct context features and perform differently in practice. This paper firstly introduces three desirable properties of context features in segmentation task. Specially, we find that Global-guided Local Affinity (GLA) can play a vital role in constructing effective context features, while this property has been largely ignored in previous works. Based on this analysis, this paper proposes Adaptive Pyramid Context Network (APCNet) for semantic segmentation. APCNet adaptively constructs multi-scale contextual representations with multiple well-designed Adaptive Context Modules (ACMs). Specifically, each ACM leverages a global image representation as a guidance to estimate the local affinity coefficients for each sub-region, and then calculates a context vector with these affinities. We empirically evaluate our APCNet on three semantic segmentation and scene parsing datasets, including PASCAL VOC 2012, Pascal-Context,and ADE20K dataset. Experimental results show that APCNet achieves state-of-the-art performance on all three benchmarks, and obtains a new record 84.2% on PASCAL VOC 2012 test set without MS COCO pre-trained and any post-processing.
引用
收藏
页码:7511 / 7520
页数:10
相关论文
共 50 条
  • [1] Pyramid Context Contrast for Semantic Segmentation
    Chen, Yuzhong
    Lin, Yangyang
    Niu, Yuzhen
    Ke, Xiao
    Huang, tengda
    [J]. IEEE ACCESS, 2019, 7 : 173679 - 173693
  • [2] Context-aware adaptive network for UDA semantic segmentation
    Yuan, Yu
    Shi, Jinlong
    Shu, Xin
    Qian, Qiang
    Song, Yunna
    Ou, Zhen
    Xu, Dan
    Zuo, Xin
    Yu, Yuecheng
    Sun, Yunhan
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [3] GPNet: Gated pyramid network for semantic segmentation
    Zhang, Yu
    Sun, Xin
    Dong, Junyu
    Chen, Changrui
    Lv, Qingxuan
    [J]. PATTERN RECOGNITION, 2021, 115
  • [4] Efficient Attention Pyramid Network for Semantic Segmentation
    Yang, Qirui
    Ku, Tao
    Hu, Kunyuan
    [J]. IEEE ACCESS, 2021, 9 : 18867 - 18875
  • [5] Global Attention Pyramid Network for Semantic Segmentation
    Zhang, Na
    Li, Jun
    Li, Yongrui
    Du, Yang
    [J]. PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8728 - 8732
  • [6] Enhanced Feature Pyramid Network for Semantic Segmentation
    Ye, Mucong
    Ouyang, Jingpeng
    Chen, Ge
    Zhang, Jing
    Yu, Xiaogang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3209 - 3216
  • [7] Volumetric Semantic Segmentation using Pyramid Context Features
    Barron, Jonathan T.
    Arbelaez, Pablo
    Keraenen, Soile V. E.
    Biggin, Mark D.
    Knowles, David W.
    Malik, Jitendra
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3448 - 3455
  • [8] Adaptive pyramid and semantic graph:: Knowledge driven segmentation
    Deruyver, A
    Hodé, Y
    Leammer, E
    Jolion, JM
    [J]. GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, PROCEEDINGS, 2005, 3434 : 213 - 222
  • [9] Enhanced-feature pyramid network for semantic segmentation
    Quyen, Van Toan
    Lee, Jong Hyuk
    Kim, Min Young
    [J]. 2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 782 - 787
  • [10] PCANet: Pyramid convolutional attention network for semantic segmentation
    Sang, Haiwei
    Zhou, Qiuhao
    Zhao, Yong
    [J]. IMAGE AND VISION COMPUTING, 2020, 103 (103)