SPNet: Superpixel Pyramid Network for Scene Parsing

Cited by: 0
Authors
Xu, Bingbing [1,2]
Yang, Fei [1,2]
Yang, Jinfu [1,2]
Wu, Suishuo [1,2]
Shan, Yi [1,2]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
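Keywords
scene parsing; deep coding-decoding network; pyramid pooling structure; superpixel segmentation
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract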
Scene parsing is an important part of computer vision research, and deep encoder-decoder (coding-decoding) networks are widely applied to it. However, some problems remain, such as ambiguous segmentation of object edges and uncertainty when segmenting small objects. In this paper, we propose the Superpixel Pyramid Network (SPNet) for scene parsing. First, a deep encoder-decoder network is used to learn image features. Then, a multi-scale spatial pyramid pooling structure is employed to improve performance on small objects. Next, superpixel segmentation is applied to address the ambiguity of object edges. Finally, a two-layer neural network classifier labels the fused features pixel by pixel. Extensive experiments on ADE20K, PASCAL VOC 2012, and CamVid demonstrate that the proposed method obtains better performance than its counterparts.
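The abstract does not say how the pyramid-pooled features and the superpixels are fused, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes a single-channel feature map, average pooling over a grid at each pyramid level, and fusion by averaging feature vectors inside each superpixel. The function names `pyramid_pool` and `superpixel_average` and the toy label map are hypothetical; the encoder-decoder network and the two-layer classifier are omitted.

```python
from collections import defaultdict


def pyramid_pool(feat, bins=(1, 2)):
    """For each pixel, return [own value, mean of its grid cell per level].

    Assumption: each pyramid level splits the map into b x b cells and
    average-pools inside every cell (a stand-in for the paper's structure).
    """
    h, w = len(feat), len(feat[0])
    out = [[[feat[y][x]] for x in range(w)] for y in range(h)]
    for b in bins:
        sums, counts = defaultdict(float), defaultdict(int)
        for y in range(h):
            for x in range(w):
                cell = (y * b // h, x * b // w)
                sums[cell] += feat[y][x]
                counts[cell] += 1
        for y in range(h):
            for x in range(w):
                cell = (y * b // h, x * b // w)
                out[y][x].append(sums[cell] / counts[cell])
    return out


def superpixel_average(features, labels):
    """Replace each pixel's feature vector by the mean over its superpixel.

    Assumption: `labels` comes from some superpixel algorithm; here it is
    just a hand-written label map.
    """
    h, w = len(labels), len(labels[0])
    dim = len(features[0][0])
    sums = defaultdict(lambda: [0.0] * dim)
    counts = defaultdict(int)
    for y in range(h):
        for x in range(w):
            acc = sums[labels[y][x]]
            for k in range(dim):
                acc[k] += features[y][x][k]
            counts[labels[y][x]] += 1
    return [
        [[v / counts[labels[y][x]] for v in sums[labels[y][x]]] for x in range(w)]
        for y in range(h)
    ]


# Toy 4x4 feature map and a two-region "superpixel" map (left / right half).
feat = [[1, 1, 5, 5], [1, 1, 5, 5], [2, 2, 8, 8], [2, 2, 8, 8]]
labels = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 0, 1, 1], [0, 0, 1, 1]]

multi_scale = pyramid_pool(feat)
fused = superpixel_average(multi_scale, labels)
print(multi_scale[0][0])  # [1, 4.0, 1.0]
print(fused[0][0])        # [1.5, 4.0, 1.5]
```

In this sketch every pixel inside a superpixel ends up with the same fused vector, so a per-pixel classifier applied afterwards would produce labels that follow superpixel boundaries. That is one plausible reading of how superpixels could reduce edge ambiguity.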
Pages: 3690-3695
Page count: 6
Related papers
50 in total
  • [41] Horizon detection in maritime images using scene parsing network
    Jeong, C. Y.
    Yang, H. S.
    Moon, K. D.
    ELECTRONICS LETTERS, 2018, 54 (12) : 760 - 761
  • [42] Semantic combined network for zero-shot scene parsing
    Wang, Yinduo
    Zhang, Haofeng
    Wang, Shidong
    Long, Yang
    Yang, Longzhi
    IET IMAGE PROCESSING, 2020, 14 (04) : 757 - 765
  • [43] SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification
    Cheng, Gong
    Cai, Liming
    Lang, Chunbo
    Yao, Xiwen
    Chen, Jinyong
    Guo, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [44] Semantic Information Supplementary Pyramid Network for Dynamic Scene Deblurring
    Liu, Yiming
    Luo, Yifei
    Huang, Wenzhuo
    Qiao, Ying
    Li, Junhui
    Xu, Dahong
    Luo, Duqiang
    IEEE ACCESS, 2020, 8 : 188587 - 188599
  • [45] Lightweight and Efficient Multimodal Prompt Injection Network for Scene Parsing of Remote Sensing Scene Images
    Li, Yangzhen
    Zhou, Wujie
    Meng, Jiajun
    Yan, Weiqing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [46] Kernel Likelihood Estimation for Superpixel Image Parsing
    Ates, Hasan F.
    Sunetci, Sercan
    Ak, Kenan E.
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), 2016, 9730 : 234 - 242
  • [47] Hierarchical Parsing Net: Semantic Scene Parsing From Global Scene to Objects
    Shi, Hengcan
    Li, Hongliang
    Meng, Fanman
    Wu, Qingbo
    Xu, Linfeng
    Ngan, King Ngi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (10) : 2670 - 2682
  • [48] MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
    Yu, Jiashuo
    Cheng, Ying
    Zhao, Rui-Wei
    Feng, Rui
    Zhang, Yuejie
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022 : 6241 - 6249
  • [49] AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing
    Song, Qi
    Mei, Kangfu
    Huang, Rui
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2567 - 2575
  • [50] PSANet: Point-wise Spatial Attention Network for Scene Parsing
    Zhao, Hengshuang
    Zhang, Yi
    Liu, Shu
    Shi, Jianping
    Loy, Chen Change
    Lin, Dahua
    Jia, Jiaya
    COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 270 - 286