Convolutional neural networks rarely learn shape for semantic segmentation

Cited by: 1
Authors
Zhang, Yixin [1 ,5 ]
Mazurowski, Maciej A. [1 ,2 ,3 ,4 ,6 ]
Affiliations
[1] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA
[2] Duke Univ, Dept Radiol, Durham, NC USA
[3] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
[4] Duke Univ, Dept Comp Sci, Durham, NC USA
[5] 2424 Erwin Rd,Off 10072, Durham, NC 27705 USA
[6] Box 2731 Med Ctr, Durham, NC 27710 USA
Funding
National Institutes of Health (USA);
Keywords
Segmentation; Feature measurement; Machine learning; Computer vision; INFORMATION;
DOI
10.1016/j.patcog.2023.110018
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Shape learning, or the ability to leverage shape information, could be a desirable property of convolutional neural networks (CNNs) when target objects have specific shapes. While some research on the topic is emerging, no systematic study has conclusively determined whether and under what circumstances CNNs learn shape. Here, we present such a study in the context of segmentation networks, where shapes are particularly important. We define shape and propose a new behavioral metric to measure the extent to which a CNN utilizes shape information. We then execute a set of experiments with synthetic and real-world data to progressively uncover under which circumstances CNNs learn shape and what can be done to encourage such behavior. We conclude that (i) CNNs do not learn shape in typical settings but rather rely on other features available to identify the objects of interest; (ii) CNNs can learn shape, but only if the shape is the only feature available to identify the object; (iii) a sufficiently large receptive field size relative to the size of target objects is necessary for shape learning; (iv) a limited set of augmentations can encourage shape learning; and (v) learning shape is indeed useful in the presence of out-of-distribution data.
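Conclusion (iii) compares receptive field size with target object size. As a rough illustration only, the sketch below computes the theoretical receptive field of a plain stack of convolution and pooling layers using the standard recurrence. The function name, the layer stack, and the object size are hypothetical and are not taken from the paper, which does not state a specific threshold in this abstract.

```python
from typing import Iterable, Tuple


def receptive_field(layers: Iterable[Tuple[int, int, int]]) -> int:
    """Theoretical receptive field (in input pixels, along one axis) of a
    sequential stack of layers given as (kernel_size, stride, dilation).

    Hypothetical helper for illustration; not the paper's code.
    """
    rf = 1    # a single input pixel sees itself
    jump = 1  # spacing, in input pixels, between adjacent outputs
    for kernel, stride, dilation in layers:
        rf += (kernel - 1) * dilation * jump
        jump *= stride
    return rf


# Assumed toy encoder: three 3x3 convolutions, each followed by
# 2x2 pooling with stride 2.
encoder = [(3, 1, 1), (2, 2, 1), (3, 1, 1), (2, 2, 1), (3, 1, 1), (2, 2, 1)]

object_size = 64  # assumed object extent in pixels, illustration only
rf = receptive_field(encoder)
print(f"receptive field: {rf} px, ratio to object size: {rf / object_size:.2f}")
```

A ratio well below 1 means no single output location can see the whole object, which is the situation conclusion (iii) describes as insufficient for shape learning.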
Pages: 13
Related papers
50 in total
  • [31] Fully Convolutional Networks for Semantic Segmentation
    Long, Jonathan
    Shelhamer, Evan
    Darrell, Trevor
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3431 - 3440
  • [32] Fully Convolutional Networks for Semantic Segmentation
    Shelhamer, Evan
    Long, Jonathan
    Darrell, Trevor
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 640 - 651
  • [33] BINARY SEGMENTATION BASED CLASS EXTENSION IN SEMANTIC IMAGE SEGMENTATION USING CONVOLUTIONAL NEURAL NETWORKS
    Wang, Chunlai
    Yu, Jiawei
    Mauch, Lukas
    Yang, Bin
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2232 - 2236
  • [34] SEMANTIC SEGMENTATION OF THE GROWTH STAGES OF PLASMODIUM PARASITES USING CONVOLUTIONAL NEURAL NETWORKS
    Aladago, Maxwell Mbailla
    Torresani, Lorenzo
    Rosca, Elena V.
    [J]. 2019 IEEE AFRICON, 2019,
  • [35] Multi-scale deep context convolutional neural networks for semantic segmentation
    Quan Zhou
    Wenbing Yang
    Guangwei Gao
    Weihua Ou
    Huimin Lu
    Jie Chen
    Longin Jan Latecki
    [J]. World Wide Web, 2019, 22 : 555 - 570
  • [36] Semantic Segmentation System of Pigmented Skin Lesions Based on Convolutional Neural Networks
    Fedorenko, Vladimir V.
    Lyakhova, Ulyana A.
    Nagornov, Nikolay N.
    Efimenko, Georgii A.
    Kaplun, Dmitrii, I
    [J]. 2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 589 - 593
  • [37] Leveraging convolutional neural networks for semantic segmentation of global floods with PlanetScope imagery
    Leach, Nicholas R.
    Popien, Philip
    Goodman, Maxwell C.
    Tellman, Beth
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 314 - 317
  • [38] Image Semantic Segmentation Based on Convolutional Neural Networks for Monitoring Agricultural Vegetation
    Ganchenko, Valentin
    Doudkin, Alexander
    [J]. PATTERN RECOGNITION AND INFORMATION PROCESSING, PRIP 2019, 2019, 1055 : 52 - 63
  • [39] A multi-scale strategy for deep semantic segmentation with convolutional neural networks
    Zhao, Bonan
    Zhang, Xiaoshan
    Li, Zheng
    Hu, Xianliang
    [J]. NEUROCOMPUTING, 2019, 365 : 273 - 284
  • [40] GRNet: Deep Convolutional Neural Networks based on Graph Reasoning for Semantic Segmentation
    Wu, Yang
    Jiang, Aimin
    Tang, Yibin
    Kwan, Hon Keung
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 116 - 119