Convolutional neural networks rarely learn shape for semantic segmentation

被引：1

作者：

Zhang, Yixin ^{[1
,5
]}

Mazurowski, Maciej A. ^{[1
,2
,3
,4
,6
]}

机构：

[1] Duke Univ, Dept Elect & Comp Engn, Durham, NC USA

[2] Duke Univ, Dept Radiol, Durham, NC USA

[3] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA

[4] Duke Univ, Dept Comp Sci, Durham, NC USA

[5] 2424 Erwin Rd,Off 10072, Durham, NC 27705 USA

[6] Box 2731 Med Ctr, Durham, NC 27710 USA

来源：

PATTERN RECOGNITION | 2024年 / 146卷

基金：

美国国家卫生研究院;

关键词：

Segmentation; Feature measurement; Machine learning; Computer vision; INFORMATION;

D O I：

10.1016/j.patcog.2023.110018

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Shape learning, or the ability to leverage shape information, could be a desirable property of convolutional neural networks (CNNs) when target objects have specific shapes. While some research on the topic is emerging, there is no systematic study to conclusively determine whether and under what circumstances CNNs learn shape. Here, we present such a study in the context of segmentation networks where shapes are particularly important. We define shape and propose a new behavioral metric to measure the extent to which a CNN utilizes shape information. We then execute a set of experiments with synthetic and real-world data to progressively uncover under which circumstances CNNs learn shape and what can be done to encourage such behavior. We conclude that (i) CNNs do not learn shape in typical settings but rather rely on other features available to identify the objects of interest, (ii) CNNs can learn shape, but only if the shape is the only feature available to identify the object, (iii) sufficiently large receptive field size relative to the size of target objects is necessary for shape learning; (iv) a limited set of augmentations can encourage shape learning; (v) learning shape is indeed useful in the presence of out-of-distribution data.

引用

页数：13

共 50 条

[31] Fully Convolutional Networks for Semantic Segmentation
Long, Jonathan
Shelhamer, Evan
Darrell, Trevor
[J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3431 - 3440
[32] Fully Convolutional Networks for Semantic Segmentation
Shelhamer, Evan
Long, Jonathan
Darrell, Trevor
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 640 - 651
[33] BINARY SEGMENTATION BASED CLASS EXTENSION IN SEMANTIC IMAGE SEGMENTATION USING CONVOLUTIONAL NEURAL NETWORKS
Wang, Chunlai
Yu, Jiawei
Mauch, Lukas
Yang, Bin
[J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2232 - 2236
[34] SEMANTIC SEGMENTATION OF THE GROWTH STAGES OF PLASMODIUM PARASITES USING CONVOLUTIONAL NEURAL NETWORKS
Aladago, Maxwell Mbailla
Torresani, Lorenzo
Rosca, Elena V.
[J]. 2019 IEEE AFRICON, 2019,
[35] Multi-scale deep context convolutional neural networks for semantic segmentation
Quan Zhou
Wenbing Yang
Guangwei Gao
Weihua Ou
Huimin Lu
Jie Chen
Longin Jan Latecki
[J]. World Wide Web, 2019, 22 : 555 - 570
[36] Semantic Segmentation System of Pigmented Skin Lesions Based on Convolutional Neural Networks
Fedorenko, Vladimir V.
Lyakhova, Ulyana A.
Nagornov, Nikolay N.
Efimenko, Georgii A.
Kaplun, Dmitrii, I
[J]. 2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 589 - 593
[37] Leveraging convolutional neural networks for semantic segmentation of global floods with PlanetScope imagery
Leach, Nicholas R.
Popien, Philip
Goodman, Maxwell C.
Tellman, Beth
[J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 314 - 317
[38] Image Semantic Segmentation Based on Convolutional Neural Networks for Monitoring Agricultural Vegetation
Ganchenko, Valentin
Doudkin, Alexander
[J]. PATTERN RECOGNITION AND INFORMATION PROCESSING, PRIP 2019, 2019, 1055 : 52 - 63
[39] A multi-scale strategy for deep semantic segmentation with convolutional neural networks
Zhao, Bonan
Zhang, Xiaoshan
Li, Zheng
Hu, Xianliang
[J]. NEUROCOMPUTING, 2019, 365 : 273 - 284
[40] GRNet: Deep Convolutional Neural Networks based on Graph Reasoning for Semantic Segmentation
Wu, Yang
Jiang, Aimin
Tang, Yibin
Kwan, Hon Keung
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 116 - 119

← 1 2 3 4 5 →