Depth in convolutional neural networks solves scene segmentation

被引:11
|
作者
Seijdel, Noor [1 ,2 ]
Tsakmakidis, Nikos [3 ]
de Haan, Edward H. F. [1 ,2 ]
Bohte, Sander M. [3 ]
Scholte, H. Steven [1 ,2 ]
机构
[1] Univ Amsterdam, Dept Psychol, Amsterdam, Netherlands
[2] Univ Amsterdam, Amsterdam Brain & Cognit ABC Ctr, Amsterdam, Netherlands
[3] Machine Learning Grp, Centrum Wiskunde Informat, Amsterdam, Netherlands
基金
欧洲研究理事会;
关键词
OBJECT RECOGNITION; NATURAL SCENES; CATEGORIZATION; CONTEXT; TOP; FEEDFORWARD; CONSISTENCY; VISION;
D O I
10.1371/journal.pcbi.1008022
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Feed-forward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and even surpassing human performance in object recognition in natural scenes. This performance suggests that the analysis of a loose collection of image features could support the recognition of natural object categories, without dedicated systems to solve specific visual subtasks. Research in humans however suggests that while feedforward activity may suffice for sparse scenes with isolated objects, additional visual operations ('routines') that aid the recognition process (e.g. segmentation or grouping) are needed for more complex scenes. Linking human visual processing to performance of DCNNs with increasing depth, we here explored if, how, and when object information is differentiated from the backgrounds they appear on. To this end, we controlled the information in both objects and backgrounds, as well as the relationship between them by adding noise, manipulating background congruence and systematically occluding parts of the image. Results indicate that with an increase in network depth, there is an increase in the distinction between object-and background information. For more shallow networks, results indicated a benefit of training on segmented objects. Overall, these results indicate that, de facto, scene segmentation can be performed by a network of sufficient depth. We conclude that the human brain could perform scene segmentation in the context of object identification without an explicit mechanism, by selecting or "binding" features that belong to the object and ignoring other features, in a manner similar to a very deep convolutional neural network.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Road Scene Depth Estimation Based on Deep Convolutional Neural Networks
    Yuan Jianzhong
    Zhou Wujie
    Pan Ting
    Gu Pengli
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (08)
  • [2] A Semantic-based Scene segmentation using convolutional neural networks
    Shaaban, Aya M.
    Salem, Nancy M.
    Al-atabany, Walid, I
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2020, 125
  • [3] Traffic Scene Semantic Segmentation by Using Several Deep Convolutional Neural Networks
    Kherraki, Amine
    Maqbool, Muaz
    El Ouazzani, Rajae
    [J]. 2021 3RD IEEE MIDDLE EAST AND NORTH AFRICA COMMUNICATIONS CONFERENCE (MENACOMM), 2021, : 1 - 6
  • [4] Improved Convolutional Neural Network for Traffic Scene Segmentation
    Xu, Fuliang
    Luo, Yong
    Sun, Chuanlong
    Zhao, Hong
    [J]. CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (03): : 2691 - 2708
  • [5] Recurrent Convolutional Neural Networks for Scene Labeling
    Pinheiro, Pedro O.
    Collobert, Ronan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 1), 2014, 32
  • [6] Aerial Scene Classification with Convolutional Neural Networks
    Jia, Sibo
    Liu, Huaping
    Sun, Fuchun
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377 : 258 - 265
  • [7] Scene Disparity Estimation with Convolutional Neural Networks
    Anas, Essa R.
    Guo, Li
    Onsy, Ahmed
    Matuszewski, Bogdan J.
    [J]. MULTIMODAL SENSING: TECHNOLOGIES AND APPLICATIONS, 2019, 11059
  • [8] Combining belief networks and neural networks for scene segmentation
    Feng, XJ
    Williams, CKI
    Felderhof, SN
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) : 467 - 483
  • [9] Do Deep Convolutional Neural Networks Perform Scene Segmentation in a Similar Way Humans Do?
    Seijdel, Noor
    Tsakmakidis, Nikos
    de Haan, Edward H. F.
    Bohte, Sander M.
    Scholte, H. Steven
    [J]. PERCEPTION, 2019, 48 : 77 - 78
  • [10] AN APPLICATION OF NEURAL NETWORKS TO NATURAL SCENE SEGMENTATION
    VICENS, M
    ALBERT, J
    ARNAU, V
    [J]. LECTURE NOTES IN COMPUTER SCIENCE, 1991, 540 : 333 - 339