Depth in convolutional neural networks solves scene segmentation

被引:11
|
作者
Seijdel, Noor [1 ,2 ]
Tsakmakidis, Nikos [3 ]
de Haan, Edward H. F. [1 ,2 ]
Bohte, Sander M. [3 ]
Scholte, H. Steven [1 ,2 ]
机构
[1] Univ Amsterdam, Dept Psychol, Amsterdam, Netherlands
[2] Univ Amsterdam, Amsterdam Brain & Cognit ABC Ctr, Amsterdam, Netherlands
[3] Machine Learning Grp, Centrum Wiskunde Informat, Amsterdam, Netherlands
基金
欧洲研究理事会;
关键词
OBJECT RECOGNITION; NATURAL SCENES; CATEGORIZATION; CONTEXT; TOP; FEEDFORWARD; CONSISTENCY; VISION;
D O I
10.1371/journal.pcbi.1008022
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Feed-forward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and even surpassing human performance in object recognition in natural scenes. This performance suggests that the analysis of a loose collection of image features could support the recognition of natural object categories, without dedicated systems to solve specific visual subtasks. Research in humans however suggests that while feedforward activity may suffice for sparse scenes with isolated objects, additional visual operations ('routines') that aid the recognition process (e.g. segmentation or grouping) are needed for more complex scenes. Linking human visual processing to performance of DCNNs with increasing depth, we here explored if, how, and when object information is differentiated from the backgrounds they appear on. To this end, we controlled the information in both objects and backgrounds, as well as the relationship between them by adding noise, manipulating background congruence and systematically occluding parts of the image. Results indicate that with an increase in network depth, there is an increase in the distinction between object-and background information. For more shallow networks, results indicated a benefit of training on segmented objects. Overall, these results indicate that, de facto, scene segmentation can be performed by a network of sufficient depth. We conclude that the human brain could perform scene segmentation in the context of object identification without an explicit mechanism, by selecting or "binding" features that belong to the object and ignoring other features, in a manner similar to a very deep convolutional neural network.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
    Mousavian, Arsalan
    Pirsiavash, Hamed
    Kosecka, Jana
    [J]. PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 611 - 619
  • [32] Convolutional Networks with Bracket-style Decoder for Semantic Scene Segmentation
    Hua, Cam-Hao
    Thien Huynh-The
    Lee, Sungyoung
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2980 - 2985
  • [33] Image Semantic Segmentation Based on Depth Parallel Convolutional Networks
    Qin, Zi-yang
    [J]. 2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATIONS AND MECHATRONICS ENGINEERING (CCME 2018), 2018, 332 : 239 - 243
  • [34] Sentiment Prediction in Scene Images via Convolutional Neural Networks
    Yao, Junfeng
    Yu, Yao
    Xue, Xiaoling
    [J]. 2016 31ST YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2016, : 196 - 200
  • [35] Depth-based Subgraph Convolutional Neural Networks
    Xu, Chuanyu
    Wang, Dong
    Zhang, Zhihong
    Wang, Beizhan
    Zhou, Da
    Ren, Guijun
    Bai, Lu
    Cui, Lixin
    Hancock, Edwin R.
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1024 - 1029
  • [36] Siamese Convolutional Neural Networks for Remote Sensing Scene Classification
    Liu, Xuning
    Zhou, Yong
    Zhao, Jiaqi
    Yao, Rui
    Liu, Bing
    Zheng, Yi
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (08) : 1200 - 1204
  • [37] Outdoor Scene Labeling Using Deep Convolutional Neural Networks
    Wen Jun
    Zhong Chaolliang
    Liu Shirong
    Wang Jian
    [J]. 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3953 - 3958
  • [38] Dance Art Scene Classification Based on Convolutional Neural Networks
    Li, Le
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [39] Background Subtraction on Depth Videos with Convolutional Neural Networks
    Wang, Xueying
    Liu, Lei
    Li, Guangli
    Dong, Xiao
    Zhao, Peng
    Feng, Xiaobing
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [40] Natural Scene Digit Classification Using Convolutional Neural Networks
    Wang, Ziqin
    Jiang, Peilin
    Zhang, Xuetao
    Wang, Fei
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 311 - 321