Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks

被引:1
|
作者
Jang, Hojin [1 ,2 ,3 ]
Tong, Frank [1 ]
机构
[1] Vanderbilt Univ, Vanderbilt Vis Res Ctr, Dept Psychol, Nashville, TN 37235 USA
[2] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02193 USA
[3] Korea Univ, Dept Brain & Cognit Engn, Seoul, South Korea
基金
美国国家卫生研究院;
关键词
REPRESENTATIONS; RESPONSES; DYNAMICS; STIMULI; OBJECTS; CORTEX; AREA;
D O I
10.1038/s41467-024-45679-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Whenever a visual scene is cast onto the retina, much of it will appear degraded due to poor resolution in the periphery; moreover, optical defocus can cause blur in central vision. However, the pervasiveness of blurry or degraded input is typically overlooked in the training of convolutional neural networks (CNNs). We hypothesized that the absence of blurry training inputs may cause CNNs to rely excessively on high spatial frequency information for object recognition, thereby causing systematic deviations from biological vision. We evaluated this hypothesis by comparing standard CNNs with CNNs trained on a combination of clear and blurry images. We show that blur-trained CNNs outperform standard CNNs at predicting neural responses to objects across a variety of viewing conditions. Moreover, blur-trained CNNs acquire increased sensitivity to shape information and greater robustness to multiple forms of visual noise, leading to improved correspondence with human perception. Our results provide multi-faceted neurocomputational evidence that blurry visual experiences may be critical for conferring robustness to biological visual systems. The phenomenon of blurry or degraded visual input in humans has been overlooked in the training of convolutional neural networks (CNNs). Here, the authors show that blur-trained CNNs outperform standard CNNs in predicting neural responses to objects and show improved correspondence with human perception.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A Simple Convolutional Transfer Neural Networks in Vision Tasks
    Wu, Wenlei
    Lin, Zhaohang
    Ding, Xinghao
    Huang, Yue
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 385 - 392
  • [32] A improved pooling method for convolutional neural networks
    Zhao, Lei
    Zhang, Zhonglin
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [33] A improved pooling method for convolutional neural networks
    Lei Zhao
    Zhonglin Zhang
    [J]. Scientific Reports, 14
  • [34] Analyzing the efficiency and robustness of deep convolutional neural networks for modeling natural convection in heterogeneous porous media
    Rajabi, Mohammad Mahdi
    Javaran, Mohammad Reza Hajizadeh
    Bah, Amadou-oury
    Frey, Gabriel
    Le Ber, Florence
    Lehmann, Francois
    Fahs, Marwan
    [J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2022, 183
  • [35] On the Correspondence between Human Vision and Convolutional Neural Networks: A Visual Quality Assessment Perspective
    Mahmoudpour, Saeed
    Schelkens, Peter
    [J]. 2023 15TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX, 2023, : 153 - 158
  • [36] Improved One-Dimensional Convolutional Neural Networks for Human Motion Recognition
    Wang, Shengzhi
    Xiao, Shuo
    Huang, Zhenzhen
    Xu, Zhiou
    Chen, Wei
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2544 - 2547
  • [37] Incorporating word attention with convolutional neural networks for abstractive summarization
    Yuan, Chengzhe
    Bao, Zhifeng
    Sanderson, Mark
    Tang, Yong
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (01): : 267 - 287
  • [38] Incorporating word attention with convolutional neural networks for abstractive summarization
    Chengzhe Yuan
    Zhifeng Bao
    Mark Sanderson
    Yong Tang
    [J]. World Wide Web, 2020, 23 : 267 - 287
  • [39] Multiple inputs modeling of hybrid convolutional neural networks for human activity recognition
    Lai, Yi-Chun
    Kan, Yao-Chiang
    Hsu, Kai-Cheng
    Lin, Hsueh-Chun
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [40] Occlusion-Robustness of Convolutional Neural Networks via Inverted Cutout
    Koerschens, Matthias
    Bodesheim, Paul
    Denzler, Joachim
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2829 - 2835