Improved modeling of human vision by incorporating robustness to blur in convolutional neural networks

被引：1

作者：

Jang, Hojin ^{[1
,2
,3
]}

Tong, Frank ^{[1
]}

机构：

[1] Vanderbilt Univ, Vanderbilt Vis Res Ctr, Dept Psychol, Nashville, TN 37235 USA

[2] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02193 USA

[3] Korea Univ, Dept Brain & Cognit Engn, Seoul, South Korea

来源：

NATURE COMMUNICATIONS | 2024年 / 15卷 / 01期

基金：

美国国家卫生研究院;

关键词：

REPRESENTATIONS; RESPONSES; DYNAMICS; STIMULI; OBJECTS; CORTEX; AREA;

D O I：

10.1038/s41467-024-45679-0

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Whenever a visual scene is cast onto the retina, much of it will appear degraded due to poor resolution in the periphery; moreover, optical defocus can cause blur in central vision. However, the pervasiveness of blurry or degraded input is typically overlooked in the training of convolutional neural networks (CNNs). We hypothesized that the absence of blurry training inputs may cause CNNs to rely excessively on high spatial frequency information for object recognition, thereby causing systematic deviations from biological vision. We evaluated this hypothesis by comparing standard CNNs with CNNs trained on a combination of clear and blurry images. We show that blur-trained CNNs outperform standard CNNs at predicting neural responses to objects across a variety of viewing conditions. Moreover, blur-trained CNNs acquire increased sensitivity to shape information and greater robustness to multiple forms of visual noise, leading to improved correspondence with human perception. Our results provide multi-faceted neurocomputational evidence that blurry visual experiences may be critical for conferring robustness to biological visual systems. The phenomenon of blurry or degraded visual input in humans has been overlooked in the training of convolutional neural networks (CNNs). Here, the authors show that blur-trained CNNs outperform standard CNNs in predicting neural responses to objects and show improved correspondence with human perception.

引用

页数：14

共 50 条

[31] A Simple Convolutional Transfer Neural Networks in Vision Tasks
Wu, Wenlei
Lin, Zhaohang
Ding, Xinghao
Huang, Yue
[J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT IV, 2017, 10637 : 385 - 392
[32] A improved pooling method for convolutional neural networks
Zhao, Lei
Zhang, Zhonglin
[J]. SCIENTIFIC REPORTS, 2024, 14 (01)
[33] A improved pooling method for convolutional neural networks
Lei Zhao
Zhonglin Zhang
[J]. Scientific Reports, 14
[34] Analyzing the efficiency and robustness of deep convolutional neural networks for modeling natural convection in heterogeneous porous media
Rajabi, Mohammad Mahdi
Javaran, Mohammad Reza Hajizadeh
Bah, Amadou-oury
Frey, Gabriel
Le Ber, Florence
Lehmann, Francois
Fahs, Marwan
[J]. INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2022, 183
[35] On the Correspondence between Human Vision and Convolutional Neural Networks: A Visual Quality Assessment Perspective
Mahmoudpour, Saeed
Schelkens, Peter
[J]. 2023 15TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX, 2023, : 153 - 158
[36] Improved One-Dimensional Convolutional Neural Networks for Human Motion Recognition
Wang, Shengzhi
Xiao, Shuo
Huang, Zhenzhen
Xu, Zhiou
Chen, Wei
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2544 - 2547
[37] Incorporating word attention with convolutional neural networks for abstractive summarization
Yuan, Chengzhe
Bao, Zhifeng
Sanderson, Mark
Tang, Yong
[J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (01): : 267 - 287
[38] Incorporating word attention with convolutional neural networks for abstractive summarization
Chengzhe Yuan
Zhifeng Bao
Mark Sanderson
Yong Tang
[J]. World Wide Web, 2020, 23 : 267 - 287
[39] Multiple inputs modeling of hybrid convolutional neural networks for human activity recognition
Lai, Yi-Chun
Kan, Yao-Chiang
Hsu, Kai-Cheng
Lin, Hsueh-Chun
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
[40] Occlusion-Robustness of Convolutional Neural Networks via Inverted Cutout
Koerschens, Matthias
Bodesheim, Paul
Denzler, Joachim
[J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2829 - 2835

← 1 2 3 4 5 →