Cracking the neural code for word recognition in convolutional neural networks

被引：0

作者：

Agrawal, Aakash ^{[1
]}

Dehaene, Stanislas ^{[1
,2
]}

机构：

[1] Univ Paris Saclay, NeuroSpin Ctr, INSERM U 992, Cognit Neuroimaging Unit,CEA, Gif Sur Yvette, France

[2] Univ Paris Sci Lettres PSL, Coll France, Paris, France

来源：

PLOS COMPUTATIONAL BIOLOGY | 2024年 / 20卷 / 09期

基金：

欧盟地平线“2020”;

关键词：

LETTER POSITION; WRITTEN WORDS; REPRESENTATION; MODEL; PERCEPTION; INSIGHTS; CORTEX; SPACE; FMRI;

D O I：

10.1371/journal.pcbi.1012430

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Learning to read places a strong challenge on the visual system. Years of expertise lead to a remarkable capacity to separate similar letters and encode their relative positions, thus distinguishing words such as FORM and FROM, invariantly over a large range of positions, sizes and fonts. How neural circuits achieve invariant word recognition remains unknown. Here, we address this issue by recycling deep neural network models initially trained for image recognition. We retrain them to recognize written words and then analyze how reading-specialized units emerge and operate across the successive layers. With literacy, a small subset of units becomes specialized for word recognition in the learned script, similar to the visual word form area (VWFA) in the human brain. We show that these units are sensitive to specific letter identities and their ordinal position from the left or the right of a word. The transition from retinotopic to ordinal position coding is achieved by a hierarchy of "space bigram" unit that detect the position of a letter relative to a blank space and that pool across low- and high-frequency-sensitive units from early layers of the network. The proposed scheme provides a plausible neural code for written words in the VWFA, and leads to predictions for reading behavior, error patterns, and the neurophysiology of reading. Reading is a fundamental skill in modern society, yet the neural mechanisms that allow us to quickly recognize words remain poorly understood. Our research aims to unravel how the brain achieves invariant word recognition-the ability to recognize words regardless of their position, size, or font. We studied artificial neural networks trained to recognize words, mirroring human learning. Our findings reveal that these networks develop specialized units for word recognition, similar to the Visual Word Form Area in the human brain. These units are sensitive to specific letters and their positions within a word. Crucially, we discovered that they achieve this by detecting the spaces around words as reference points. This creates a hierarchical system where early layers detect basic features and spaces, while higher layers combine this information to recognize specific letters at certain positions relative to word edges. This "space bigram" model reconciles previous theories of letter bigrams and letter-position coding. Our results suggest that most written languages may be processed using similar basic principles. This understanding could inform better methods for teaching reading and treating reading disorders.

引用

页数：22

共 50 条

[41] CONVOLUTIONAL NEURAL NETWORKS FOR NOISE SIGNAL RECOGNITION
Portsev, Ruslan J.
Makarenko, Andrey V.
2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
[42] Speech Recognition Based on Convolutional Neural Networks
Du Guiming
Wang Xia
Wang Guangyan
Zhang Yan
Li Dan
2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 708 - 711
[43] Recognition of flowers using convolutional neural networks
Alkhonin, Abdulrahman
Almutairi, Abdulelah
Alburaidi, Abdulmajeed
Saudagar, Abdul Khader Jilani
INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2020, 8 (03) : 186 - 197
[44] Word Difficulty Prediction Using Convolutional Neural Networks
Basu, Arpan
Garain, Avishek
Naskar, Sudip Kumar
PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 1109 - 1112
[45] ITERATED DILATED CONVOLUTIONAL NEURAL NETWORKS FOR WORD SEGMENTATION
He, H.
Yang, X.
Wu, L.
Wang, G.
NEURAL NETWORK WORLD, 2020, 30 (05) : 333 - 346
[46] QR Code Detection Using Convolutional Neural Networks
Chou, Tzu-Han
Ho, Chuan-Sheng
Kuo, Yan-Fu
2015 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2015,
[47] Convolutional Neural Networks for Classification of Malware Assembly Code
Gibert, Daniel
Bejar, Javier
Mateu, Carles
Planes, Jordi
Solis, Daniel
Vicens, Ramon
RECENT ADVANCES IN ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2017, 300 : 221 - 226
[48] LDPC Code Classification using Convolutional Neural Networks
Comar, Bradley
2020 29TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC), 2020, : 115 - 120
[49] Code authorship identification using convolutional neural networks
Abuhamad, Mohammed
Rhim, Ji-su
AbuHmed, Tamer
Ullah, Sana
Kang, Sanggil
Nyang, DaeHun
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 : 104 - 115
[50] CoNCRA: A Convolutional Neural Networks Code Retrieval Approach
Martins, Marcelo de Rezende
Gerosa, Marco Aurelio
34TH BRAZILIAN SYMPOSIUM ON SOFTWARE ENGINEERING, SBES 2020, 2020, : 526 - 531

← 1 2 3 4 5 →