Human EEG and Recurrent Neural Networks Exhibit Common Temporal Dynamics During Speech Recognition

被引:4
|
作者
Hashemnia, Saeedeh [1 ]
Grasse, Lukas [1 ]
Soni, Shweta [1 ]
Tata, Matthew S. [1 ]
机构
[1] Univ Lethbridge, Dept Neurosci, Canadian Ctr Behav Neurosci, Lethbridge, AB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
EEG; artificial neural network; speech tracking; auditory; theta; recurrent; RNN; PHASE PATTERNS; OSCILLATIONS; COMPREHENSION; ENTRAINMENT; RESPONSES; DEPENDS; DELTA; THETA;
D O I
10.3389/fnsys.2021.617605
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Recent deep-learning artificial neural networks have shown remarkable success in recognizing natural human speech, however the reasons for their success are not entirely understood. Success of these methods might be because state-of-the-art networks use recurrent layers or dilated convolutional layers that enable the network to use a time-dependent feature space. The importance of time-dependent features in human cortical mechanisms of speech perception, measured by electroencephalography (EEG) and magnetoencephalography (MEG), have also been of particular recent interest. It is possible that recurrent neural networks (RNNs) achieve their success by emulating aspects of cortical dynamics, albeit through very different computational mechanisms. In that case, we should observe commonalities in the temporal dynamics of deep-learning models, particularly in recurrent layers, and brain electrical activity (EEG) during speech perception. We explored this prediction by presenting the same sentences to both human listeners and the Deep Speech RNN and considered the temporal dynamics of the EEG and RNN units for identical sentences. We tested whether the recently discovered phenomenon of envelope phase tracking in the human EEG is also evident in RNN hidden layers. We furthermore predicted that the clustering of dissimilarity between model representations of pairs of stimuli would be similar in both RNN and EEG dynamics. We found that the dynamics of both the recurrent layer of the network and human EEG signals exhibit envelope phase tracking with similar time lags. We also computed the representational distance matrices (RDMs) of brain and network responses to speech stimuli. The model RDMs became more similar to the brain RDM when going from early network layers to later ones, and eventually peaked at the recurrent layer. These results suggest that the Deep Speech RNN captures a representation of temporal features of speech in a manner similar to human brain.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Temporal Feedback Convolutional Recurrent Neural Networks for Speech Command Recognition
    Kim, Taejun
    Nam, Juhan
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 437 - 441
  • [2] RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION
    VERDEJO, JED
    HERREROS, AP
    LUNA, JCS
    ORTUZAR, MCB
    AYUSO, AR
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 540 : 361 - 369
  • [3] Speech Recognition with Temporal Neural Networks
    Lin, Payton
    Lyu, Dau-Cheng
    Chang, Yun-Fan
    Tsao, Yu
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 21 - 25
  • [4] Investigating the temporal dynamics of electroencephalogram (EEG) microstates using recurrent neural networks
    Sikka, Apoorva
    Jamalabadi, Hamidreza
    Krylova, Marina
    Alizadeh, Sarah
    van der Meer, Johan N.
    Danyeli, Lena
    Deliano, Matthias
    Vicheva, Petya
    Hahn, Tim
    Koenig, Thomas
    Bathula, Deepti R.
    Walter, Martin
    HUMAN BRAIN MAPPING, 2020, 41 (09) : 2334 - 2346
  • [5] SPEECH RECOGNITION WITH HIERARCHICAL RECURRENT NEURAL NETWORKS
    CHEN, WY
    LIAO, YF
    CHEN, SH
    PATTERN RECOGNITION, 1995, 28 (06) : 795 - 805
  • [6] Visual speech recognition by recurrent neural networks
    Rabi, G
    Lu, SW
    JOURNAL OF ELECTRONIC IMAGING, 1998, 7 (01) : 61 - 69
  • [7] Visual speech recognition by recurrent neural networks
    Rabi, G
    Lu, SW
    1997 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS I AND II: ENGINEERING INNOVATION: VOYAGE OF DISCOVERY, 1997, : 55 - 58
  • [8] Unfolded Recurrent Neural Networks for Speech Recognition
    Saon, George
    Soltau, Hagen
    Emami, Ahmad
    Picheny, Michael
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 343 - 347
  • [9] SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS
    Graves, Alex
    Mohamed, Abdel-rahman
    Hinton, Geoffrey
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6645 - 6649
  • [10] Speech recognition with hierarchical recurrent neural networks
    Department of Communication Engineering, National Chiao Tung University, Hsinchu, Taiwan
    Pattern Recognit, 6 (795-805):