Human EEG and Recurrent Neural Networks Exhibit Common Temporal Dynamics During Speech Recognition

Cited by: 4

Authors:

Hashemnia, Saeedeh [1]

Grasse, Lukas [1]

Soni, Shweta [1]

Tata, Matthew S. [1]

Affiliation:

[1] Univ Lethbridge, Dept Neurosci, Canadian Ctr Behav Neurosci, Lethbridge, AB, Canada

Source:

FRONTIERS IN SYSTEMS NEUROSCIENCE | 2021 / Vol. 15

Funding:

Natural Sciences and Engineering Research Council of Canada (NSERC);

Keywords:

EEG; artificial neural network; speech tracking; auditory; theta; recurrent; RNN; PHASE PATTERNS; OSCILLATIONS; COMPREHENSION; ENTRAINMENT; RESPONSES; DEPENDS; DELTA; THETA;

DOI:

10.3389/fnsys.2021.617605

Chinese Library Classification (CLC) number:

Q189 [Neuroscience];

Discipline classification code:

071006;

Abstract:

Recent deep-learning artificial neural networks have shown remarkable success in recognizing natural human speech; however, the reasons for their success are not entirely understood. Their success might arise because state-of-the-art networks use recurrent layers or dilated convolutional layers that enable the network to use a time-dependent feature space. The importance of time-dependent features in human cortical mechanisms of speech perception, measured by electroencephalography (EEG) and magnetoencephalography (MEG), has also been of particular recent interest. It is possible that recurrent neural networks (RNNs) achieve their success by emulating aspects of cortical dynamics, albeit through very different computational mechanisms. In that case, we should observe commonalities in the temporal dynamics of deep-learning models, particularly in recurrent layers, and brain electrical activity (EEG) during speech perception. We explored this prediction by presenting the same sentences to both human listeners and the Deep Speech RNN and considered the temporal dynamics of the EEG and RNN units for identical sentences. We tested whether the recently discovered phenomenon of envelope phase tracking in the human EEG is also evident in RNN hidden layers. We furthermore predicted that the clustering of dissimilarity between model representations of pairs of stimuli would be similar in both RNN and EEG dynamics. We found that the dynamics of both the recurrent layer of the network and human EEG signals exhibit envelope phase tracking with similar time lags. We also computed the representational distance matrices (RDMs) of brain and network responses to speech stimuli. The model RDMs became more similar to the brain RDM when going from early network layers to later ones, and eventually peaked at the recurrent layer. These results suggest that the Deep Speech RNN captures a representation of temporal features of speech in a manner similar to the human brain.
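
The RDM comparison mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which distance measure or comparison statistic the authors used, so the choices here (1 minus Pearson correlation as the distance, a rank correlation over the upper triangle as the similarity score) are assumptions, and the data are synthetic stand-ins rather than EEG recordings or Deep Speech activations. All function and variable names are hypothetical.

```python
import numpy as np


def rdm(responses):
    """Representational distance matrix for an (n_stimuli, n_features) array.

    Assumed distance: 1 - Pearson correlation between the response
    patterns evoked by each pair of stimuli.
    """
    return 1.0 - np.corrcoef(responses)


def upper_triangle(matrix):
    """Return the off-diagonal upper-triangle entries as a flat vector."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]


def ordinal_rank(values):
    """Ordinal ranks (ties are not averaged; adequate for continuous data)."""
    return np.argsort(np.argsort(values)).astype(float)


def rdm_similarity(rdm_a, rdm_b):
    """Rank correlation between the upper triangles of two RDMs."""
    a = ordinal_rank(upper_triangle(rdm_a))
    b = ordinal_rank(upper_triangle(rdm_b))
    return float(np.corrcoef(a, b)[0, 1])


# Synthetic stand-ins: 8 stimuli, 200 features per response pattern.
rng = np.random.default_rng(0)
eeg_like = rng.standard_normal((8, 200))
layer_like = eeg_like + 0.5 * rng.standard_normal((8, 200))  # noisy copy

brain_rdm = rdm(eeg_like)
model_rdm = rdm(layer_like)
score = rdm_similarity(brain_rdm, model_rdm)
print(f"RDM similarity: {score:.3f}")
```

Under these assumptions, repeating the last step once per network layer would give a per-layer similarity profile of the kind the abstract describes.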


Pages: 12
