Dissecting neural computations in the human auditory pathway using deep neural networks for speech

被引:0
|
作者
Yuanning Li
Gopala K. Anumanchipalli
Abdelrahman Mohamed
Peili Chen
Laurel H. Carney
Junfeng Lu
Jinsong Wu
Edward F. Chang
机构
[1] University of California,Department of Neurological Surgery
[2] San Francisco,Weill Institute for Neurosciences
[3] University of California,Department of Electrical Engineering and Computer Science
[4] San Francisco,School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materialsand Devices
[5] University of California,Department of Biomedical Engineering
[6] Berkeley,Neurologic Surgery Department, Huashan Hospital, Shanghai Medical College
[7] Meta AI Research,Brain Function Laboratory, Neurosurgical Institute
[8] ShanghaiTech University,School of Biomedical Engineering & State Key Laboratory of Advanced Medical Materials and Devices
[9] University of Rochester,undefined
[10] Fudan University,undefined
[11] Fudan University,undefined
[12] ShanghaiTech University,undefined
来源
Nature Neuroscience | 2023年 / 26卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The human auditory system extracts rich linguistic abstractions from speech signals. Traditional approaches to understanding this complex process have used linear feature-encoding models, with limited success. Artificial neural networks excel in speech recognition tasks and offer promising computational models of speech processing. We used speech representations in state-of-the-art deep neural network (DNN) models to investigate neural coding from the auditory nerve to the speech cortex. Representations in hierarchical layers of the DNN correlated well with the neural activity throughout the ascending auditory system. Unsupervised speech models performed at least as well as other purely supervised or fine-tuned models. Deeper DNN layers were better correlated with the neural activity in the higher-order auditory cortex, with computations aligned with phonemic and syllabic structures in speech. Accordingly, DNN models trained on either English or Mandarin predicted cortical responses in native speakers of each language. These results reveal convergence between DNN model representations and the biological auditory pathway, offering new approaches for modeling neural coding in the auditory cortex.
引用
收藏
页码:2213 / 2225
页数:12
相关论文
共 50 条
  • [1] Dissecting neural computations in the human auditory pathway using deep neural networks for speech
    Li, Yuanning
    Anumanchipalli, Gopala K.
    Mohamed, Abdelrahman
    Chen, Peili
    Carney, Laurel H.
    Lu, Junfeng
    Wu, Jinsong
    Chang, Edward F.
    NATURE NEUROSCIENCE, 2023, 26 (12) : 2213 - 2225
  • [2] Speech Reconstruction from Human Auditory Cortex with Deep Neural Networks
    Yang, Minda
    Sheth, Sameer A.
    Schevon, Catherine A.
    McKhann, Guy M., II
    Mesgarani, Nima
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1121 - 1125
  • [3] Speech watermarking using Deep Neural Networks
    Pavlovic, Kosta
    Kovacevic, Slavko
    Durovic, Igor
    2020 28TH TELECOMMUNICATIONS FORUM (TELFOR), 2020, : 292 - 295
  • [4] Speech Activity Detection Using Deep Neural Networks
    Shahsavari, Sajad
    Sameti, Hossein
    Hadian, Hossein
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1564 - 1568
  • [5] Emotional Speech Recognition Using Deep Neural Networks
    Trinh Van, Loan
    Dao Thi Le, Thuy
    Le Xuan, Thanh
    Castelli, Eric
    SENSORS, 2022, 22 (04)
  • [6] SPEECH ENHANCEMENT USING MULTIPLE DEEP NEURAL NETWORKS
    Karjol, Pavan
    Kumar, Ajay M.
    Ghosh, Prasanta Kumar
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5049 - 5053
  • [7] Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks
    Wani, Taiba Majid
    Gunawan, Teddy Surya
    Qadri, Syed Asif Ahmad
    Mansor, Hasmah
    Kartiwi, Mira
    Ismail, Nanang
    PROCEEDING OF 2020 6TH INTERNATIONAL CONFERENCE ON WIRELESS AND TELEMATICS (ICWT), 2020,
  • [8] The Representation of Speech in Deep Neural Networks
    Scharenborg, Odette
    van der Gouw, Nikki
    Larson, Martha
    Marchiori, Elena
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 194 - 205
  • [9] BIOMETRIC HUMAN AUTHENTICATION SYSTEM THROUGH SPEECH USING DEEP NEURAL NETWORKS (DNN)
    Mamyrbayev, O.
    Akhmediyarova, A.
    Kydyrbekova, A.
    Mekebayev, N. O.
    Zhumazhanov, B.
    BULLETIN OF THE NATIONAL ACADEMY OF SCIENCES OF THE REPUBLIC OF KAZAKHSTAN, 2020, (05): : 6 - 15
  • [10] Automatic Recognition of Kazakh Speech Using Deep Neural Networks
    Mamyrbayev, Orken
    Turdalyuly, Mussa
    Mekebayev, Nurbapa
    Alimhan, Keylan
    Kydyrbekova, Aizat
    Turdalykyzy, Tolganay
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT II, 2019, 11432 : 465 - 474