Explainable audio CNNs applied to neural decoding: sound category identification from inferior colliculus

Cited by: 3
Authors
Ozcan, Fatma [1]
Alkan, Ahmet [1]
Affiliations
[1] Kahramanmaras Sutcu Imam Univ, Elect & Elect Engn Dept, TR-46100 Kahramanmaras, Turkiye
Keywords
Explainability; Interpretation; Pre-trained audio networks; Temporal correlation; Time resolution; Multiunit activity
DOI
10.1007/s11760-023-02825-3
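Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract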
Recent work has sought to understand how the inferior colliculus (IC) processes sound. Here, we use neural temporal correlation in the inferior colliculus to identify and categorise the sound that was used as a stimulus. Classification success gradually deteriorates for shorter durations. We tried to improve these success values with deep learning methods for audio, using processing windows of 62.5 ms, 250 ms and 1000 ms. We demonstrate that 62.5 ms could be an integration time for temporal correlation. The neural data contain sound features that can be easily processed with artificial neural networks dedicated to audio signals. Network architectures dedicated to audio classification, such as YAMNet, VGGish and OpenL3, used in transfer learning, classify the neural data quickly and with very high accuracy compared to image classification networks. Accuracy is best with unshuffled correlation images. With noiseless shuffled correlation images, the best accuracies, obtained with the OpenL3 network, are 100% for 1000 ms, 96.7% for 250 ms and 93.8% for 62.5 ms. To evaluate how much the input features of a neural network contribute to its outputs, we use Explainable Artificial Intelligence. We applied three explainability methods, Grad-CAM, LIME and Occlusion Sensitivity, to obtain three sensitivity maps. The network uses different regions, corresponding to very high or very low correlation, to make its prediction.
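As an illustration of the occlusion sensitivity idea named in the abstract, the following is a minimal sketch in Python with NumPy, not the authors' implementation. The function `occlusion_sensitivity`, its parameters (`patch`, `stride`, `fill`), and the stand-in scorer `toy_score` are all hypothetical names chosen for this example. In practice the scorer would be the trained network's class score for a correlation image.

```python
import numpy as np

def occlusion_sensitivity(image, score_fn, patch=4, stride=4, fill=0.0):
    """Map of score drops observed when each patch of `image` is occluded."""
    base = score_fn(image)              # score on the unmodified input
    h, w = image.shape
    heat = np.zeros((h, w))             # accumulated score drops
    counts = np.zeros((h, w))           # how many patches covered each pixel
    for r in range(0, h - patch + 1, stride):
        for c in range(0, w - patch + 1, stride):
            occluded = image.copy()
            occluded[r:r + patch, c:c + patch] = fill
            drop = base - score_fn(occluded)
            heat[r:r + patch, c:c + patch] += drop
            counts[r:r + patch, c:c + patch] += 1
    # Average over overlapping patches; avoid division by zero at uncovered pixels
    return heat / np.maximum(counts, 1)

# Hypothetical stand-in for a trained classifier: it only "looks at"
# the top-left 4x4 block of the input image.
def toy_score(img):
    return float(img[:4, :4].mean())

image = np.ones((8, 8))
sens = occlusion_sensitivity(image, toy_score)
```

Regions where the map is large are those the scorer relies on most. In this toy case only the top-left block matters, so only that block receives a non-zero sensitivity.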
Pages: 1193-1204
Page count: 12
Related papers
37 items in total
  • [1] Explainable audio CNNs applied to neural decoding: sound category identification from inferior colliculus
    Fatma Özcan
    Ahmet Alkan
    Signal, Image and Video Processing, 2024, 18 : 1193 - 1204
  • [2] Neural decoding of inferior colliculus multiunit activity for sound category identification with temporal correlation and transfer learning
    Ozcan, Fatma
    Alkan, Ahmet
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2024, 35 (02) : 101 - 133
  • [3] Neural population encoding and decoding of sound source location across sound level in the rabbit inferior colliculus
    Day, Mitchell L.
    Delgutte, Bertrand
    JOURNAL OF NEUROPHYSIOLOGY, 2016, 115 (01) : 193 - 207
  • [4] Neural representation of sound duration in the inferior colliculus of the mouse
    Xia, YF
    Qi, ZH
    Shen, JX
    ACTA OTO-LARYNGOLOGICA, 2000, 120 (05) : 638 - 643
  • [5] FROM THE INFERIOR COLLICULUS TO A COMPUTATIONAL SOUND LOCALIZATION MODEL
    Liu, Jindong
    Erwin, Harry
    Wermter, Stefan
    NEURAL NETWORK WORLD, 2009, 19 (05) : 499 - 512
  • [6] NEURAL TUNING FOR SOUND DURATION - ROLE OF INHIBITORY MECHANISMS IN THE INFERIOR COLLICULUS
    CASSEDAY, JH
    EHRLICH, D
    COVEY, E
    SCIENCE, 1994, 264 (5160) : 847 - 850
  • [7] Sensitivity of neural responses in the inferior colliculus to statistical features of sound textures
    Mishra, Ambika P.
    Peng, Fei
    Li, Kongyan
    Harper, Nicol S.
    Schnupp, Jan W. H.
    HEARING RESEARCH, 2021, 412
  • [8] A biologically inspired spiking neural network for sound localisation by the inferior colliculus
    Liu, Jindong
    Erwin, Harry
    Wermter, Stefan
    Elsaid, Mahmoud
    ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT II, 2008, 5164 : 396 - 405
  • [9] A neural ensemble correlation code for sound category identification
    Sadeghi, Mina
    Zhai, Xiu
    Stevenson, Ian H.
    Escabi, Monty A.
    PLOS BIOLOGY, 2019, 17 (10)
  • [10] Neural tuning to sound duration in the inferior colliculus of the big brown bat, Eptesicus fuscus
    Ehrlich, D
    Casseday, JH
    Covey, E
    JOURNAL OF NEUROPHYSIOLOGY, 1997, 77 (05) : 2360 - 2372