STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG

Cited by: 56
Authors
Su, Enze [1]
Cai, Siqi [2]
Xie, Longhan [1]
Li, Haizhou [2, 3, 4]
Schultz, Tanja [5]
Affiliations
[1] South China Univ Technol, Shien Ming Wu Sch Intelligent Engn, Guangzhou 510460, Guangdong, Peoples R China
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[3] Chinese Univ Hong Kong Shenzhen, Sch Data Sci, Shenzhen, Peoples R China
[4] Univ Bremen, Machine Listening Lab, Bremen, Germany
[5] Univ Bremen, Cognit Syst Lab, Bremen, Germany
Funding
National Natural Science Foundation of China
Keywords
Electroencephalography; Spatiotemporal phenomena; Feature extraction; Brain modeling; Decoding; Speech enhancement; Pipelines; Auditory attention; brain-computer interface; electroencephalography; spatial attention; temporal attention; CORTICAL REPRESENTATION; SELECTIVE ATTENTION; ATTENDED SPEECH; TRACKING; BRAIN; MODULATION; RESPONSES; DYNAMICS; CHAOS; HAND;
DOI
10.1109/TBME.2022.3140246
Chinese Library Classification (CLC) number
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Objective: Humans are able to localize the source of a sound. This enables them to direct attention to a particular speaker at a cocktail party. Psycho-acoustic studies show that the sensory cortices of the human brain respond to the location of sound sources differently, and that auditory attention itself is a dynamic and temporally based brain activity. In this work, we seek to build a computational model that uses both spatial and temporal information manifested in EEG signals for auditory spatial attention detection (ASAD). Methods: We propose an end-to-end spatiotemporal attention network, denoted as STAnet, to detect auditory spatial attention from EEG. The STAnet is designed to assign differentiated weights dynamically to EEG channels through a spatial attention mechanism, and to temporal patterns in EEG signals through a temporal attention mechanism. Results: We report ASAD experiments on two publicly available datasets. The STAnet outperforms other competitive models by a large margin under various experimental conditions. Its attention decision for a 1-second decision window outperforms that of the state-of-the-art techniques for a 10-second decision window. Experimental results also demonstrate that the STAnet achieves competitive performance on EEG signals ranging from 64 to as few as 16 channels. Conclusion: This study provides evidence suggesting that efficient low-density EEG online decoding is within reach. Significance: This study also marks an important step towards the practical implementation of ASAD in real-life applications.
Pages: 2233-2242
Page count: 10
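
The abstract describes weighting EEG channels through spatial attention and weighting temporal patterns through temporal attention, but gives no architectural details. The following is a minimal illustrative sketch of those two ideas only, not the STAnet architecture itself. Everything in it is an assumption made for illustration: the function names, the toy window of 4 channels by 8 samples, the hand-picked `channel_scores` standing in for learned parameters, and the use of plain scaled dot-product self-attention without learned projections.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_attention(window, channel_scores):
    """Scale each EEG channel by a softmax weight.

    window: list of channels, each a list of samples (channels x time).
    channel_scores: one hypothetical score per channel; in a trained
    model these would be produced by learned layers.
    """
    weights = softmax(channel_scores)
    weighted = [[w * x for x in channel] for w, channel in zip(weights, window)]
    return weighted, weights


def temporal_attention(window):
    """Scaled dot-product self-attention across time steps.

    Each time step is the vector of channel values at that instant.
    Queries, keys and values are the raw vectors (no learned projections).
    Returns a time x channels list.
    """
    n_channels = len(window)
    n_steps = len(window[0])
    steps = [[window[c][t] for c in range(n_channels)] for t in range(n_steps)]
    scale = math.sqrt(n_channels)
    attended = []
    for query in steps:
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in steps]
        weights = softmax(scores)
        attended.append([
            sum(w * value[c] for w, value in zip(weights, steps))
            for c in range(n_channels)
        ])
    return attended


# Toy data: 4 channels, 8 samples of Gaussian noise (not real EEG).
random.seed(0)
eeg = [[random.gauss(0.0, 1.0) for _ in range(8)] for _ in range(4)]
channel_scores = [0.5, -1.0, 2.0, 0.0]  # hypothetical, hand-picked

weighted, channel_weights = spatial_attention(eeg, channel_scores)
features = temporal_attention(weighted)
```

Here `channel_weights` sums to 1 and favours the channel with the highest score, while each row of `features` is a weighted average of all time steps in the window. A classifier for the attended direction would sit on top of such features; that part is omitted because the abstract does not specify it.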