A Tutorial on Auditory Attention Identification Methods

被引:60
|
作者
Alickovic, Emina [1 ,2 ]
Lunner, Thomas [1 ,2 ,3 ,4 ]
Gustafsson, Fredrik [1 ]
Ljung, Lennart [1 ]
机构
[1] Linkoping Univ, Dept Elect Engn, Linkoping, Sweden
[2] Oticon AS, Erikshorm Res Ctr, Snekkersten, Denmark
[3] Tech Univ Denmark, Dept Hlth Technol, Hearing Syst, Lyngby, Denmark
[4] Linkoping Univ, Linnaeus Ctr HEAD, Swedish Inst Disabil Res, Linkoping, Sweden
来源
FRONTIERS IN NEUROSCIENCE | 2019年 / 13卷
基金
瑞典研究理事会; 欧盟地平线“2020”;
关键词
cocktail-party problem; auditory attention; linear models; stimulus reconstruction; canonical correlation anaysis (CCA); decoding; encoding; sparse representation; CANONICAL CORRELATION-ANALYSIS; COCKTAIL PARTY; SPEECH-INTELLIGIBILITY; CORTICAL TRACKING; ATTENDED SPEECH; TIME; NOISE; REGRESSION; SELECTION; SPEAKER;
D O I
10.3389/fnins.2019.00153
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Auditory attention identification methods attempt to identify the sound source of a listener's interest by analyzing measurements of electrophysiological data. We present a tutorial on the numerous techniques that have been developed in recent decades, and we present an overview of current trends in multivariate correlation-based and model-based learning frameworks. The focus is on the use of linear relations between electrophysiological and audio data. The way in which these relations are computed differs. For example, canonical correlation analysis (CCA) finds a linear subset of electrophysiological data that best correlates to audio data and a similar subset of audio data that best correlates to electrophysiological data. Model-based (encoding and decoding) approaches focus on either of these two sets. We investigate the similarities and differences between these linear model philosophies. We focus on (1) correlation-based approaches (CCA), (2) encoding/decoding models based on dense estimation, and (3) (adaptive) encoding/decoding models based on sparse estimation. The specific focus is on sparsity-driven adaptive encoding models and comparing the methodology in state-of-the-art models found in the auditory literature. Furthermore, we outline the main signal processing pipeline for how to identify the attended sound source in a cocktail party environment from the raw electrophysiological data with all the necessary steps, complemented with the necessary MATLAB code and the relevant references for each step. Our main aim is to compare the methodology of the available methods, and provide numerical illustrations to some of them to get a feeling for their potential. A thorough performance comparison is outside the scope of this tutorial.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Tutorial on Design and Evaluation Methods for Attention Directing Cues
    Borojeni, Shadan Sadeghian
    Chuang, Lewis
    Loecken, Andreas
    Glatz, Christiane
    Boll, Susanne
    AUTOMOTIVEUI 2016: 8TH INTERNATIONAL CONFERENCE ON AUTOMOTIVE USER INTERFACES AND INTERACTIVE VEHICULAR APPLICATIONS, 2016, : 213 - 215
  • [2] Approaches to robust process identification: A review and tutorial of probabilistic methods
    Kodamana, Hariprasad
    Huang, Biao
    Ranjan, Rishik
    Zhao, Yujia
    Tan, Ruomu
    Sammaknejad, Nima
    JOURNAL OF PROCESS CONTROL, 2018, 66 : 68 - 83
  • [3] AUDITORY-VERBAL PHILOSOPHY - A TUTORIAL
    GOLDBERG, DM
    VOLTA REVIEW, 1993, 95 (03) : 181 - 186
  • [4] Identification of attention and cognitive control networks in a parametric auditory fMRI study
    Westerhausen, Rene
    Moosmann, Matthias
    Alho, Kimmo
    Belsby, Stein-Ove
    Hamalainen, Heikki
    Medvedev, Svyatoslav
    Specht, Karsten
    Hugdahl, Kenneth
    NEUROPSYCHOLOGIA, 2010, 48 (07) : 2075 - 2081
  • [5] A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding
    Wong, Daniel D. E.
    Fuglsang, Soren A.
    Hjortkjaer, Jens
    Ceolini, Enea
    Slaney, Malcolm
    de Cheveigne, Alain
    FRONTIERS IN NEUROSCIENCE, 2018, 12
  • [6] ATTENTION MECHANISMS MEDIATE AND SYNTACTIC PRIMING EFFECT IN AUDITORY WORD IDENTIFICATION
    DEUTSCH, A
    BENTIN, S
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1994, 20 (03) : 595 - 607
  • [7] System identification - Tutorial
    Yen, KK
    ARTIFICIAL INTELLIGENCE AND MATHEMATICAL METHODS IN PAVEMENT AND GEOMECHANICAL SYSTEMS, 1998, : 179 - 193
  • [8] Language Identification: A Tutorial
    Ambikairajah, Eliathamby
    Li, Haizhou
    Wang, Liang
    Yin, Bo
    Sethu, Vidhyasaharan
    IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2011, 11 (02) : 82 - 108
  • [9] A TUTORIAL ON HEURISTIC METHODS
    SILVER, EA
    VIDAL, RVV
    DEWERRA, D
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1980, 5 (03) : 153 - 162
  • [10] Contraction-Based Methods for Stable Identification and Robust Machine Learning: a Tutorial
    Manchester, Ian R.
    Revay, Max
    Wang, Ruigang
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2955 - 2962