Matching Pursuit Analysis of Auditory Receptive Fields' Spectro-Temporal Properties

Cited by: 1
Authors
Bach, Joerg-Hendrik [1, 2]
Kollmeier, Birger [1, 2]
Anemueller, Joern [1, 2]
Affiliations
[1] Carl von Ossietzky Univ Oldenburg, Med Phys, Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4All, Oldenburg, Germany
Keywords
auditory receptive fields; spectro-temporal patterns; Gabor filters; matching pursuit; acoustic event classification; FILTER BANK FEATURES; SPEECH; NEURONS; REPRESENTATION; RECOGNITION; MIDBRAIN; MODEL; DISCRIMINATION; CLASSIFICATION; SELECTIVITY
DOI
10.3389/fnsys.2017.00004
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Gabor filters have long been proposed as models for spectro-temporal receptive fields (STRFs), with their specific spectral and temporal rate of modulation qualitatively replicating characteristics of STRF filters estimated from responses to auditory stimuli in physiological data. The present study builds on the Gabor-STRF model by proposing a methodology to quantitatively decompose STRFs into a set of optimally matched Gabor filters through matching pursuit, and by quantitatively evaluating spectral and temporal characteristics of STRFs in terms of the derived optimal Gabor-parameters. To summarize a neuron's spectro-temporal characteristics, we introduce a measure for the "diagonality," i.e., the extent to which an STRF exhibits spectro-temporal transients which cannot be factorized into a product of a spectral and a temporal modulation. With this methodology, it is shown that approximately half of 52 analyzed zebra finch STRFs can each be well approximated by a single Gabor or a linear combination of two Gabor filters. Moreover, the dominant Gabor functions tend to be oriented either in the spectral or in the temporal direction, with truly "diagonal" Gabor functions rarely being necessary for reconstruction of an STRF's main characteristics. As a toy example for the applicability of STRF and Gabor-STRF filters to auditory detection tasks, we use STRF filters as features in an automatic event detection task and compare them to idealized Gabor filters and mel-frequency cepstral coefficients (MFCCs). STRFs classify a set of six everyday sounds with an accuracy similar to reference Gabor features (94% recognition rate). Spectro-temporal STRF and Gabor features outperform reference spectral MFCCs in quiet and in low noise conditions (down to 0 dB signal to noise ratio).
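The decomposition idea in the abstract can be illustrated with a minimal sketch of generic matching pursuit over a small Gabor dictionary. This is not the authors' implementation: the patch size, the Gaussian envelope width `sigma`, the four modulation pairs in `PARAMS`, the helper names `gabor_atom` and `matching_pursuit`, and the synthetic `toy_strf` are all assumptions chosen for illustration. The paper's diagonality measure is not reproduced here.

```python
import math

def gabor_atom(n_f, n_t, omega_f, omega_t, phase, sigma=2.0):
    """Unit-norm 2D Gabor patch, flattened row by row (frequency x time).

    omega_f / omega_t are spectral / temporal modulation frequencies in
    cycles per bin (illustrative units, not those of the paper).
    """
    cf, ct = (n_f - 1) / 2, (n_t - 1) / 2
    atom = []
    for f in range(n_f):
        for t in range(n_t):
            env = math.exp(-((f - cf) ** 2 + (t - ct) ** 2) / (2 * sigma ** 2))
            arg = 2 * math.pi * (omega_f * (f - cf) + omega_t * (t - ct)) + phase
            atom.append(env * math.cos(arg))
    norm = math.sqrt(sum(v * v for v in atom))
    return [v / norm for v in atom]

def matching_pursuit(signal, dictionary, n_iter):
    """Greedy matching pursuit: at each step pick the atom with the largest
    absolute inner product with the residual and subtract its projection."""
    residual = list(signal)
    picks = []
    for _ in range(n_iter):
        products = [sum(a * r for a, r in zip(atom, residual))
                    for atom in dictionary]
        best = max(range(len(dictionary)), key=lambda i: abs(products[i]))
        coeff = products[best]
        residual = [r - coeff * a for r, a in zip(residual, dictionary[best])]
        picks.append((best, coeff))
    return picks, residual

# Hypothetical dictionary: purely temporal, purely spectral, and two
# "diagonal" modulations, each at two phases.
PARAMS = [(of, ot, ph)
          for of, ot in [(0.0, 0.1), (0.1, 0.0), (0.1, 0.1), (-0.1, 0.1)]
          for ph in (0.0, math.pi / 2)]
DICTIONARY = [gabor_atom(9, 9, of, ot, ph) for of, ot, ph in PARAMS]

# Synthetic stand-in for an STRF: a mix of a temporal and a diagonal atom.
toy_strf = [2.0 * a + 0.5 * b for a, b in zip(DICTIONARY[0], DICTIONARY[4])]
picks, residual = matching_pursuit(toy_strf, DICTIONARY, n_iter=2)
for index, coeff in picks:
    print(PARAMS[index], round(coeff, 3))
```

Because the atoms are normalized to unit length, each iteration can only lower (or keep) the residual energy, so the number of atoms needed to reach a given reconstruction quality gives a rough sense of how "Gabor-like" a patch is. A real analysis would use a much denser parameter grid and measured STRFs.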
Pages: 12
Related papers
50 in total
  • [41] Spectro-Temporal Analysis of HIE and Asthma Infant Cries Using Auditory Spectrogram
    Chittora, Anshu
    Patil, Hemant A.
    Sailor, Hardik B.
    2015 INTERNATIONAL CONFERENCE ON BIOSIGNAL ANALYSIS, PROCESSING AND SYSTEMS (ICBAPS), 2015,
  • [42] Weighting of Spatial and Spectro-Temporal Cues for Auditory Scene Analysis by Human Listeners
    Bremen, Peter
    Middlebrooks, John C.
    PLOS ONE, 2013, 8 (03):
  • [43] Localized spectro-temporal cepstral analysis of speech
    Bouvrie, Jake
    Ezzat, Tony
    Poggio, Tomaso
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4733 - 4736
  • [44] RELATION OF BINAURAL INTERACTION AND SPECTRO-TEMPORAL CHARACTERISTICS IN THE AUDITORY MIDBRAIN OF THE GRASSFROG
    EPPING, WJM
    EGGERMONT, JJ
    HEARING RESEARCH, 1985, 19 (01) : 15 - 28
  • [45] A COMPARISON OF THE SPECTRO-TEMPORAL SENSITIVITY OF AUDITORY NEURONS TO TONAL AND NATURAL STIMULI
    AERTSEN, AMHJ
    JOHANNESMA, PIM
    BIOLOGICAL CYBERNETICS, 1981, 42 (02) : 145 - 156
  • [46] Dynamics of spectro-temporal tuning in primary auditory cortex of the awake ferret
    Shechter, B.
    Dobbins, H. D.
    Marvit, P.
    Depireux, D. A.
    HEARING RESEARCH, 2009, 256 (1-2) : 118 - 130
  • [47] Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds
    Sarah M N Woolley
    Thane E Fremouw
    Anne Hsu
    Frédéric E Theunissen
    Nature Neuroscience, 2005, 8 : 1371 - 1379
  • [48] Spectro-temporal correlates of lexical access during auditory lexical decision
    Brennan, Jonathan
    Lignos, Constantine
    Embick, David
    Roberts, Timothy P. L.
    BRAIN AND LANGUAGE, 2014, 133 : 39 - 46
  • [49] Incorporating behavioral and sensory context into spectro-temporal models of auditory encoding
    David, Stephen V.
    HEARING RESEARCH, 2018, 360 : 107 - 123
  • [50] Tuning for spectro-temporal modulations as a mechanism for auditory discrimination of natural sounds
    Woolley, SMN
    Fremouw, TE
    Hsu, A
    Theunissen, FE
    NATURE NEUROSCIENCE, 2005, 8 (10) : 1371 - 1379