Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis

被引:231
|
作者
McDermott, Josh H. [1 ,2 ]
Simoncelli, Eero P. [1 ,2 ,3 ]
机构
[1] NYU, Howard Hughes Med Inst, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
[3] NYU, Courant Inst Math Sci, New York, NY 10003 USA
关键词
COCKTAIL PARTY; AMPLITUDE-MODULATION; RECEPTIVE-FIELDS; DISCRIMINATION; SPEECH; MECHANISMS; FREQUENCY; RESPONSES; MASKING; CONTEXT;
D O I
10.1016/j.neuron.2011.06.032
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Rainstorms, insect swarms, and galloping horses produce "sound textures"-the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modulations, and measured statistics of the resulting decomposition. We then assessed the realism and recognizability of novel sounds synthesized to have matching statistics. Statistics of individual frequency channels, capturing spectral power and sparsity, generally failed to produce compelling synthetic textures; however, combining them with correlations between channels produced identifiable and natural-sounding textures. Synthesis quality declined if statistics were computed from biologically implausible auditory models. The results suggest that sound texture perception is mediated by relatively simple statistics of early auditory representations, presumably computed by downstream neural populations. The synthesis methodology offers a powerful tool for their further investigation.
引用
收藏
页码:926 / 940
页数:15
相关论文
共 50 条
  • [21] A MONTAGE APPROACH TO SOUND TEXTURE SYNTHESIS
    O'Leary, Sean
    Robel, Axel
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 939 - 943
  • [22] A Montage Approach to Sound Texture Synthesis
    O'Leary, Sean
    Robel, Axel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (06) : 1094 - 1105
  • [23] Sound Synthesis with Auditory Distortion Products
    Kendall, Gary S.
    Haworth, Christopher
    Cadiz, Rodrigo F.
    COMPUTER MUSIC JOURNAL, 2014, 38 (04) : 5 - 23
  • [24] Sound Spectrum Influences Auditory Distance Perception of Sound Sources Located in a Room Environment
    Spiousas, Ignacio
    Etchemendy, Pablo E.
    Eguia, Manuel C.
    Calcagno, Esteban R.
    Abregu, Ezequiel
    Vergara, Ramiro O.
    FRONTIERS IN PSYCHOLOGY, 2017, 8 : 1 - 16
  • [25] Coding of sound direction in the auditory periphery of the lake sturgeon, Acipenser fulvescens
    Meyer, Michaela
    Popper, Arthur N.
    Fay, Richard R.
    JOURNAL OF NEUROPHYSIOLOGY, 2012, 107 (02) : 658 - 665
  • [26] Auditory illusions, phonetic testing and disability of sound perception
    Rainov, V
    Petrova, J
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 1998, 30 (1-2) : 214 - 214
  • [27] Perception of a secondary auditory image with three sound sources
    Tan, BTG
    Tang, SH
    Yu, GQ
    ACUSTICA, 2000, 86 (06): : 1034 - 1037
  • [28] AUDITORY PERCEPTION OF NATURAL SOUND CATEGORIES - AN FMRI STUDY
    Sharda, M.
    Singh, N. C.
    NEUROSCIENCE, 2012, 214 : 49 - 58
  • [29] An ANN Model of the Perception of Sound by the Human Auditory System
    Riordan, D.
    Walsh, J.
    Doody, P.
    2013 SEVENTH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST), 2013, : 75 - 80
  • [30] Auditory processing and perception of spectral cues for sound localization
    May, BJ
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2000, 12 : 513 - 513