Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis

Cited by: 231
Authors
McDermott, Josh H. [1, 2]
Simoncelli, Eero P. [1, 2, 3]
Affiliations
[1] NYU, Howard Hughes Med Inst, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
[3] NYU, Courant Inst Math Sci, New York, NY 10003 USA
Keywords
COCKTAIL PARTY; AMPLITUDE-MODULATION; RECEPTIVE-FIELDS; DISCRIMINATION; SPEECH; MECHANISMS; FREQUENCY; RESPONSES; MASKING; CONTEXT
DOI
10.1016/j.neuron.2011.06.032
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
Rainstorms, insect swarms, and galloping horses produce "sound textures": the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modulations, and measured statistics of the resulting decomposition. We then assessed the realism and recognizability of novel sounds synthesized to have matching statistics. Statistics of individual frequency channels, capturing spectral power and sparsity, generally failed to produce compelling synthetic textures; however, combining them with correlations between channels produced identifiable and natural-sounding textures. Synthesis quality declined if statistics were computed from biologically implausible auditory models. The results suggest that sound texture perception is mediated by relatively simple statistics of early auditory representations, presumably computed by downstream neural populations. The synthesis methodology offers a powerful tool for their further investigation.
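
The analysis half of the pipeline described in the abstract (split the sound into frequency channels, take each channel's envelope, then measure time-averaged statistics within and across channels) can be illustrated with a minimal Python sketch. This is not the authors' auditory model: the ideal FFT band masks, the log-spaced band edges, the white-noise input, and the function names `band_envelopes` and `texture_statistics` are all assumptions made for illustration. The modulation filters and the synthesis step are omitted.

```python
import numpy as np

def band_envelopes(x, sr, edges):
    """Split x into bands with ideal FFT masks and return amplitude envelopes.

    Assumption: rectangular band masks stand in for cochlear-like filters.
    All band edges must be above 0 Hz and below sr / 2.
    """
    spectrum = np.fft.fft(x)
    freqs = np.fft.fftfreq(len(x), d=1.0 / sr)
    envs = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        # Keep only positive frequencies in [lo, hi); doubling them yields
        # the analytic signal of the band, whose magnitude is the envelope.
        mask = (freqs >= lo) & (freqs < hi)
        analytic = np.fft.ifft(2.0 * spectrum * mask)
        envs.append(np.abs(analytic))
    return np.array(envs)  # shape: (bands, samples)

def texture_statistics(envs):
    """Time-averaged statistics of band envelopes (illustrative subset)."""
    mean = envs.mean(axis=1)                 # relates to spectral power
    std = envs.std(axis=1)
    centered = envs - mean[:, None]
    skew = (centered ** 3).mean(axis=1) / std ** 3  # relates to sparsity
    corr = np.corrcoef(envs)                 # correlations between channels
    return {"mean": mean, "cv": std / mean, "skew": skew, "corr": corr}

# Usage on one second of white noise (a stand-in for a recorded texture).
rng = np.random.default_rng(0)
sr = 16000
x = rng.standard_normal(sr)
edges = np.geomspace(100.0, 6400.0, 9)       # 8 log-spaced bands (assumed)
stats = texture_statistics(band_envelopes(x, sr, edges))
print(stats["corr"].shape)
```

The per-channel entries (`mean`, `cv`, `skew`) correspond loosely to the "statistics of individual frequency channels" in the abstract, and `corr` to the "correlations between channels" that the abstract reports were needed for natural-sounding results.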
Pages: 926-940
Page count: 15
Related papers
50 in total
  • [31] EFFECTS OF VISUAL PERCEPTION ON AUDITORY LOCALIZATION OF SOUND SOURCE
    MANKOVSKII, VS
    VOPROSY PSIKHOLOGII, 1969, (04) : 57 - 65
  • [32] Texture synthesis models and material perception in the visual periphery
    Balas, Benjamin
    HUMAN VISION AND ELECTRONIC IMAGING XX, 2015, 9394
  • [33] Perception of auditory events in scenarios with projected and direct sound from various directions
    Wuehle, Tom
    Merchel, Sebastian
    Altinsoy, M. Ercan
    146TH AES CONVENTION, 2019,
  • [34] Dynamic Range Adaptation to Sound Level Statistics in the Auditory Nerve
    Wen, Bo
    Wang, Grace I.
    Dean, Isabel
    Delgutte, Bertrand
    JOURNAL OF NEUROSCIENCE, 2009, 29 (44) : 13797 - 13808
  • [35] SOUND TEXTURE SYNTHESIS USING RI SPECTROGRAMS
    Caracalla, Hugo
    Roebel, Axel
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 416 - 420
  • [36] CONCATENATIVE SOUND TEXTURE SYNTHESIS METHODS AND EVALUATION
    Schwarz, Diemo
    Roebel, Axel
    Yeh, Chunghsin
    LaBurthe, Amaury
    DAFX 16: 19TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, 2016, : 217 - 224
  • [37] Separating the Novel Speech Sound Perception of Lexical Tone Chimeras From Their Auditory Signal Manipulations: Behavioral and Electroencephalographic Evidence
    Jeng, Fuh-Cherng
    Hart, Breanna N.
    Lin, Chia-Der
    PERCEPTUAL AND MOTOR SKILLS, 2021, 128 (06) : 2527 - 2543
  • [38] Auditory perception is influenced by the orientation of the trunk relative to a sound source
    Occhigrossi, Chiara
    Brosch, Michael
    Giommetti, Giorgia
    Panichi, Roberto
    Ricci, Giampietro
    Ferraresi, Aldo
    Roscini, Mauro
    Pettorossi, Vito Enrico
    Faralli, Mario
    EXPERIMENTAL BRAIN RESEARCH, 2021, 239 (04) : 1223 - 1234
  • [39] Perception of stochastically undersampled sound waveforms: a model of auditory deafferentation
    Lopez-Poveda, Enrique A.
    Barrios, Pablo
    FRONTIERS IN NEUROSCIENCE, 2013, 7
  • [40] COMPLEX HUMAN AUDITORY PERCEPTION AND SIMULATED SOUND PERFORMANCE PREDICTION
    Alambeigi, Pantea
    Zhao, Sipei
    Burry, Jane
    Qiu, Xiaojun
    PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON COMPUTER-AIDED ARCHITECTURAL DESIGN RESEARCH IN ASIA (CAADRIA 2016): LIVING SYSTEMS AND MICRO-UTOPIAS: TOWARDS CONTINUOUS DESIGNING, 2016, : 631 - 640