Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis

被引:220
|
作者
McDermott, Josh H. [1 ,2 ]
Simoncelli, Eero P. [1 ,2 ,3 ]
机构
[1] NYU, Howard Hughes Med Inst, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
[3] NYU, Courant Inst Math Sci, New York, NY 10003 USA
关键词
COCKTAIL PARTY; AMPLITUDE-MODULATION; RECEPTIVE-FIELDS; DISCRIMINATION; SPEECH; MECHANISMS; FREQUENCY; RESPONSES; MASKING; CONTEXT;
D O I
10.1016/j.neuron.2011.06.032
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Rainstorms, insect swarms, and galloping horses produce "sound textures"-the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modulations, and measured statistics of the resulting decomposition. We then assessed the realism and recognizability of novel sounds synthesized to have matching statistics. Statistics of individual frequency channels, capturing spectral power and sparsity, generally failed to produce compelling synthetic textures; however, combining them with correlations between channels produced identifiable and natural-sounding textures. Synthesis quality declined if statistics were computed from biologically implausible auditory models. The results suggest that sound texture perception is mediated by relatively simple statistics of early auditory representations, presumably computed by downstream neural populations. The synthesis methodology offers a powerful tool for their further investigation.
引用
收藏
页码:926 / 940
页数:15
相关论文
共 50 条
  • [1] The networks underlying auditory discriminations are determined by sound statistics: evidence from fMRI
    Daikhin, L.
    Orlov, T.
    Ahissar, M.
    JOURNAL OF MOLECULAR NEUROSCIENCE, 2011, 45 (SUPPL 1) : S27 - S28
  • [2] Perceptual organization of sound begins in the auditory periphery
    Pressnitzer, Daniel
    Sayles, Mark
    Micheyl, Christophe
    Winter, Ian M.
    CURRENT BIOLOGY, 2008, 18 (15) : 1124 - 1128
  • [3] From Sound to Shape: Auditory Perception of Drawing Movements
    Thoret, Etienne
    Aramaki, Mitsuko
    Kronland-Martinet, Richard
    Velay, Jean-Luc
    Ystad, Solvi
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2014, 40 (03) : 983 - 994
  • [4] Auditory perception of sound source velocity
    Kaczmarek, T
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (05): : 3149 - 3156
  • [5] Extrinsic sound stimulations and development of periphery auditory synapses
    Kun Hou
    Shiming Yang
    Ke Liu
    Journal of Otology, 2015, 10 (02) : 47 - 50
  • [6] Learning midlevel auditory codes from natural sound statistics
    Mlynarski, Wiktor
    McDermott, Josh H.
    Neural Computation, 2018, 30 (03): : 631 - 669
  • [7] Learning Midlevel Auditory Codes from Natural Sound Statistics
    Mlynarski, Wiktor
    McDermott, Josh H.
    NEURAL COMPUTATION, 2018, 30 (03) : 631 - 669
  • [8] Auditory perception of walls via spectral variations in the ambient sound field
    Ashmead, DH
    Wall, RS
    JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1999, 36 (04): : 313 - 322
  • [9] EVIDENCE FOR SOUND PERCEPTION WITH THE LABYRINTH
    BLEEKER, JD
    WIT, HP
    SEGENHOUT, JH
    ACTA OTO-LARYNGOLOGICA, 1980, 89 (1-2) : 76 - 84
  • [10] Cascaded Amplitude Modulations in Sound Texture Perception
    McWalter, Richard
    Dau, Torsten
    FRONTIERS IN NEUROSCIENCE, 2017, 11