Temporal Coherence and the Streaming of Complex Sounds

被引:25
|
作者
Shamma, Shihab [1 ]
Elhilali, Mounya [3 ]
Ma, Ling [1 ,2 ]
Micheyl, Christophe [4 ]
Oxenham, Andrew J. [4 ]
Pressnitzer, Daniel [5 ,6 ]
Yin, Pingbo [1 ]
Xu, Yanbo [1 ]
机构
[1] Univ Maryland, Syst Res Inst, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Bioengn Program, College Pk, MD 20742 USA
[3] Johns Hopkins Univ, Dept Elect & Comp Engn, Baltimore, MD 21218 USA
[4] Univ Minnesota, Dept Psychol, Minneapolis, MN 55455 USA
[5] Ecole Normale Super, Equipe Audit, Dept Etudes Cognit, F-75231 Paris, France
[6] Univ Paris 05, UMR CNRS 8158, Lab Psychol Percept, Paris, France
关键词
ATTENTION;
D O I
10.1007/978-1-4614-1590-9_59
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Humans and other animals can attend to one of multiple sounds, and follow it selectively over time. The neural underpinnings of this perceptual feat remain mysterious. Some studies have concluded that sounds are heard as separate streams when they activate well-separated populations of central auditory neurons, and that this process is largely pre-attentive. Here, we propose instead that stream formation depends primarily on temporal coherence between responses that encode various features of a sound source. Furthermore, we postulate that only when attention is directed toward a particular feature (e.g., pitch or location) do all other temporally coherent features of that source (e.g., timbre and location) become bound together as a stream that is segregated from the incoherent features of other sources. Experimental neurophysiological evidence in support of this hypothesis will be presented. The focus, however, will be on a computational realization of this idea and a discussion of the insights learned from simulations to disentangle complex sound sources such as speech and music. The model consists of a representational stage of early and cortical auditory processing that creates a multidimensional depiction of various sound attributes such as pitch, location, and spectral resolution. The following stage computes a coherence matrix that summarizes the pair-wise correlations between all channels making up the cortical representation. Finally, the perceived segregated streams are extracted by decomposing the coherence matrix into its uncorrelated components. Questions raised by the model are discussed, especially on the role of attention in streaming and the search for further neural correlates of streaming percepts.
引用
收藏
页码:535 / 543
页数:9
相关论文
共 50 条
  • [1] INFLUENCE OF PHASE COHERENCE UPON THE PITCH OF COMPLEX, PERIODIC SOUNDS
    LICKLIDER, JCR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (05): : 996 - 996
  • [2] Reverberation challenges the temporal representation of the pitch of complex sounds
    Sayles, Mark
    Winter, Ian M.
    NEURON, 2008, 58 (05) : 789 - 801
  • [3] Rate Versus Temporal Code? A Spatio-Temporal Coherence Model of the Cortical Basis of Streaming
    Elhilali, Mounya
    Ma, Ling
    Micheyl, Christophe
    Oxenham, Andrew
    Shamma, Shihab
    NEUROPHYSIOLOGICAL BASES OF AUDITORY PERCEPTION, 2010, : 497 - +
  • [4] Segregating Complex Sound Sources through Temporal Coherence
    Krishnan, Lakshmi
    Elhilali, Mounya
    Shamma, Shihab
    PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (12)
  • [5] Segregation of complex acoustic scenes based on temporal coherence
    Teki, Sundeep
    Chait, Maria
    Kumar, Sukhbinder
    Shamma, Shihab
    Griffiths, Timothy D.
    ELIFE, 2013, 2
  • [6] Neural Correlates of Auditory Streaming of Harmonic Complex Sounds With Different Phase Relations in the Songbird Forebrain
    Itatani, Naoya
    Klump, Georg M.
    JOURNAL OF NEUROPHYSIOLOGY, 2011, 105 (01) : 188 - 199
  • [7] Learning spectro-temporal representations of complex sounds with parameterized neural networksa)
    Riad, Rachid
    Karadayi, Julien
    Bachoud-Levi, Anne-Catherine
    Dupoux, Emmanuel
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (01): : 353 - 366
  • [8] CODING OF TEMPORAL PARAMETERS OF COMPLEX SOUNDS BY FROG AUDITORY-NERVE FIBERS
    FENG, AS
    HALL, JC
    SIDDIQUE, S
    JOURNAL OF NEUROPHYSIOLOGY, 1991, 65 (03) : 424 - 445
  • [9] Streaming Sounds: Musical Listening in the Digital Age
    Green, Ben
    Walsh, Michael James
    JOURNAL OF SOCIOLOGY, 2025,
  • [10] Topos: Spiking neural networks for temporal pattern recognition in complex real sounds
    Gonzalez-Nalda, Pablo
    Cases, Blanca
    NEUROCOMPUTING, 2008, 71 (4-6) : 721 - 732