The human auditory system uses amplitude modulation to distinguish music from speech

被引:3
|
作者
Chang, Andrew [1 ]
Teng, Xiangbin [2 ]
Assaneo, M. Florencia [3 ]
Poeppel, David [1 ,4 ,5 ,6 ]
机构
[1] NYU, Dept Psychol, New York, NY 10012 USA
[2] Chinese Univ Hong Kong, Dept Psychol, Hong Kong, Peoples R China
[3] Univ Nacl Autonoma Mexico, Inst Neurobiol, Juriquilla, Queretaro, Mexico
[4] Ernst Struengmann Inst Neurosci, Frankfurt, Germany
[5] NYU, Ctr Language Mus & Emot CLaME, New York, NY USA
[6] NYU, Mus & Audio Res Lab MARL, New York, NY USA
关键词
RHYTHM; BEAT; SONG; PERCEPTION; OSCILLATIONS; SOUNDS;
D O I
10.1371/journal.pbio.3002631
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Music and speech are complex and distinct auditory signals that are both foundational to the human experience. The mechanisms underpinning each domain are widely investigated. However, what perceptual mechanism transforms a sound into music or speech and how basic acoustic information is required to distinguish between them remain open questions. Here, we hypothesized that a sound's amplitude modulation (AM), an essential temporal acoustic feature driving the auditory system across processing levels, is critical for distinguishing music and speech. Specifically, in contrast to paradigms using naturalistic acoustic signals (that can be challenging to interpret), we used a noise-probing approach to untangle the auditory mechanism: If AM rate and regularity are critical for perceptually distinguishing music and speech, judging artificially noise-synthesized ambiguous audio signals should align with their AM parameters. Across 4 experiments (N = 335), signals with a higher peak AM frequency tend to be judged as speech, lower as music. Interestingly, this principle is consistently used by all listeners for speech judgments, but only by musically sophisticated listeners for music. In addition, signals with more regular AM are judged as music over speech, and this feature is more critical for music judgment, regardless of musical sophistication. The data suggest that the auditory system can rely on a low-level acoustic property as basic as AM to distinguish music from speech, a simple principle that provokes both neurophysiological and evolutionary experiments and speculations. What enables the human auditory system to distinguish between music and speech? Through a reductionist approach involving judgments on artificially generated ambiguous noise clips, this study shows that the auditory system relies on basic acoustic parameters, such as amplitude modulation rates and regularities, to differentiate between music and speech.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Perceptual speech enhancement exploiting temporal masking properties of human auditory system
    Gunawan, Teddy Surya
    Ambikairajah, Eliathamby
    Epps, Julien
    SPEECH COMMUNICATION, 2010, 52 (05) : 381 - 393
  • [42] Speech Reconstruction from Human Auditory Cortex with Deep Neural Networks
    Yang, Minda
    Sheth, Sameer A.
    Schevon, Catherine A.
    McKhann, Guy M., II
    Mesgarani, Nima
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1121 - 1125
  • [43] Responsiveness of the human auditory cortex to degraded speech sounds: Reduction of amplitude resolution vs. additive noise
    Miettinen, Ismo
    Alku, Paavo
    Salminen, Nelli
    May, Patrick J. C.
    Tiitinen, Hannu
    BRAIN RESEARCH, 2011, 1367 : 298 - 309
  • [44] Short-term plasticity in the auditory system: Differential neural responses to perception and imagery of speech and music
    Meyer, Martin
    Elmer, Stefan
    Baumann, Simon
    Jancke, Lutz
    RESTORATIVE NEUROLOGY AND NEUROSCIENCE, 2007, 25 (3-4) : 411 - 431
  • [45] RANDOM MODULATION AND CHANNELS IN HUMAN AUDITORY-SYSTEM CONCERNED WITH DETECTION OF MODULATION IN FREQUENCY MODULATED TONES
    GREEN, GGR
    KAY, RH
    JOURNAL OF PHYSIOLOGY-LONDON, 1976, 263 (01): : P167 - P168
  • [46] PET imaging of the normal human auditory system: responses to speech in quiet and in background noise
    Salvi, RJ
    Lockwood, AH
    Frisina, RD
    Coad, ML
    Wack, DS
    Frisina, DR
    HEARING RESEARCH, 2002, 170 (1-2) : 96 - 106
  • [47] Decoding of selective attention to continuous speech from the human auditory brainstem response
    Etard, Octave
    Kegler, Mikolaj
    Braiman, Chananel
    Forte, Antonio Elia
    Reichenbach, Tobias
    NEUROIMAGE, 2019, 200 : 1 - 11
  • [48] Central auditory system responses from children while listening to speech in noise
    Benitez-Barrera, Carlos R.
    Key, Alexandra P.
    Ricketts, Todd A.
    Tharpe, Anne Marie
    HEARING RESEARCH, 2021, 403
  • [49] Multi-channel mono-path periodic signal extraction with global amplitude and phase modulation for music and speech signal analysis
    Triki, Mahdi
    Slock, Dirk T. M.
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 66 - 71
  • [50] Modulation of cerebral blood flow in the human auditory cortex during speech: Role of motor-to-sensory discharges
    Paus, T
    Perry, DW
    Zatorre, RJ
    Worsley, KJ
    Evans, AC
    EUROPEAN JOURNAL OF NEUROSCIENCE, 1996, 8 (11) : 2236 - 2246