The human auditory system uses amplitude modulation to distinguish music from speech

被引:3
|
作者
Chang, Andrew [1 ]
Teng, Xiangbin [2 ]
Assaneo, M. Florencia [3 ]
Poeppel, David [1 ,4 ,5 ,6 ]
机构
[1] NYU, Dept Psychol, New York, NY 10012 USA
[2] Chinese Univ Hong Kong, Dept Psychol, Hong Kong, Peoples R China
[3] Univ Nacl Autonoma Mexico, Inst Neurobiol, Juriquilla, Queretaro, Mexico
[4] Ernst Struengmann Inst Neurosci, Frankfurt, Germany
[5] NYU, Ctr Language Mus & Emot CLaME, New York, NY USA
[6] NYU, Mus & Audio Res Lab MARL, New York, NY USA
关键词
RHYTHM; BEAT; SONG; PERCEPTION; OSCILLATIONS; SOUNDS;
D O I
10.1371/journal.pbio.3002631
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Music and speech are complex and distinct auditory signals that are both foundational to the human experience. The mechanisms underpinning each domain are widely investigated. However, what perceptual mechanism transforms a sound into music or speech and how basic acoustic information is required to distinguish between them remain open questions. Here, we hypothesized that a sound's amplitude modulation (AM), an essential temporal acoustic feature driving the auditory system across processing levels, is critical for distinguishing music and speech. Specifically, in contrast to paradigms using naturalistic acoustic signals (that can be challenging to interpret), we used a noise-probing approach to untangle the auditory mechanism: If AM rate and regularity are critical for perceptually distinguishing music and speech, judging artificially noise-synthesized ambiguous audio signals should align with their AM parameters. Across 4 experiments (N = 335), signals with a higher peak AM frequency tend to be judged as speech, lower as music. Interestingly, this principle is consistently used by all listeners for speech judgments, but only by musically sophisticated listeners for music. In addition, signals with more regular AM are judged as music over speech, and this feature is more critical for music judgment, regardless of musical sophistication. The data suggest that the auditory system can rely on a low-level acoustic property as basic as AM to distinguish music from speech, a simple principle that provokes both neurophysiological and evolutionary experiments and speculations. What enables the human auditory system to distinguish between music and speech? Through a reductionist approach involving judgments on artificially generated ambiguous noise clips, this study shows that the auditory system relies on basic acoustic parameters, such as amplitude modulation rates and regularities, to differentiate between music and speech.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Ability of Human Auditory Perception to Distinguish Human-Imitated Speech
    Zaman, Khalid
    Li, Kai
    Samiul, Islam J. A. M.
    Uezu, Yasufumi
    Kidani, Shunsuke
    Unoki, Masashi
    IEEE ACCESS, 2025, 13 : 6225 - 6236
  • [2] Processing of music and speech by the human auditory cortex: Neuroimaging evidence
    Zatorre, RJ
    BRAIN AND COGNITION, 2004, 54 (02) : 129 - 129
  • [3] Human Frequency Following Responses to Vocoded Speech: Amplitude Modulation Versus Amplitude Plus Frequency Modulation
    Suresh, Chandan H.
    Krishnan, Ananthanarayan
    Luo, Xin
    EAR AND HEARING, 2020, 41 (02): : 300 - 311
  • [4] Amplitude Modulation Features for Emotion Recognition from Speech
    Alam, Md Jahangir
    Attabi, Yazid
    Dumouchel, Pierre
    Kenny, Patrick
    O'Shaughnessy, D.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2419 - 2423
  • [5] HUMAN AUDITORY ASSESSMENT OF MODULATION FREQUENCY OF AN AMPLITUDE-MODULATED TONE
    ISHCHENKO, SM
    SOVIET PHYSICS ACOUSTICS-USSR, 1977, 23 (01): : 35 - 38
  • [6] An Auditory Inspired Amplitude Modulation Filter Bank for Robust Feature Extraction in Automatic Speech Recognition
    Fraunhofer Institute for Digital Media Technology , Project Group for Hearing, Speech, and Audio Technology , Oldenburg
    D-26129, Germany
    不详
    D-26111, Germany
    不详
    不详
    IEEE Trans. Audio Speech Lang. Process., 11 (1926-1937):
  • [7] An Auditory Inspired Amplitude Modulation Filter Bank for Robust Feature Extraction in Automatic Speech Recognition
    Moritz, Niko
    Anemueller, Joern
    Kollmeier, Birger
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1926 - 1937
  • [8] Reconstructing Speech from Human Auditory Cortex
    Pasley, Brian N.
    David, Stephen V.
    Mesgarani, Nima
    Flinker, Adeen
    Shamma, Shihab A.
    Crone, Nathan E.
    Knight, Robert T.
    Chang, Edward F.
    PLOS BIOLOGY, 2012, 10 (01)
  • [9] Speech recognition based on a model of human auditory system
    Koizumi, T
    Mori, M
    Taniguchi, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 937 - 940
  • [10] Concurrent encoding of frequency and amplitude modulation in human auditory cortex: MEG evidence
    Luo, Huan
    Wang, Yadong
    Poeppel, David
    Simon, Jonathan Z.
    JOURNAL OF NEUROPHYSIOLOGY, 2006, 96 (05) : 2712 - 2723