The effect of F0 contour on the intelligibility of speech in the presence of interfering sounds for Mandarin Chinese

被引:14
|
作者
Chen, Jing [1 ,2 ]
Yang, Hongying [1 ,2 ]
Wu, Xihong [1 ,2 ]
Moore, Brian C. J. [3 ]
机构
[1] Peking Univ, Speech & Hearing Res Ctr, Dept Machine Intelligence, Beijing 100871, Peoples R China
[2] Peking Univ, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China
[3] Univ Cambridge, Dept Psychol, Cambridge CB2 3EB, England
来源
基金
中国国家自然科学基金;
关键词
FLATTENED FUNDAMENTAL-FREQUENCY; PERCEIVED SPATIAL SEPARATION; INFORMATIONAL MASKING; PERCEPTION; LANGUAGE; SEGREGATION; MODULATION; RECOGNITION; INTONATION; RELEASE;
D O I
10.1121/1.5023218
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In Mandarin Chinese, the fundamental frequency (F0) contour defines lexical "Tones" that differ in meaning despite being phonetically identical. Flattening the F0 contour impairs the intelligibility of Mandarin Chinese in background sounds. This might occur because the flattening introduces misleading lexical information. To avoid this effect, two types of speech were used: single-Tone speech contained Tones 1 and 0 only, which have a flat F0 contour; multi-Tone speech contained all Tones and had a varying F0 contour. The intelligibility of speech in steady noise was slightly better for single-Tone speech than for multi-Tone speech. The intelligibility of speech in a twotalker masker, with the difference in mean F0 between the target and masker matched across conditions, was worse for the multi-Tone target in the multi-Tone masker than for any other combination of target and masker, probably because informational masking was maximal for this combination. The introduction of a perceived spatial separation between the target and masker, via the precedence effect, led to better performance for all target-masker combinations, especially the multiTone target in the multi-Tone masker. In summary, a flat F0 contour does not reduce the intelligibility of Mandarin Chinese when the introduction of misleading lexical cues is avoided. (C) 2018 Acoustical Society of America.
引用
收藏
页码:864 / 877
页数:14
相关论文
共 50 条
  • [21] F0 patterns in Mandarin statements of Mandarin and Cantonese speakers
    Yang, Yike
    Chen, Si
    Chen, Xi
    INTERSPEECH 2020, 2020, : 4163 - 4167
  • [22] The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise
    Lu, Youyi
    Cooke, Martin
    SPEECH COMMUNICATION, 2009, 51 (12) : 1253 - 1262
  • [23] GLOTTOGRAPHIC AND AERODYNAMIC ANALYSIS ON CONSONANT ASPIRATION AND ONSET F0 IN MANDARIN CHINESE
    Chi, Yujie
    Honda, Kiyoshi
    Wei, Jianguo
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6480 - 6484
  • [24] F0 range instead of F0 slope is the primary cue for the falling tone of Mandarin
    Zhang, Wei
    Gu, Wentao
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (06): : 3439 - 3446
  • [25] Automatic analysis of speech F0 contour for the characterization of mood changes in bipolar patients
    Guidi, A.
    Vanello, N.
    Bertschy, G.
    Gentili, C.
    Landini, L.
    Scilingo, E. P.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 17 : 29 - 37
  • [26] Energy and F0 contour modeling with Functional Data Analysis for Emotional Speech Detection
    Pablo Arias, Juan
    Busso, Carlos
    Becerra Yoma, Nestor
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2870 - 2874
  • [27] Multiple f0 contour parallel Viterbi search for unit selection speech synthesis
    Campillo, F.
    Banga, E. R.
    ELECTRONICS LETTERS, 2011, 47 (16) : 937 - 938
  • [28] Determining the base frequency of the F0 contour generation model for the diverse expression of speech
    Arimoto, Yoshiko
    Horiuchi, Yasuo
    Ohno, Sumio
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (01) : 78 - 86
  • [29] Going beyond F0: The acquisition of Mandarin tones
    Rhee, Nari
    Chen, Aoju
    Kuang, Jianjing
    JOURNAL OF CHILD LANGUAGE, 2021, 48 (02) : 387 - 398
  • [30] F0 and Voice Quality of Coarticulated Mandarin Tones
    Huang, Yaqian
    LANGUAGE AND SPEECH, 2025,