The effect of F0 contour on the intelligibility of speech in the presence of interfering sounds for Mandarin Chinese

Cited by: 14

Authors:

Chen, Jing [1,2]

Yang, Hongying [1,2]

Wu, Xihong [1,2]

Moore, Brian C. J. [3]

Affiliations:

[1] Peking Univ, Speech & Hearing Res Ctr, Dept Machine Intelligence, Beijing 100871, Peoples R China

[2] Peking Univ, Minist Educ, Key Lab Machine Percept, Beijing 100871, Peoples R China

[3] Univ Cambridge, Dept Psychol, Cambridge CB2 3EB, England

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2018 / Vol. 143 / Issue 02

Funding:

National Natural Science Foundation of China;

Keywords:

FLATTENED FUNDAMENTAL-FREQUENCY; PERCEIVED SPATIAL SEPARATION; INFORMATIONAL MASKING; PERCEPTION; LANGUAGE; SEGREGATION; MODULATION; RECOGNITION; INTONATION; RELEASE;

DOI:

10.1121/1.5023218

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

In Mandarin Chinese, the fundamental frequency (F0) contour defines lexical "Tones" that differ in meaning despite being phonetically identical. Flattening the F0 contour impairs the intelligibility of Mandarin Chinese in background sounds. This might occur because the flattening introduces misleading lexical information. To avoid this effect, two types of speech were used: single-Tone speech contained Tones 1 and 0 only, which have a flat F0 contour; multi-Tone speech contained all Tones and had a varying F0 contour. The intelligibility of speech in steady noise was slightly better for single-Tone speech than for multi-Tone speech. The intelligibility of speech in a two-talker masker, with the difference in mean F0 between the target and masker matched across conditions, was worse for the multi-Tone target in the multi-Tone masker than for any other combination of target and masker, probably because informational masking was maximal for this combination. The introduction of a perceived spatial separation between the target and masker, via the precedence effect, led to better performance for all target-masker combinations, especially the multi-Tone target in the multi-Tone masker. In summary, a flat F0 contour does not reduce the intelligibility of Mandarin Chinese when the introduction of misleading lexical cues is avoided. © 2018 Acoustical Society of America.


Pages: 864-877

Page count: 14

