Discriminative segmental cues to vowel height and consonantal place and voicing in whispered speech

被引:2
|
作者
Jesus, Luis M. T. [1 ,6 ]
Castilho, Sara [2 ]
Ferreira, Anibal [3 ]
Costa, Maria Conceicao [4 ,5 ]
机构
[1] Univ Aveiro, Inst Elect & Informat Engn Aveiro IEETA, Sch Hlth Sci ESSUA, Intelligent Syst Associate Lab LASI, Aveiro, Portugal
[2] Hosp Arcebispo Joao Crisostomo, Cantanhede, Portugal
[3] Univ Porto, Dept Elect & Comp Engn, Porto, Portugal
[4] Univ Aveiro, Dept Math DMat, Aveiro, Portugal
[5] Univ Aveiro, Ctr Res & Dev Math & Applicat CIDMA, Aveiro, Portugal
[6] Univ Aveiro, Campus Univ Santiago, P-3810193 Aveiro, Portugal
关键词
Speech production; Acoustic phonetics; Whispered speech; Vowels; Fricatives; ACOUSTIC ANALYSIS SYSTEMS; FORMANT FREQUENCIES; AMERICAN ENGLISH; PITCH; FRICATIVES; RECONSTRUCTION; CONFIGURATION; LARYNGECTOMY; PERCEPTION; DURATIONS;
D O I
10.1016/j.wocn.2023.101223
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Purpose: The acoustic signal attributes of whispered speech potentially carry sufficiently distinct information to define vowel spaces and to disambiguate consonant place and voicing, but what these attributes are and the underlying production mechanisms are not fully known. The purpose of this study was to define segmental cues to place and voicing of vowels and sibilant fricatives and to develop an articulatory interpretation of acoustic data.Method: Seventeen speakers produced sustained sibilants and oral vowels, disyllabic words, sentences and read a phonetically balanced text. All the tasks were repeated in voiced and whispered speech, and the sound source and filter analysed using the following parameters: Fundamental frequency, spectral peak frequencies and levels, spectral slopes, sound pressure level and durations. Logistic linear mixed-effects models were developed to understand what acoustic signal attributes carry sufficiently distinct information to disambiguate /i, a/ and /s, ?/.Results: Vowels were produced with significantly different spectral slope, sound pressure level, first and second formant frequencies in voiced and whispered speech. The low frequencies spectral slope of voiced sibilants was significantly different between whispered and voiced speech. The odds of choosing /a/ instead of /i/ were esti-mated to be lower for whispered speech when compared to voiced speech. Fricatives' broad peak frequency was statistically significant when discriminating between /s/ and /?/.Conclusions: First formant frequency and relative duration of vowels are consistently used as height cues, and spectral slope and broad peak frequency are attributes associated with consonantal place of articulation. The rel-ative duration of same-place voiceless fricatives was higher than voiced fricatives both in voiced and whispered speech. The evidence presented in this paper can be used to restore voiced speech signals, and to inform reha-bilitation strategies that can safely explore the production mechanisms of whispering.CO 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http:// creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:21
相关论文
共 7 条
  • [1] Identification of words in whispered speech: The role of cues to fricatives' place and voicing
    Jesus, Luis M. T.
    Ferreira, Joana F. S.
    Ferreira, Anibal J. S.
    [J]. JASA EXPRESS LETTERS, 2023, 3 (08):
  • [2] On vowel height and consonantal voicing effects: Data from Italian
    Esposito, A
    [J]. PHONETICA, 2002, 59 (04) : 197 - 231
  • [3] PERCEPTION OF VOICING AND PLACE FEATURES IN WHISPERED SPEECH - DICHOTIC CHOICE ANALYSIS
    ALLEN, J
    HAGGARD, M
    [J]. PERCEPTION & PSYCHOPHYSICS, 1977, 21 (04): : 315 - 322
  • [4] Segmental cues to intonation of statements and polar questions in whispered, semi-whispered and normal speech modes
    Zygis, Marzena
    Pape, Daniel
    Koenig, Laura L.
    Jaskula, Marek
    Jesus, Luis M. T.
    [J]. JOURNAL OF PHONETICS, 2017, 63 : 53 - 74
  • [5] SEPARABILITY OF PLACE AND VOICING CUES FOR INITIAL CONSONANTS IN NATURAL SPEECH
    SOLI, SD
    STRANGE, W
    JENKINS, JJ
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 61 : S47 - S48
  • [6] Prosodic effects on acoustic cues to stop voicing and place of articulation: Evidence from Radio News speech
    Cole, Jennifer
    Kim, Heejin
    Choi, Hansook
    Hasegawa-Johnson, Mark
    [J]. JOURNAL OF PHONETICS, 2007, 35 (02) : 180 - 209
  • [7] Modeling source-tract interaction in speech production: Voicing onset vs. vowel height after a voiceless obstruent
    Lucero, Jorge C.
    Koenig, Laura L.
    Fuchs, Susanne
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2195 - 2198