Testing acoustic voice quality classification across languages and speech styles

被引:1
|
作者
Braun, Bettina [1 ]
Dehe, Nicole [1 ]
Einfeldt, Marieke [1 ]
Wochner, Daniela [1 ]
Zahner-Ritter, Katharina [2 ]
机构
[1] Univ Konstanz, Dept Linguist, Constance, Germany
[2] Univ Trier, Dept 2, Phonet, Trier, Germany
来源
关键词
voice quality; phonation type; acoustic measures; random forest; cross-linguistic generalization; infant-directed speech; German; Chinese; Icelandic; INFANT-DIRECTED SPEECH; PERCEPTION; EMOTION; BREATHY; FEMALE;
D O I
10.21437/Interspeech.2021-315
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many studies relate acoustic voice quality measures to perceptual classification. We extend this line of research by training a classifier on a balanced set of perceptually annotated voice quality categories with high inter-rater agreement, and test it on speech samples from a different language and on a different speech style. Annotations were done on continuous speech from different laboratory settings. In Experiment 1, we trained a random forest with Standard Chinese and German recordings labelled as modal, breathy, or glottalized. The model had an accuracy of 78.7% on unseen data from the same sample (most important variables were harmonics-to-noise ratio, cepstral-peak prominence, and H1-A2). This model was then used to classify data from a different language (Icelandic, Experiment 2) and to classify a different speech style (German infant-directed speech (IDS), Experiment 3). Cross-linguistic generalizability was high for Icelandic (78.6% accuracy), but lower for German IDS (71.7% accuracy). Accuracy of recordings of adult-directed speech from the same speakers as in Experiment 3 (77%, Experiment 4) suggests that it is the special speech style of IDS, rather than the recording setting that led to lower performance. Results are discussed in terms of efficiency of coding and generalizability across languages and speech styles.
引用
收藏
页码:3920 / 3924
页数:5
相关论文
共 50 条
  • [41] BIMODAL SPEECH-PERCEPTION - AN EXAMINATION ACROSS LANGUAGES
    MASSARO, DW
    COHEN, MM
    GESI, A
    HEREDIA, R
    TSUZAKI, M
    JOURNAL OF PHONETICS, 1993, 21 (04) : 445 - 478
  • [42] Aspectuality across languages: Event construal in speech and gesture
    Nikolaeva, Yulia V.
    VOPROSY YAZYKOZNANIYA, 2020, (04): : 132 - 140
  • [43] Speech Synthesis for Speaker Timbre Translation Across Languages
    Liu, Jiangfeng
    Guo, Yongbin
    Chen, Jinbiao
    Wang, Zixu
    Mao, Aihua
    2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 320 - 324
  • [44] ACOUSTIC CUES OF FEMALE VOICE QUALITY
    SATO, H
    ELECTRONICS & COMMUNICATIONS IN JAPAN, 1974, 57 (01): : 29 - 38
  • [45] Acoustic characteristics of the metallic voice quality
    Xavier Fadel, Congeta Bruniere
    Dassie-Leite, Ana Paula
    Santos, Rosane Sampaio
    Rosa, Marcelo de Oliveira
    Marques, Jair Mendes
    CODAS, 2015, 27 (01): : 97 - 100
  • [46] LOOK AT CLOZE TESTING ACROSS LANGUAGES AND LEVELS
    BRIERE, EJ
    CLAUSING, G
    SENKO, D
    PURCELL, E
    MODERN LANGUAGE JOURNAL, 1978, 62 (1-2): : 23 - 26
  • [47] The role of prosody and voice quality in indirect storytelling speech: A cross-narrator perspective in four European languages
    Montano, Raul
    Alias, Francesc
    SPEECH COMMUNICATION, 2017, 88 : 1 - 16
  • [48] Toward transfer of acoustic cues of emphasis across languages
    Tsiartas, Andreas
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth S.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3450 - 3453
  • [49] Relationship between acoustic measures of voice and judgments of voice quality
    Daniel Zaoming Huang
    中国眼耳鼻喉科杂志, 2000, (01) : 49 - 51
  • [50] Acoustic Voice Quality Index as a Potential Tool for Voice Screening
    Faharn, Maryam
    Laukkanen, Anne-Maria
    Ikavalko, Tero
    Rantala, Leena
    Geneid, Ahmed
    Holmqvist-Jamsen, Sofia
    Ruusuvirta, Kaarina
    Pirla, Sirpa
    JOURNAL OF VOICE, 2021, 35 (02) : 226 - 232