Hesitations in Spontaneous Speech: Acoustic Analysis and Detection

Cited by: 3
Authors
Verkhodanova, Vasilisa [1]
Shapranov, Vladimir [1]
Kipyatkova, Irina [1]
Affiliation
[1] SPIIRAS, St Petersburg, Russia
Source
Funding
Russian Foundation for Basic Research
Keywords
Speech disfluencies; Hesitations; Filled pauses; Lengthenings; Speech processing; Support vector machines
DOI
10.1007/978-3-319-66429-3_39
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Spontaneous speech differs from other types of speech in many ways, with speech disfluencies being its most prominent feature. These phenomena play an important role in communication but also cause problems for automatic speech processing. In this study we present the results of an acoustic analysis of the most frequent disfluencies, voiced hesitations (filled pauses and lengthenings), across different speaking styles in spontaneous Russian speech, as well as the results of experiments on their detection using an SVM classifier on a joint Russian and English spontaneous speech corpus. The acoustic analysis showed significant differences in the fundamental frequency and energy distribution ratios of hesitations and their contexts across speaking styles in Russian: compared to dialogues, speakers in monologues exhibit more prosodic cues for hesitations and their adjacent context. Experiments on detecting voiced hesitations with an SVM on the mixed-language, mixed-style corpus achieved an F1-score of 0.48 (0.55 for the Russian data only).
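For readers unfamiliar with the metric reported above, the following is a minimal sketch of how an F1-score is computed from detection counts. The function name and the counts are hypothetical and chosen only to illustrate the arithmetic; they are not taken from the paper, and the abstract does not say how the authors counted true and false detections.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return the F1-score (harmonic mean of precision and recall).

    tp: hesitations correctly detected (true positives)
    fp: detections that were not hesitations (false positives)
    fn: hesitations that were missed (false negatives)
    """
    if tp == 0:
        # No correct detections: precision and recall are both zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, not data from the study.
example = f1_score(tp=40, fp=10, fn=40)
print(f"precision=0.80, recall=0.50, F1={example:.3f}")
```

Because F1 is a harmonic mean, it is pulled toward the lower of precision and recall, so a single F1 value does not show which of the two limits a detector.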
Pages: 398-406
Number of pages: 9