Hesitations in Spontaneous Speech: Acoustic Analysis and Detection

Cited by: 3
Authors
Verkhodanova, Vasilisa [1]
Shapranov, Vladimir [1]
Kipyatkova, Irina [1]
Affiliations
[1] SPIIRAS, St Petersburg, Russia
Source
Funding
Russian Foundation for Basic Research
Keywords
Speech disfluencies; Hesitations; Filled pauses; Lengthenings; Speech processing; Support vector machines
DOI
10.1007/978-3-319-66429-3_39
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Spontaneous speech differs from other types of speech in many ways, with speech disfluencies being its most prominent feature. These phenomena both play an important role in communication and cause problems for automatic speech processing. In this study we present the results of an acoustic analysis of the most frequent disfluencies, voiced hesitations (filled pauses and lengthenings), across different speaking styles in spontaneous Russian speech, as well as the results of experiments on their detection using an SVM classifier on a joint Russian and English spontaneous speech corpus. The acoustic analysis showed significant differences in the fundamental frequency and energy distribution ratios of hesitations and their contexts across speaking styles in Russian: compared to dialogues, speakers in monologues exhibit more prosodic cues for the adjacent context and hesitations. Experiments on the detection of voiced hesitations with an SVM on a mixed-language, mixed-style corpus achieved an F1-score of 0.48 (0.55 for the Russian data only).
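For readers unfamiliar with the metric reported above: the F1-score is the harmonic mean of precision and recall. The sketch below computes it from raw detection counts. The function name and the counts are hypothetical and chosen only for illustration; they are not taken from the paper, and the abstract does not say whether the authors scored detection per frame or per segment.

```python
def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """Return F1 = 2PR / (P + R) from true positives, false positives, false negatives."""
    if tp == 0:
        # No correct detections: precision and recall are both zero (or undefined),
        # so F1 is conventionally reported as 0.
        return 0.0
    precision = tp / (tp + fp)  # share of detected hesitations that are real
    recall = tp / (tp + fn)     # share of real hesitations that were detected
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts (NOT from the paper): 50 hits, 40 false alarms, 50 misses.
example_f1 = f1_from_counts(tp=50, fp=40, fn=50)
print(round(example_f1, 2))
```

Because F1 ignores true negatives, it remains informative when hesitations make up only a small fraction of the speech signal, a setting in which plain accuracy would be dominated by the non-hesitation class.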
Pages: 398-406
Number of pages: 9