Hesitations in Spontaneous Speech: Acoustic Analysis and Detection

被引:3
|
作者
Verkhodanova, Vasilisa [1 ]
Shapranov, Vladimir [1 ]
Kipyatkova, Irina [1 ]
机构
[1] SPIIRAS, St Petersburg, Russia
来源
基金
俄罗斯基础研究基金会;
关键词
Speech disfluencies; Hesitations; Filled pauses; Lengthenings; Speech processing; Support vector machines; FILLED PAUSES;
D O I
10.1007/978-3-319-66429-3_39
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spontaneous speech is different from any other type of speech in many ways, with speech disfluencies being the prominent feature. These phenomena both play an important role in communication, and also cause problems for automatic speech processing. In this study we present the results of acoustic analysis of the most frequent disfluencies voiced hesitations (filled pauses and lengthenings) across different speaking styles in spontaneous Russian speech, as well as results of experiments on their detection using SVM classifier on a joint Russian and English spontaneous speech corpus. Results of acoustic analysis showed significant differences in fundamental frequency and energy distribution ratios of hesitations and their contexts across speaking styles in Russian: comparing to the dialogues, in monologues speakers exhibit more prosodic cues for the adjacent context and hesitations. Experiments on detection of voiced hesitations on a mixed language and style corpus with SVM resulted in achieving F1-score = 0.48 (With F1-score = 0.55 for only Russian data).
引用
收藏
页码:398 / 406
页数:9
相关论文
共 50 条
  • [31] Syllable detection in read and spontaneous speech
    Pfitzinger, HR
    Burger, S
    Heid, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1261 - 1264
  • [32] Speech/Non-Speech Detection in Malay Language Spontaneous Speech
    Izzad, M.
    Jamil, Nursuriati
    Abu Bakar, Zainab
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, MANAGEMENT AND TELECOMMUNICATIONS (COMMANTEL), 2013, : 219 - 224
  • [33] Using Syllables as Acoustic Units for Spontaneous Speech Recognition
    Hejtmanek, Jan
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 299 - 305
  • [34] Improving Acoustic Models for Russian Spontaneous Speech Recognition
    Prudnikov, Alexey
    Medennikov, Ivan
    Mendelev, Valentin
    Korenevsky, Maxim
    Khokhlov, Yuri
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 234 - 242
  • [35] Contribution of acoustic analysis to the detection of vocoid epenthesis in apraxia of speech and other motor speech disorders
    Bourqui, Marion
    Pernon, Michaela
    Fougeron, Cecile
    Laganaro, Marina
    APHASIOLOGY, 2022, 36 (07) : 854 - 867
  • [36] Deep Learning for Acoustic Irony Classification in Spontaneous Speech
    Gent, Helen
    Adams, Chase
    Shih, Chilin
    Tang, Yan
    INTERSPEECH 2022, 2022, : 3993 - 3997
  • [37] Automatic detection of Parkinson's disease based on acoustic analysis of speech
    Braga, Diogo
    Madureira, Ana M.
    Coelho, Luis
    Ajith, Reuel
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 77 : 148 - 158
  • [38] Combining NLP techniques and acoustic analysis for semantic focus detection in speech
    Beke, Andras
    Szaszak, Gyoergy
    2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 493 - 497
  • [39] HESITATIONS IN CHILDERN'S SPEECH: EVIDENCE OF LANGUAGE COMPLEXITY
    Chacon, Lourenco
    Sampaio Villega, Cristyane De Camargo
    CADERNOS DE ESTUDOS LINGUISTICOS, 2012, 54 (01): : 81 - 95
  • [40] Automated Acoustic Analysis in Detection of Spontaneous Swallows in Parkinson's Disease
    Golabbakhsh, Marzieh
    Rajaei, Ali
    Derakhshan, Mahmoud
    Sadri, Saeed
    Taheri, Masoud
    Adibi, Peyman
    DYSPHAGIA, 2014, 29 (05) : 572 - 577