Automatic Turn Segmentation in Spoken Conversations

Cited by: 0
Authors
Ivanov, Alexei V. [1]
Riccardi, Giuseppe [1]
Affiliations
[1] Univ Trent, Dept Informat Engn & Comp Sci, Trento, Italy
Keywords
spoken turn boundary; spoken dialogs; modulation spectrum; Bayesian information criterion; Kullback-Leibler divergence; SPEECH
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809
Abstract
In this paper we study the problem of detecting spoken turn boundaries in human-human spoken conversations. Automating this task is essential to enable the analysis, recognition and understanding of speech transcriptions and dialog structures (e.g. turn taking and dialog act segmentation). The problem formulation differs from previous work on metadata extraction in that we detect boundaries in the time domain. This approach has the advantage of giving fine-grained measures of speech events and does not rely on automatic speech transcriptions. We have explored the applicability of different algorithms for this task and have found that a hidden Markov model combining the results of modulation spectrum analysis and the Kullback-Leibler divergence of adjacent signal portions produces the best results. The performance of the algorithms has been evaluated on the Switchboard conversational speech corpus.
Pages: 3130-3133
Page count: 4