Automatic Turn Segmentation in Spoken Conversations

被引:0
|
作者
Ivanov, Alexei V. [1 ]
Riccardi, Giuseppe [1 ]
机构
[1] Univ Trent, Dept Informat Engn & Comp Sci, Trento, Italy
关键词
spoken turn boundary; spoken dialogs; modulation spectrum; Bayesian information criterion; Kullback-Leibler divergence; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we have studied the problem of detecting the spoken turn boundaries in human-human spoken conversations. The automation of this task is essential to enable the analysis, recognition and understanding of the speech transcriptions and dialog structures (e.g. turn taking, dialog act segmentation etc.). The problem formulation is different from previous work on metadata extraction in that we work on the time domain for the detection of boundaries. This approach has the advantage of giving fine grain measures of speech events and does not rely on the automatic speech transcriptions. We have explored applicability of different algorithms for this task and have found that a hidden Markov model combining results of the modulation spectrum analysis and Kullback-Leibler divergence of adjacent signal portions produces the best results. The performance of the algorithms has been evaluated on the Switchboard conversational speech corpus.
引用
收藏
页码:3130 / 3133
页数:4
相关论文
共 50 条
  • [21] Of Timing, Turn-Taking, and Conversations
    Stephen J. Cowley
    Journal of Psycholinguistic Research, 1998, 27 : 541 - 571
  • [22] TIMING AND TURN TAKING IN CHILDRENS CONVERSATIONS
    GARVEY, C
    BERNINGER, G
    DISCOURSE PROCESSES, 1981, 4 (01) : 27 - 57
  • [23] Automatic Segmentation of Spoken Word Signals into Letters Based on Amplitude Variation for Speech to Text Transcription
    Roy, Anik
    Phadikar, Santanu
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, 2015, 340 : 621 - 628
  • [24] DENOTING SEGMENTATION IN SPOKEN TEXTS
    Sosnowska, Natalia
    ROCZNIKI HUMANISTYCZNE, 2009, 57 (06): : 189 - 200
  • [25] Utterance segmentation of spoken Chinese
    Wang, Haifeng
    Gao, Wen
    Li, Sheng
    Jisuanji Xuebao/Chinese Journal of Computers, 1999, 22 (10): : 1009 - 1013
  • [26] Reading Turn by Turn: Hierarchical Attention Architecture for Spoken Dialogue Comprehension
    Liu, Zhengyuan
    Chen, Nancy F.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5460 - 5466
  • [27] AUTOMATIC RECOGNITION OF SPOKEN DIGITS
    DAVIS, KH
    BIDDULPH, R
    BALASHEK, S
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (06): : 637 - 642
  • [28] AUTOMATIC RECOGNITION OF SPOKEN NUMERALS
    SEBESTYEN, G
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (11): : 1516 - 1517
  • [29] Recognition of Personality Traits from Human Spoken Conversations
    Ivanov, A. V.
    Riccardi, G.
    Sporka, A. J.
    Franc, J.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1560 - +
  • [30] A DEEP LEARNING APPROACH TO MODELING COMPETITIVENESS IN SPOKEN CONVERSATIONS
    Chowdhury, Shammur Absar
    Riccardi, Giuseppe
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5680 - 5684