Automatic Turn Segmentation in Spoken Conversations

被引:0
|
作者
Ivanov, Alexei V. [1 ]
Riccardi, Giuseppe [1 ]
机构
[1] Univ Trent, Dept Informat Engn & Comp Sci, Trento, Italy
关键词
spoken turn boundary; spoken dialogs; modulation spectrum; Bayesian information criterion; Kullback-Leibler divergence; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we have studied the problem of detecting the spoken turn boundaries in human-human spoken conversations. The automation of this task is essential to enable the analysis, recognition and understanding of the speech transcriptions and dialog structures (e.g. turn taking, dialog act segmentation etc.). The problem formulation is different from previous work on metadata extraction in that we work on the time domain for the detection of boundaries. This approach has the advantage of giving fine grain measures of speech events and does not rely on the automatic speech transcriptions. We have explored applicability of different algorithms for this task and have found that a hidden Markov model combining results of the modulation spectrum analysis and Kullback-Leibler divergence of adjacent signal portions produces the best results. The performance of the algorithms has been evaluated on the Switchboard conversational speech corpus.
引用
收藏
页码:3130 / 3133
页数:4
相关论文
共 50 条
  • [1] Automatic Word Segmentation for Spoken Cantonese
    Fung, Roxana
    Bigi, Brigitte
    2015 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2015 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2015, : 196 - 201
  • [2] Predicting User Satisfaction from Turn-Taking in Spoken Conversations
    Chowdhury, Shammur Absar
    Stepanov, Evgeny A.
    Riccardi, Giuseppe
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2910 - 2914
  • [3] Automatic story segmentation for spoken document retrieval
    Hui, PY
    Tang, XO
    Meng, HM
    Lam, W
    Gao, XB
    10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001, : 1319 - 1322
  • [4] SPOKEN MALAY LANGUAGE INFLUENCE ON AUTOMATIC TRANSCRIPTION AND SEGMENTATION
    Husni, Husniza
    Yusof, Yuhanis
    Kamaruddin, Siti Sakira
    COMPUTING & INFORMATICS, 4TH INTERNATIONAL CONFERENCE, 2013, 2013, : 133 - 137
  • [5] AUTOMATIC TURN SEGMENTATION FOR MOVIE & TV SUBTITLES
    Lison, Pierre
    Meena, Raveesh
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 245 - 252
  • [6] AUTOMATIC DETECTION OF CONFLICTS IN SPOKEN CONVERSATIONS: RATINGS AND ANALYSIS OF BROADCAST POLITICAL DEBATES
    Kim, Samuel
    Valente, Fabio
    Vinciarelli, Alessandro
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5089 - 5092
  • [7] Automatic grammar inference based on sentence segmentation for spoken Chinese
    Zhang, He
    Wu, Xiaojun
    Wang, Xiaodong
    Zheng, Fang
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2009, 49 (SUPPL. 1): : 1322 - 1327
  • [8] SIMULTANEOUS DIALOG ACT SEGMENTATION AND CLASSIFICATION FROM HUMAN-HUMAN SPOKEN CONVERSATIONS
    Quarteroni, Silvia
    Ivanov, Alexei V.
    Riccardi, Giuseppe
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5596 - 5599
  • [9] On the effects of automatic transcription and segmentation errors in Hungarian spoken language processing
    Tündik M.Á.
    Kaszás V.
    Szaszák G.
    Periodica polytechnica Electrical engineering and computer science, 2019, 63 (04): : 254 - 262
  • [10] Creating a Data Set of Abstractive Summaries of Turn-labeled Spoken Human-Computer Conversations
    Meijer, Virginia
    Hendrickx, Iris
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2236 - 2244