Audio-based unsupervised segmentation of multiparty dialogue

被引:0
|
作者
Hsueh, Pei-Yun [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9WL, Midlothian, Scotland
关键词
meetings; clustering methods; acoustic signal processing;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we explore a novel way to leverage audio information for unsupervised segmentation of multiparty dialogue. Our system which segments directly on patterns derived from audio sources is evaluated with previous work that segments on lexical patterns found in transcripts. We examine the effectiveness of both systems on recovering a two-layer structure of meeting dialogue. We demonstrate that the audio-based system performs significantly better than the word-based system on this task. In particular, it effectively recover segments of off-topic discussion. Results are encouraging as the audio information used in the system can be obtained in near real time and with absence of manual and ASR transcripts. It is particularly desirable when a system has to be operated online, or in unfamiliar domains and languages.
引用
收藏
页码:5049 / 5052
页数:4
相关论文
共 50 条
  • [1] Sociometry based multiparty audio recordings segmentation
    Vinciarelli, Alessandro
    2006 IEEE International Conference on Multimedia and Expo - ICME 2006, Vols 1-5, Proceedings, 2006, : 1801 - 1804
  • [2] Audio-Based Onset Detection applied to Chewing Cycle Segmentation
    Kopyto, David
    Zhang, Rui
    Amft, Oliver
    IWSC'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, : 124 - 128
  • [3] Audio-based mindfulness stress reduction to audio-based relaxation treatment in a sample of couples
    Adam, F.
    Potet, A.
    SEXOLOGIES, 2022, 31 (01) : 7 - 13
  • [4] Automatic topic segmentation and labeling in multiparty dialogue
    Hsueh, Pei-Yun
    Moore, Johanna D.
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 98 - +
  • [5] Unsupervised Audio Segmentation based on Restricted Boltzmann Machines
    Pikrakis, Aggelos
    5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 311 - 314
  • [6] Audio-based context recognition
    Eronen, AJ
    Peltonen, VT
    Tuomi, JT
    Klapuri, AP
    Fagerlund, S
    Sorsa, T
    Lorho, G
    Huopaniemi, J
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 321 - 329
  • [7] Developing an Audio-based Game
    Im, Byoung Uk
    Baek, Nakhoon
    2014 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2014,
  • [8] AUDIO-BASED CLASSIFICATION OF SPEAKER CHARACTERISTICS
    Dutta, Promiti
    Haubold, Alexander
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 422 - 425
  • [9] A REGULARIZED KERNEL-BASED APPROACH TO UNSUPERVISED AUDIO SEGMENTATION
    Harchaoui, Zaid
    Vallet, Felicien
    Lung-Yut-Fong, Alexandre
    Cappe, Olivier
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1665 - 1668
  • [10] Adaptive Audio-Based Context Recognition
    Dargie, Waltenegus
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2009, 39 (04): : 715 - 725