Audio-based unsupervised segmentation of multiparty dialogue

被引：0

作者：

Hsueh, Pei-Yun ^{[1
]}

机构：

[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9WL, Midlothian, Scotland

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

meetings; clustering methods; acoustic signal processing;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we explore a novel way to leverage audio information for unsupervised segmentation of multiparty dialogue. Our system which segments directly on patterns derived from audio sources is evaluated with previous work that segments on lexical patterns found in transcripts. We examine the effectiveness of both systems on recovering a two-layer structure of meeting dialogue. We demonstrate that the audio-based system performs significantly better than the word-based system on this task. In particular, it effectively recover segments of off-topic discussion. Results are encouraging as the audio information used in the system can be obtained in near real time and with absence of manual and ASR transcripts. It is particularly desirable when a system has to be operated online, or in unfamiliar domains and languages.

引用

页码：5049 / 5052

页数：4

共 50 条

[1] Sociometry based multiparty audio recordings segmentation
Vinciarelli, Alessandro
2006 IEEE International Conference on Multimedia and Expo - ICME 2006, Vols 1-5, Proceedings, 2006, : 1801 - 1804
[2] Audio-Based Onset Detection applied to Chewing Cycle Segmentation
Kopyto, David
Zhang, Rui
Amft, Oliver
IWSC'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2021, : 124 - 128
[3] Audio-based mindfulness stress reduction to audio-based relaxation treatment in a sample of couples
Adam, F.
Potet, A.
SEXOLOGIES, 2022, 31 (01) : 7 - 13
[4] Automatic topic segmentation and labeling in multiparty dialogue
Hsueh, Pei-Yun
Moore, Johanna D.
2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 98 - +
[5] Unsupervised Audio Segmentation based on Restricted Boltzmann Machines
Pikrakis, Aggelos
5TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS, IISA 2014, 2014, : 311 - 314
[6] Audio-based context recognition
Eronen, AJ
Peltonen, VT
Tuomi, JT
Klapuri, AP
Fagerlund, S
Sorsa, T
Lorho, G
Huopaniemi, J
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 321 - 329
[7] Developing an Audio-based Game
Im, Byoung Uk
Baek, Nakhoon
2014 INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2014,
[8] AUDIO-BASED CLASSIFICATION OF SPEAKER CHARACTERISTICS
Dutta, Promiti
Haubold, Alexander
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 422 - 425
[9] A REGULARIZED KERNEL-BASED APPROACH TO UNSUPERVISED AUDIO SEGMENTATION
Harchaoui, Zaid
Vallet, Felicien
Lung-Yut-Fong, Alexandre
Cappe, Olivier
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1665 - 1668
[10] Adaptive Audio-Based Context Recognition
Dargie, Waltenegus
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2009, 39 (04): : 715 - 725

← 1 2 3 4 5 →