Audio-based unsupervised segmentation of multiparty dialogue

被引：0

作者：

Hsueh, Pei-Yun ^{[1
]}

机构：

[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9WL, Midlothian, Scotland

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

meetings; clustering methods; acoustic signal processing;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we explore a novel way to leverage audio information for unsupervised segmentation of multiparty dialogue. Our system which segments directly on patterns derived from audio sources is evaluated with previous work that segments on lexical patterns found in transcripts. We examine the effectiveness of both systems on recovering a two-layer structure of meeting dialogue. We demonstrate that the audio-based system performs significantly better than the word-based system on this task. In particular, it effectively recover segments of off-topic discussion. Results are encouraging as the audio information used in the system can be obtained in near real time and with absence of manual and ASR transcripts. It is particularly desirable when a system has to be operated online, or in unfamiliar domains and languages.

引用

页码：5049 / 5052

页数：4

共 50 条

[31] AUDIO-BASED AUTOMATIC MANAGEMENT OF TV COMMERCIALS
Duxans, Helenca
Conejero, David
Anguera, Xavier
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1305 - 1308
[32] Audio-based event detection for sports video
Baillie, M
Jose, JM
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2003, 2728 : 300 - 309
[33] Audio-Based Wildfire Detection on Embedded Systems
Huang, Hung-Tien
Downey, Austin R. J.
Bakos, Jason D.
ELECTRONICS, 2022, 11 (09)
[34] Audio-Based Care for Managing Diabetes in Adults
Reddy, Shivani
Booth, Graham
Coker-Schwimmer, Manny
Kugley, Shannon
Rodriguez-Borja, Ivette
Patel, Sheila V.
Fujita, Miku
Philbrick, Sarah
Ruwala, Richa
Albritton, Jordan A.
Crotty, Karen
MEDICAL CARE, 2025, 63 (02) : 152 - 163
[35] Audio-based performance evaluation of squash players
Hajdu-Szucs, Katalin
Fenyvesi, Nora
Steger, Jozsef
Vattay, Gabor
PLOS ONE, 2018, 13 (03):
[36] Robust Audio-based Classification of Video Genre
Rouvier, Mickael
Linares, Georges
Matrouf, Driss
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1155 - 1158
[37] Event detection in an audio-based sensor network
Smeaton, Alan F.
McHugh, Michael
MULTIMEDIA SYSTEMS, 2006, 12 (03) : 179 - 194
[38] Navigating by audio-based probing and fuzzy routing
Polojarvi, Mikko
Saloranta, Timo
Riekki, Jukka
PROCEEDINGS OF THE 17TH INTERNATIONAL ACADEMIC MINDTREK CONFERENCE: MAKING SENSE OF CONVERGING MEDIA, 2013, : 87 - 94
[39] A Survey of Audio-Based Music Classification and Annotation
Fu, Zhouyu
Lu, Guojun
Ting, Kai Ming
Zhang, Dengsheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 303 - 319
[40] AUDIO-BASED DETECTION OF EXPLICIT CONTENT IN MUSIC
Vaglio, Andrea
Hennequin, Romain
Moussallam, Manuel
Richard, Gael
d'Alche-Buc, Florence
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 526 - 530

← 1 2 3 4 5 →