Audio-based unsupervised segmentation of multiparty dialogue

Cited by: 0
Author
Hsueh, Pei-Yun [1]
Affiliation
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9WL, Midlothian, Scotland
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
meetings; clustering methods; acoustic signal processing
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper, we explore a novel way to leverage audio information for unsupervised segmentation of multiparty dialogue. Our system, which segments directly on patterns derived from audio sources, is evaluated against previous work that segments on lexical patterns found in transcripts. We examine the effectiveness of both systems in recovering a two-layer structure of meeting dialogue. We demonstrate that the audio-based system performs significantly better than the word-based system on this task. In particular, it effectively recovers segments of off-topic discussion. The results are encouraging because the audio information used in the system can be obtained in near real time and in the absence of manual and ASR transcripts. This is particularly desirable when a system has to operate online, or in unfamiliar domains and languages.
Pages: 5049-5052
Number of pages: 4
Related papers
50 in total
  • [31] AUDIO-BASED AUTOMATIC MANAGEMENT OF TV COMMERCIALS
    Duxans, Helenca
    Conejero, David
    Anguera, Xavier
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 1305 - 1308
  • [32] Audio-based event detection for sports video
    Baillie, M
    Jose, JM
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2003, 2728 : 300 - 309
  • [33] Audio-Based Wildfire Detection on Embedded Systems
    Huang, Hung-Tien
    Downey, Austin R. J.
    Bakos, Jason D.
    ELECTRONICS, 2022, 11 (09)
  • [34] Audio-Based Care for Managing Diabetes in Adults
    Reddy, Shivani
    Booth, Graham
    Coker-Schwimmer, Manny
    Kugley, Shannon
    Rodriguez-Borja, Ivette
    Patel, Sheila V.
    Fujita, Miku
    Philbrick, Sarah
    Ruwala, Richa
    Albritton, Jordan A.
    Crotty, Karen
    MEDICAL CARE, 2025, 63 (02) : 152 - 163
  • [35] Audio-based performance evaluation of squash players
    Hajdu-Szucs, Katalin
    Fenyvesi, Nora
    Steger, Jozsef
    Vattay, Gabor
    PLOS ONE, 2018, 13 (03)
  • [36] Robust Audio-based Classification of Video Genre
    Rouvier, Mickael
    Linares, Georges
    Matrouf, Driss
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1155 - 1158
  • [37] Event detection in an audio-based sensor network
    Smeaton, Alan F.
    McHugh, Michael
    MULTIMEDIA SYSTEMS, 2006, 12 (03) : 179 - 194
  • [38] Navigating by audio-based probing and fuzzy routing
    Polojarvi, Mikko
    Saloranta, Timo
    Riekki, Jukka
    PROCEEDINGS OF THE 17TH INTERNATIONAL ACADEMIC MINDTREK CONFERENCE: MAKING SENSE OF CONVERGING MEDIA, 2013, : 87 - 94
  • [39] A Survey of Audio-Based Music Classification and Annotation
    Fu, Zhouyu
    Lu, Guojun
    Ting, Kai Ming
    Zhang, Dengsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) : 303 - 319
  • [40] AUDIO-BASED DETECTION OF EXPLICIT CONTENT IN MUSIC
    Vaglio, Andrea
    Hennequin, Romain
    Moussallam, Manuel
    Richard, Gael
    d'Alche-Buc, Florence
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 526 - 530