Castsearch - Context based spoken document retrieval

被引：0

作者：

Molgaard, Lasse Lohilahti ^{[1
]}

Jorgensen, Kasper Winther ^{[1
]}

Hansen, Lars Kai ^{[1
]}

机构：

[1] Tech Univ Denmark Richard Petersens Plads, Bldg 321, DK-2800 Lyngby, Denmark

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

audio retrieval; document clustering; non-negative matrix factorization; text mining;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF). We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.

引用

页码：93 / +

页数：2

共 50 条

[21] Phonetic recognition for spoken document retrieval
Ng, K
Zue, VW
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 325 - 328
[22] Probabilistic aspects in spoken document retrieval
Macherey, W. (w.macherey@informatik.rwth-aachen.de), 1600, Hindawi Publishing Corporation (2003):
[23] New approaches to spoken document retrieval
Wechsler, M
Munteanu, E
Schäuble, P
INFORMATION RETRIEVAL, 2000, 3 (03): : 173 - 188
[24] A novel approach to perform context‐based automatic spoken document retrieval of political speeches based on wavelet tree indexing
Anishka Gupta
Divakar Yadav
Multimedia Tools and Applications, 2021, 80 : 22209 - 22229
[25] SPEECHFIND: Spoken document retrieval for a national gallery of the spoken word
Hansen, JHL
Huang, RQ
Mangalath, P
Zhou, B
Seadle, M
Deller, JR
NORSIG 2004: PROCEEDINGS OF THE 6TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2004, 46 : 1 - 4
[26] A novel approach to perform context-based automatic spoken document retrieval of political speeches based on wavelet tree indexing
Gupta, Anishka
Yadav, Divakar
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 22209 - 22229
[27] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
Yen, Li-Phen
Wu, Zhen-Yu
Chen, Kuan-Yu
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
[28] GMM Adaptation based Online Speaker Segmentation for Spoken Document Retrieval
Park, Kyungmi
Park, Jeong-sik
Oh, Yung-Hwan
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 1123 - 1129
[29] Posterior probability based indexing method for Chinese spoken document retrieval
Zheng, Tie-Ran
Han, Ji-Qing
Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2009, 41 (08): : 97 - 102
[30] ESSENCE VECTOR-BASED QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
Chen, Kuan-Yu
Liu, Shih-Hung
Chen, Berlin
Wang, Hsin-Min
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6274 - 6278

← 1 2 3 4 5 →