Castsearch - Context based spoken document retrieval

被引：0

作者：

Molgaard, Lasse Lohilahti ^{[1
]}

Jorgensen, Kasper Winther ^{[1
]}

Hansen, Lars Kai ^{[1
]}

机构：

[1] Tech Univ Denmark Richard Petersens Plads, Bldg 321, DK-2800 Lyngby, Denmark

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

audio retrieval; document clustering; non-negative matrix factorization; text mining;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF). We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.

引用

页码：93 / +

页数：2

共 50 条

[1] IMPROVING PHONEME-BASED SPOKEN DOCUMENT RETRIEVAL WITH PHONETIC CONTEXT EXPANSION
Olivier, Le Blouch
Collen, Patrice
2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 1217 - 1220
[2] SEMANTIC QUERY EXPANSION AND CONTEXT-BASED DISCRIMINATIVE TERM MODELING FOR SPOKEN DOCUMENT RETRIEVAL
Tu, Tsung-wei
Lee, Hung-yi
Chou, Yu-yu
Lee, Lin-shan
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5085 - 5088
[3] Experiments in spoken document retrieval
Jones, K.Sparck
Jones, G.J.F.
Foote, J.T.
Young, S.J.
Information Processing and Management, 1996, 32 (04): : 399 - 417
[4] An architecture for spoken document retrieval
Terol, RM
Martínez-Barco, P
Palomar, M
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511
[5] Experiments in spoken document retrieval
Sparck-Jones, K
Jones, GJF
Foote, JT
Young, SJ
INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 399 - 417
[6] A Soundex-Based Approach for Spoken Document Retrieval
Alejandro Reyes-Barragan, M.
Villasenor-Pineda, Luis
Montes-y-Gomez, Manuel
MICAI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5317 : 204 - 211
[7] Spoken Document Retrieval Based on Approximated Sequence Alignment
Comas, Pere R.
Turmo, Jordi
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 285 - 292
[8] Subword-based approaches for spoken document retrieval
Ng, K
Zue, VW
SPEECH COMMUNICATION, 2000, 32 (03) : 157 - 186
[9] Statistical Lattice-Based Spoken Document Retrieval
Chia, Tee Kiah
Sim, Khe Chai
Li, Haizhou
Ng, Hwee Tou
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (01)
[10] Spoken Document Retrieval System based on Phonemic Transcribing
Tatarinova, Alexandra
Prozorov, Dmitriy
2017 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2017,

← 1 2 3 4 5 →