Castsearch - Context based spoken document retrieval

被引:0
|
作者
Molgaard, Lasse Lohilahti [1 ]
Jorgensen, Kasper Winther [1 ]
Hansen, Lars Kai [1 ]
机构
[1] Tech Univ Denmark Richard Petersens Plads, Bldg 321, DK-2800 Lyngby, Denmark
关键词
audio retrieval; document clustering; non-negative matrix factorization; text mining;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF). We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.
引用
收藏
页码:93 / +
页数:2
相关论文
共 50 条
  • [21] Phonetic recognition for spoken document retrieval
    Ng, K
    Zue, VW
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 325 - 328
  • [22] Probabilistic aspects in spoken document retrieval
    Macherey, W. (w.macherey@informatik.rwth-aachen.de), 1600, Hindawi Publishing Corporation (2003):
  • [23] New approaches to spoken document retrieval
    Wechsler, M
    Munteanu, E
    Schäuble, P
    INFORMATION RETRIEVAL, 2000, 3 (03): : 173 - 188
  • [24] A novel approach to perform context‐based automatic spoken document retrieval of political speeches based on wavelet tree indexing
    Anishka Gupta
    Divakar Yadav
    Multimedia Tools and Applications, 2021, 80 : 22209 - 22229
  • [25] SPEECHFIND: Spoken document retrieval for a national gallery of the spoken word
    Hansen, JHL
    Huang, RQ
    Mangalath, P
    Zhou, B
    Seadle, M
    Deller, JR
    NORSIG 2004: PROCEEDINGS OF THE 6TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2004, 46 : 1 - 4
  • [26] A novel approach to perform context-based automatic spoken document retrieval of political speeches based on wavelet tree indexing
    Gupta, Anishka
    Yadav, Divakar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 22209 - 22229
  • [27] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
    Yen, Li-Phen
    Wu, Zhen-Yu
    Chen, Kuan-Yu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
  • [28] GMM Adaptation based Online Speaker Segmentation for Spoken Document Retrieval
    Park, Kyungmi
    Park, Jeong-sik
    Oh, Yung-Hwan
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 1123 - 1129
  • [29] Posterior probability based indexing method for Chinese spoken document retrieval
    Zheng, Tie-Ran
    Han, Ji-Qing
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2009, 41 (08): : 97 - 102
  • [30] ESSENCE VECTOR-BASED QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Chen, Berlin
    Wang, Hsin-Min
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6274 - 6278