Castsearch - Context based spoken document retrieval

被引:0
|
作者
Molgaard, Lasse Lohilahti [1 ]
Jorgensen, Kasper Winther [1 ]
Hansen, Lars Kai [1 ]
机构
[1] Tech Univ Denmark Richard Petersens Plads, Bldg 321, DK-2800 Lyngby, Denmark
关键词
audio retrieval; document clustering; non-negative matrix factorization; text mining;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper describes our work on the development of a system for retrieval of relevant stories from broadcast news. The system utilizes a combination of audio processing and text mining. The audio processing consists of a segmentation step that partitions the audio into speech and music. The speech is further segmented into speaker segments and then transcribed using an automatic speech recognition system, to yield text input for clustering using non-negative matrix factorization (NMF). We find semantic topics that are used to evaluate the performance for topic detection. Based on these topics we show that a novel query expansion can be performed to return more intelligent search results. We also show that the query expansion helps overcome errors of the automatic transcription.
引用
收藏
页码:93 / +
页数:2
相关论文
共 50 条
  • [31] ENHANCED BERT-BASED RANKING MODELS FOR SPOKEN DOCUMENT RETRIEVAL
    Lin, Hsiao-Yun
    Lo, Tien-Hong
    Chen, Berlin
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 601 - 606
  • [32] I-VECTOR BASED LANGUAGE MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Lee, Hung-Shin
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [33] Enhancing Query Formulation for Spoken Document Retrieval
    Chen, Berlin
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Yu, Kuen-Tyng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (03) : 553 - 569
  • [34] Phonetic Query Expansion for Spoken Document Retrieval
    Mamou, Jonathan
    Ramabhadran, Bhuvana
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2106 - +
  • [35] Extractive spoken document summarization for information retrieval
    Chen, Berlin
    Chen, Yi-Ting
    PATTERN RECOGNITION LETTERS, 2008, 29 (04) : 426 - 437
  • [36] Spoken document summarization and retrieval for wireless application
    Wu, CH
    Huang, CL
    Hsieh, CH
    2005 INTERNATIONAL CONFERENCE ON WIRELESS NETWORKS, COMMUNICATIONS AND MOBILE COMPUTING, VOLS 1 AND 2, 2005, : 1388 - 1393
  • [37] The Cambridge University spoken document retrieval system
    Johnson, SE
    Jourlin, P
    Moore, GL
    Jones, KS
    Woodland, PC
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 49 - 52
  • [38] Phonetic query expansion for spoken document retrieval
    Reyes-Barragan, Alejandro
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 57 - 64
  • [39] Cambridge University spoken document retrieval system
    Johnson, S.E.
    Jourlin, P.
    Moore, G.L.
    Sparck Jones, K.
    Woodland, P.C.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 49 - 52
  • [40] Spoken document retrieval for the languages of Hong Kong
    Meng, HM
    Hui, PY
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 201 - 204