ENHANCING QUERY EXPANSION FOR SEMANTIC RETRIEVAL OF SPOKEN CONTENT WITH AUTOMATICALLY DISCOVERED ACOUSTIC PATTERNS

被引:0
|
作者
Lee, Hung-yi [1 ]
Li, Yun-Chiao [2 ]
Chung, Cheng-Tao [3 ]
Lee, Lin-shan [2 ,3 ]
机构
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
[3] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
关键词
Query Expansion; Acoustic Pattern Discovery;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Query expansion techniques were originally developed for text information retrieval in order to retrieve the documents not containing the query terms but semantically related to the query. This is achieved by assuming the terms frequently occurring in the top-ranked documents in the first-pass retrieval results to be query-related and using them to expand the query to do the second-pass retrieval. However, when this approach was used for spoken content retrieval, the inevitable recognition errors and the OOV problems in ASR make it difficult for many query-related terms to be included in the expanded query, and much of the information carried by the speech signal is lost during recognition and not recoverable. In this paper, we propose to use a second ASR engine based on acoustic patterns automatically discovered from the spoken archive used for retrieval. These acoustic patterns are discovered directly based on the signal characteristics, and therefore can compensate for the information lost during recognition to a good extent. When a text query is entered, the system generates the first-pass retrieval results based on the transcriptions of the spoken segments obtained via the conventional ASR. The acoustic patterns frequently occurring in the spoken segments ranked on top of the first-pass results are considered as query-related, and the spoken segments containing these query-related acoustic patterns are retrieved. In this way, even though some query-related terms are OOV or incorrectly recognized, the segments including these terms can still be retrieved by acoustic patterns corresponding to these terms. Preliminary experiments performed on Mandarin broadcast news offered very encouraging results.
引用
收藏
页码:8297 / 8301
页数:5
相关论文
共 50 条
  • [1] TOWARDS UNSUPERVISED SEMANTIC RETRIEVAL OF SPOKEN CONTENT WITH QUERY EXPANSION BASED ON AUTOMATICALLY DISCOVERED ACOUSTIC PATTERNS
    Li, Yun-Chiao
    Lee, Hung-yi
    Chung, Cheng-Tao
    Chan, Chun-an
    Lee, Lin-shan
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 198 - 203
  • [2] Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk Over Acoustic Similarity Graphs
    Lee, Hung-Yi
    Lee, Lin-Shan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) : 80 - 94
  • [3] ENHANCING AUTOMATICALLY DISCOVERED MULTI-LEVEL ACOUSTIC PATTERNS CONSIDERING CONTEXT CONSISTENCY WITH APPLICATIONS IN SPOKEN TERM DETECTION
    Chung, Cheng-Tao
    Hsu, Wei-Ning
    Lee, Cheng-Yi
    Lee, Lin-Shan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5231 - 5235
  • [4] Enhancing electronic medical record retrieval through semantic query expansion
    Hemant Jain
    Cheng Thao
    Huimin Zhao
    [J]. Information Systems and e-Business Management, 2012, 10 : 165 - 181
  • [5] Enhancing electronic medical record retrieval through semantic query expansion
    Jain, Hemant
    Thao, Cheng
    Zhao, Huimin
    [J]. INFORMATION SYSTEMS AND E-BUSINESS MANAGEMENT, 2012, 10 (02) : 165 - 181
  • [6] Enhancing Query Formulation for Spoken Document Retrieval
    Chen, Berlin
    Chen, Yi-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Yu, Kuen-Tyng
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (03) : 553 - 569
  • [7] Phonetic Query Expansion for Spoken Document Retrieval
    Mamou, Jonathan
    Ramabhadran, Bhuvana
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2106 - +
  • [8] Phonetic query expansion for spoken document retrieval
    Reyes-Barragan, Alejandro
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 57 - 64
  • [9] On the Effectiveness of Contextualisation Techniques in Spoken Query Spoken Content Retrieval
    Racca, David N.
    Jones, Gareth J. F.
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 933 - 936
  • [10] SEMANTIC QUERY EXPANSION AND CONTEXT-BASED DISCRIMINATIVE TERM MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Tu, Tsung-wei
    Lee, Hung-yi
    Chou, Yu-yu
    Lee, Lin-shan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5085 - 5088