Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk Over Acoustic Similarity Graphs

被引:8
|
作者
Lee, Hung-Yi [1 ]
Lee, Lin-Shan [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10617, Taiwan
关键词
Document expansion; latent semantic analysis; query expansion; random walk; spoken content retrieval; TERM DETECTION; INFORMATION-RETRIEVAL; SYSTEMS;
D O I
10.1109/TASLP.2013.2285469
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In a text context, document/query expansion has proven very useful in retrieving objects semantically related to the query. However, when applying text-based techniques on spoken content, the inevitable recognition errors seriously degrade performance even when the retrieval process is performed over lattices. We propose the estimation of more accurate term distributions ( or unigram language models) for the spoken documents by acoustic similarity graphs. In this approach, a graph is constructed for each term describing the acoustic similarity among all signal regions hypothesized to be the considered term. Score propagation based on a random walk over the graph offers more reliable scores of the term hypotheses, which in turn yield more accurate term distributions ( or unigram language models). This approach was applied with the language modeling retrieval approach, including using document expansion based on latent topic analysis and query expansion with a query-regularized mixture model. We extend these approaches from words to subword n-grams, and the query expansion from document-level to utterance-level and from term-based to topic-based. Experiments performed on Mandarin broadcast news showed improved performance under almost all tested conditions.
引用
收藏
页码:80 / 94
页数:15
相关论文
共 19 条
  • [1] ENHANCING QUERY EXPANSION FOR SEMANTIC RETRIEVAL OF SPOKEN CONTENT WITH AUTOMATICALLY DISCOVERED ACOUSTIC PATTERNS
    Lee, Hung-yi
    Li, Yun-Chiao
    Chung, Cheng-Tao
    Lee, Lin-shan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8297 - 8301
  • [2] Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing
    Crestani, F
    [J]. TECHNOLOGIES FOR CONSTRUCTING INTELLIGENT SYSTEMS 1: TASKS, 2002, 89 : 363 - 375
  • [3] IMPROVED SEMANTIC RETRIEVAL OF SPOKEN CONTENT BY LANGUAGE MODELS ENHANCED WITH ACOUSTIC SIMILARITY GRAPH
    Lee, Hung-yi
    Wen, Tsung-Hsien
    Lee, Lin-Shan
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 182 - 187
  • [4] Phonetic Query Expansion for Spoken Document Retrieval
    Mamou, Jonathan
    Ramabhadran, Bhuvana
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2106 - +
  • [5] Phonetic query expansion for spoken document retrieval
    Reyes-Barragan, Alejandro
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (47): : 57 - 64
  • [6] TOWARDS UNSUPERVISED SEMANTIC RETRIEVAL OF SPOKEN CONTENT WITH QUERY EXPANSION BASED ON AUTOMATICALLY DISCOVERED ACOUSTIC PATTERNS
    Li, Yun-Chiao
    Lee, Hung-yi
    Chung, Cheng-Tao
    Chan, Chun-an
    Lee, Lin-shan
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 198 - 203
  • [7] Effects of Query Expansion for Spoken Document Passage Retrieval
    Akiba, Tomoyosi
    Honda, Koichiro
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2148 - 2151
  • [8] SEMANTIC QUERY EXPANSION AND CONTEXT-BASED DISCRIMINATIVE TERM MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Tu, Tsung-wei
    Lee, Hung-yi
    Chou, Yu-yu
    Lee, Lin-shan
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5085 - 5088
  • [9] Query based biomedical document retrieval for clinical information access with the semantic similarity
    Gupta S.
    Sharaff A.
    Nagwani N.K.
    [J]. Multimedia Tools and Applications, 2024, 83 (18) : 55305 - 55317
  • [10] Spoken document retrieval: Acoustic variability over the past 100 years
    Hansen, JHL
    [J]. 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2005, : 6 - 7