Study of Entity-Topic Models for OOV Proper Name Retrieval

被引:0
|
作者
Sheikh, Imran [1 ]
Illina, Irina
Fohr, Dominique
机构
[1] Univ Lorraine, LORIA, UMR 7503, F-54506 Vandoeuvre Les Nancy, France
关键词
proper names; OOV; topic models; LVCSR;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Retrieving Proper Names (PNs) relevant to an audio document can improve speech recognition and content based audio -video indexing. Latent Dirichlet Allocation (LDA) topic model has been used to retrieve Out-Of-Vocabulary (OOV) PNs relevant to an audio document with good recall rates. However, retrieval of OOV PNs using LDA is affected by two issues, which we study in this paper: (1) Word Frequency Bias (less frequent OOV PNs are ranked lower); (2) Loss of Specificity (the reduced topic space representation loses lexical context). Entity-Topic models have been proposed as extensions of LDA to specifically learn relations between words, entities (PNs) and topics. We study OOV PN retrieval with Entity-Topic models and show that they are also affected by word frequency bias and loss of specificity. We evaluate our proposed methods for rare OOV PN re-ranking and lexical context re-ranking for LDA as well as for Entity Topic models. The results show an improvement in both Recall and the Mean Average Precision.
引用
收藏
页码:1344 / 1348
页数:5
相关论文
共 50 条
  • [31] Enhancing Relational Topic Models with Named Entity Induced Links
    Kuhr, Felix
    Lichtenberger, Mathis
    Braun, Tanya
    Moeller, Ralf
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2021), 2021, : 314 - 317
  • [32] DIFFERENT WORD REPRESENTATIONS AND THEIR COMBINATION FOR PROPER NAME RETRIEVAL FROM DIACHRONIC DOCUMENTS
    Illina, Irina
    Fohr, Dominique
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 1 - 7
  • [33] Assessment of the Quality of Topic Models for Information Retrieval Applications
    Yuan, Meng
    Lin, Pauline
    Rashidi, Lida
    Zobel, Justin
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 265 - 274
  • [34] Probabilistic Topic Models for Text Data Retrieval and Analysis
    Zhai, ChengXiang
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1399 - 1401
  • [35] Exploiting Temporal Topic Models in Social Media Retrieval
    Tran, Tuan A.
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 999 - 999
  • [36] Topic signature language models for ad hoc retrieval
    Zhou, Xiaohua
    Hu, Xiaohua
    Zhang, Xiaodan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (09) : 1276 - 1287
  • [37] Combining probability models and web mining models: a framework for proper name transliteration
    Yilu Zhou
    Feng Huang
    Hsinchun Chen
    Information Technology and Management, 2008, 9 : 91 - 103
  • [38] Combining probability models and web mining models: a framework for proper name transliteration
    Zhou, Yilu
    Huang, Feng
    Chen, Hsinchun
    INFORMATION TECHNOLOGY & MANAGEMENT, 2008, 9 (02): : 91 - 103
  • [39] Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations
    Akbacak, Murat
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1486 - 1495
  • [40] Fusiform Gyrus Phospho-Tau is Associated with Failure of Proper Name Retrieval in Aging
    Tennant, Victoria R.
    Harrison, Theresa M.
    Adams, Jenna N.
    La Joie, Renaud
    Winer, Joseph R.
    Jagust, William J.
    ANNALS OF NEUROLOGY, 2021, 90 (06) : 988 - 993