A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL

被引:0
|
作者
Yen, Li-Phen [1 ]
Wu, Zhen-Yu [1 ]
Chen, Kuan-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Taipei, Taiwan
关键词
Spoken document retrieval; language model; language representations; INFORMATION-RETRIEVAL;
D O I
10.1109/icassp40776.2020.9054066
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent developments in deep learning have led to a significant innovation in various classic and practical subjects, including speech recognition, computer vision, question answering, information retrieval and so on. In the context of natural language processing (NLP), language representations learned by referring to autoregressive language modeling or autoencoding have shown giant successes in many downstream tasks, so the school of studies have become a major stream of research recently. Because the immenseness of multimedia data along with speech have spread around the world in our daily life, spoken document retrieval (SDR), which aims at retrieving relevant multimedia contents to satisfy users' queries, has become an important research subject in the past decades. Targeting on enhancing the SDR performance, the paper concentrates on proposing a neural retrieval framework, which assembles the merits of using language modeling (LM) mechanism in SDR and leveraging the abstractive information learned by the language representation models. Consequently, to our knowledge, this is a pioneer study on supervised training of a neural LM-based SDR framework, especially combined with the pretrained language representation methods. A series of empirical SDR experiments conducted on a benchmark collection demonstrate the good efficacy of the proposed framework, compared to several existing strong baseline systems.
引用
收藏
页码:8139 / 8143
页数:5
相关论文
共 50 条
  • [21] The CLEF 2003 cross-language spoken document retrieval track
    Federico, M
    Jones, GJF
    [J]. COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 646 - 652
  • [22] Statistical language models for query-by-example spoken document retrieval
    Lopez-Otero, Paula
    Parapar, Javier
    Barreiro, Alvaro
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (11-12) : 7927 - 7949
  • [23] SPOKEN DOCUMENT RETRIEVAL BY DISCRIMINATIVE MODELING IN A HIGH DIMENSIONAL FEATURE SPACE
    Oba, Takanobu
    Hori, Takaaki
    Nakamura, Atsushi
    Ito, Akinori
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5153 - 5156
  • [24] Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1195 - 1205
  • [25] ESSENCE VECTOR-BASED QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Chen, Berlin
    Wang, Hsin-Min
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6274 - 6278
  • [26] Semantic Indexing and Document Retrieval for Personalized Language Modeling
    Stas, Jan
    Hladek, Daniel
    Juhar, Jozef
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL SYMPOSIUM ELMAR, 2017, : 157 - 161
  • [27] Cluster-based Language Model for Spoken Document Retrieval Using NMF-Based Document Clustering
    Hu, Xinhui
    Isotani, Ryosuke
    Kawai, Hisashi
    Nakamura, Satoshi
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 705 - 708
  • [28] Spoken Document Retrieval for Oral Presentations Integrating Global Document Similarities into Local Document Similarities
    Nanjo, Hiroaki
    Iyonaga, Yusuke
    Yoshimi, Takehiko
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1285 - 1288
  • [29] Exeter at CLEF 2003: Cross-language spoken document retrieval experiments
    Jones, GJF
    Lam-Adesina, A
    [J]. COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 653 - 657
  • [30] Exeter at CLEF 2002: Cross-language spoken document retrieval experiments
    Jones, GJF
    Lam-Adesina, AM
    [J]. ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 458 - 475