A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL

被引:0
|
作者
Yen, Li-Phen [1 ]
Wu, Zhen-Yu [1 ]
Chen, Kuan-Yu [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Taipei, Taiwan
关键词
Spoken document retrieval; language model; language representations; INFORMATION-RETRIEVAL;
D O I
10.1109/icassp40776.2020.9054066
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent developments in deep learning have led to a significant innovation in various classic and practical subjects, including speech recognition, computer vision, question answering, information retrieval and so on. In the context of natural language processing (NLP), language representations learned by referring to autoregressive language modeling or autoencoding have shown giant successes in many downstream tasks, so the school of studies have become a major stream of research recently. Because the immenseness of multimedia data along with speech have spread around the world in our daily life, spoken document retrieval (SDR), which aims at retrieving relevant multimedia contents to satisfy users' queries, has become an important research subject in the past decades. Targeting on enhancing the SDR performance, the paper concentrates on proposing a neural retrieval framework, which assembles the merits of using language modeling (LM) mechanism in SDR and leveraging the abstractive information learned by the language representation models. Consequently, to our knowledge, this is a pioneer study on supervised training of a neural LM-based SDR framework, especially combined with the pretrained language representation methods. A series of empirical SDR experiments conducted on a benchmark collection demonstrate the good efficacy of the proposed framework, compared to several existing strong baseline systems.
引用
收藏
页码:8139 / 8143
页数:5
相关论文
共 50 条
  • [1] I-VECTOR BASED LANGUAGE MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Lee, Hung-Shin
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval
    Chen, Ying-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2889 - 2893
  • [3] NEURAL RELEVANCE-AWARE QUERY MODELING FOR SPOKEN DOCUMENT RETRIEVAL
    Lo, Tien-Hong
    Chen, Ying-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 466 - 473
  • [4] A LOCALITY-PRESERVING ESSENCE VECTOR MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Liu, Shih-Hung
    Chen, Berlin
    Wang, Hsin-Min
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5665 - 5669
  • [5] Spoken Document Retrieval With Unsupervised Query Modeling Techniques
    Chen, Berlin
    Chen, Kuan-Yu
    Chen, Pei-Ning
    Chen, Yi-Wen
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2602 - 2612
  • [6] Experiments in spoken document retrieval
    Sparck-Jones, K
    Jones, GJF
    Foote, JT
    Young, SJ
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1996, 32 (04) : 399 - 417
  • [7] An architecture for spoken document retrieval
    Terol, RM
    Martínez-Barco, P
    Palomar, M
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 505 - 511
  • [8] Exploring an Unsupervised, Language Independent, Spoken Document Retrieval System
    Caranica, Alexandru
    Cucu, Horia
    Buzo, Andi
    [J]. 2016 14TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2016,
  • [9] Language model expansion using webdata for spoken document retrieval
    Masumura, Ryo
    Hahm, Seongjun
    Ito, Akinori
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2144 - 2147
  • [10] New Approaches to Spoken Document Retrieval
    Martin Wechsler
    Eugen Munteanu
    Peter Schäuble
    [J]. Information Retrieval, 2000, 3 : 173 - 188