Exploring an Unsupervised, Language Independent, Spoken Document Retrieval System

被引:0
|
作者
Caranica, Alexandru [1 ]
Cucu, Horia [1 ]
Buzo, Andi [1 ]
机构
[1] Univ Politehn Bucuresti, Speech & Dialogue SpeeD Res Lab, Bucharest, Romania
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the increasing availability of spoken documents in different languages, there is a need of systems performing automatic and unsupervised search on audio streams, containing speech, in a document retrieval scenario. We are interested in retrieving information from multilingual speech data, from spoken documents such as broadcast news, video archives or even telephone conversations. The ultimate goal of a Spoken Document Retrieval System is to enable vocabulary-independent search over large collections of speech content, to find written or spoken "queries" or reoccurring speech data. If the language is known, the task is relatively simple. One could use a large vocabulary continuous speech recognition (LVCSR) tool to produce highly accurate word transcripts, which are then indexed and query terms are retrieved from the index. However, if the language is unknown, hence queries are not part of the recognizer`s vocabulary, the relevant audio documents cannot be retrieved. Thus, search metrics are affected, and documents retrieved are no longer relevant to the user. In this paper we investigate whether the use of input features derived from multi-language resources helps the process of unsupervised spoken term detection, independent of the language. Moreover, we explore the use of multi objective search, by combining both language detection and LVCSR based search, with unsupervised Spoken Term Detection (STD). In order to achieve this, we make use of multiple open-source tools and in-house acoustic and language models, to propose a language independent spoken document retrieval system.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval
    Chen, Ying-Wen
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2889 - 2893
  • [2] Spoken Document Retrieval With Unsupervised Query Modeling Techniques
    Chen, Berlin
    Chen, Kuan-Yu
    Chen, Pei-Ning
    Chen, Yi-Wen
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (09): : 2602 - 2612
  • [3] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
    Yen, Li-Phen
    Wu, Zhen-Yu
    Chen, Kuan-Yu
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
  • [4] Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1195 - 1205
  • [5] The Cambridge University spoken document retrieval system
    Johnson, SE
    Jourlin, P
    Moore, GL
    Jones, KS
    Woodland, PC
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 49 - 52
  • [6] Cambridge University spoken document retrieval system
    Johnson, S.E.
    Jourlin, P.
    Moore, G.L.
    Sparck Jones, K.
    Woodland, P.C.
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 49 - 52
  • [7] Language model expansion using webdata for spoken document retrieval
    Masumura, Ryo
    Hahm, Seongjun
    Ito, Akinori
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2144 - 2147
  • [8] Improving Spoken Document Retrieval by. Unsupervised Language Model Adaptation Using Utterance-based Web Search
    Herms, Robert
    Ritter, Marc
    Wilhelm-Stein, Thomas
    Eibl, Maximilian
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1430 - 1433
  • [9] Independent retrieval of number and grammatical gender in spoken language production
    Leek, E. C.
    Schiemenz, S.
    Roberts, J. R.
    Jones, E. Wyn
    Thomas, E.
    Gathercole, V. C.
    Tainturier, M. J.
    [J]. BRAIN AND LANGUAGE, 2007, 103 (1-2) : 63 - 64
  • [10] RWTH speech recognition system and spoken document retrieval
    RWTH Aachen - Univ of Technology, Aachen, Germany
    [J]. IECON Proc, 1600, (2022-2027):