EFFICIENT SUBWORD LATTICE RETRIEVAL FOR GERMAN SPOKEN TERM DETECTION

被引:12
|
作者
Mertens, Timo [1 ,2 ]
Schneider, Daniel [2 ]
机构
[1] NTNU, Dept Elect & Telecommun, Trondheim, Norway
[2] Fraunhofer IAIS, Schloss Birlinghoven, St Augustin 53754, Germany
关键词
spoken term detection; spoken document retrieval; speech recognition; speech search;
D O I
10.1109/ICASSP.2009.4960726
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a lattice-based STD method for German broadcast news data and compare it to a previously proposed fuzzy search. Due to the important out-of-vocabulary (OOV) problem in German, we evaluate suitable subword indexing units for lattice retrieval. Hybrid lattice retrieval of words and subwords is investigated because of the robust nature of words as an indexing unit. We show that by using efficient lattice graph and score pruning techniques, precision of subword retrieval is increased by 8% absolute with only a small loss in recall. Additionally, a speed-up of up to 6 times can be observed.
引用
收藏
页码:4885 / +
页数:2
相关论文
共 50 条
  • [41] EXPLOITING DIVERSITY FOR SPOKEN TERM DETECTION
    Mangu, Lidia
    Soltau, Hagen
    Kuo, Hong-Kwang
    Kingsbury, Brian
    Saon, George
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8282 - 8286
  • [42] Optimization of Spoken Term Detection System
    Wang, Chuanxu
    Zhang, Pengyuan
    JOURNAL OF APPLIED MATHEMATICS, 2012,
  • [43] Semantically Expanded Spoken Term Detection
    Kozhirbayev, Zhanibek
    Yessenbayev, Zhandos
    IEEE ACCESS, 2024, 12 : 177844 - 177855
  • [44] Multilingual spoken term detection: a review
    Deekshitha, G.
    Mary, Leena
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 653 - 667
  • [45] ASSESSING SEARCH TERM STRENGTH IN SPOKEN TERM DETECTION
    Torbati, Amir Hossein Harati Nejad
    Picone, Joe
    2013 IEEE INTERNATIONAL MULTI-DISCIPLINARY CONFERENCE ON COGNITIVE METHODS IN SITUATION AWARENESS AND DECISION SUPPORT (COGSIMA), 2013, : 114 - 117
  • [46] Model-Based Unsupervised Spoken Term Detection with Spoken Queries
    Chan, Chun-an
    Lee, Lin-shan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07): : 1330 - 1342
  • [47] A New Syllable-lattice Based Approach for Mandarin Spoken Document Retrieval
    Zhang, Lei
    Gao, Yunxia
    Xiang, Xuezhi
    Lu, Dong
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 1175 - 1178
  • [48] Two-stage Vocabulary-free Spoken Document Retrieval - Subword Identification and Re-recognition of the Identified Sections -
    Itoh, Yoshiaki
    Otake, Takayuki
    Iwata, Kohei
    Kojima, Kazunori
    Ishigame, Masaaki
    Tanaka, Kazuyo
    Lee, Shi-wook
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1161 - +
  • [49] AUTOMATIC TOPIC DETECTION STRATEGY FOR INFORMATION RETRIEVAL IN SPOKEN DOCUMENT
    Jin, Shan
    Misra, Hemant
    Sikora, Thomas
    Jose, Joemon
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 300 - +
  • [50] EFFICIENT SYSTEM COMBINATION FOR SYLLABLE-CONFUSION-NETWORK-BASED CHINESE SPOKEN TERM DETECTION
    Gao, Jie
    Zhao, Qingwei
    Yan, Yonghong
    Shao, Jian
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 366 - 369