Combination of diverse subword units in spoken term detection

被引:0
|
作者
Lee, Shi-wook [1 ]
Tanaka, Kazuyo [2 ]
Itoh, Yoshiaki [3 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, Tokyo, Japan
[2] Univ Tsukuba, Tsukuba, Ibaraki 305, Japan
[3] Iwate Prefectural Univ, Takizawa, Iwate, Japan
关键词
spoken term detection; keyword search; system combination; phonetic recognition; diversity;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper focuses on the following two points: First, we try to clarify the effect of combination systems from two aspects, accuracy and heterogeneity. And then we evaluate our unique subword unit, called Sub-Phonetic Segment (SPS) to maximize performance improvement by combination. Combination systems usually yield higher performance than any individual system. When the systems being combined are individually accurate but also mutually heterogeneous, the improvement by combination can be maximized. From this consideration, we estimate heterogeneity by correlation of false alarm errors of combined systems and confirm that lower correlation of two systems yields the better performance improvement by combination. Comparative tests of several combination approaches are carried out on subword-based spoken term detection. Since subword-based systems use constrained linguistic knowledge, it is fairly straightforward to verify the heterogeneity of combined systems. Experimental results show that the most significant improvements can be achieved by combination of two different subword units, triphone and SPS, which are highly heterogeneous subword units with low correlation of false alarm detections.
引用
收藏
页码:3685 / 3689
页数:5
相关论文
共 50 条
  • [1] EFFECTIVE COMBINATION OF HETEROGENEOUS SUBWORD-BASED SPOKEN TERM DETECTION SYSTEMS
    Lee, Shi-wook
    Tanaka, Kazuyo
    Itoh, Yoshiaki
    2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 436 - 441
  • [2] Merging Search Spaces for Subword Spoken Term Detection
    Mertens, Timo
    Schneider, Daniel
    Koehler, Joachim
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2075 - +
  • [3] EFFICIENT SUBWORD LATTICE RETRIEVAL FOR GERMAN SPOKEN TERM DETECTION
    Mertens, Timo
    Schneider, Daniel
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4885 - +
  • [4] SUBWORD-BASED SPOKEN TERM DETECTION IN AUDIO COURSE LECTURES
    Rose, Richard
    Norouzian, Atta
    Reddy, Aarthi
    Coy, Andre
    Gupta, Vishwa
    Karafiat, Martin
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5282 - 5285
  • [5] SPOKEN TERM DETECTION USING DYNAMIC MATCH SUBWORD CONFUSION NETWORK
    Gao, Jie
    Shao, Jian
    Zhang, Qingqing
    Zhao, Qingwei
    Yan, Yonghong
    ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 4, PROCEEDINGS, 2008, : 250 - 254
  • [6] Vocabulary Independent Spoken Query: a Case for Subword Units
    Gouvea, Evandro
    Ezzat, Tony
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1680 - 1683
  • [7] Named Entity Recognition of Spoken Documents using Subword Units
    Paass, Gerhard
    Pilz, Anja
    Schwenninger, Jochen
    2009 IEEE THIRD INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2009), 2009, : 529 - 534
  • [8] Spoken Term Detection Results using Plural Subword Models by Estimating Detection Performance for Each Query
    Itoh, Yoshiaki
    Iwata, Kohei
    Ishigame, Masaaki
    Tanaka, Kazuyo
    Lee, Shi-wook
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2128 - 2131
  • [9] Spoken Term Detection from Bilingual Spontaneous Speech Using Code-switched Lattice-based Structures for Words and Subword Units
    Lee, Hung-Yi
    Tang, Yueh-Lien
    Tang, Hao
    Lee, Lin-Shan
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 410 - +
  • [10] Efficient System Combination for Chinese Spoken Term Detection
    Gao Jie
    Shao Jian
    Zhao Qingwei
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (03): : 457 - 462