EFFICIENT SYSTEM COMBINATION FOR SYLLABLE-CONFUSION-NETWORK-BASED CHINESE SPOKEN TERM DETECTION

Cited by: 0
Authors
Gao, Jie [1 ]
Zhao, Qingwei [1 ]
Yan, Yonghong [1 ]
Shao, Jian [2 ]
Affiliations
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
Keywords
syllable confusion network; Chinese spoken term detection; system combination; speech indexing;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
This paper examines system combination for syllable-confusion-network (SCN)-based Chinese spoken term detection (STD). System combination for STD usually improves accuracy but increases index size or complicates the index structure. This paper explores methods for efficiently combining a word-based system and a syllable-based system while keeping the indices compact. First, a composite SCN is generated using two approaches: lattice combination (the SCN is generated from a combined lattice) and confusion network combination (two SCNs are combined into one). Then a simple, compact index is constructed from this composite SCN by merging information that is redundant across the systems. Experimental results on a 60-hour corpus show a relative accuracy improvement of 14.7% over the baseline syllable-based system. Moreover, the proposed method reduces the index size by 22.3% compared with the commonly adopted score combination method while achieving comparable accuracy.
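The abstract does not give the combination algorithm itself, so the following is only a minimal sketch of the general idea behind confusion network combination, under several assumptions that are not taken from the paper: each SCN is a list of slots, each slot is a mapping from syllable to posterior, the two SCNs are already aligned slot by slot, and posteriors are merged by linear interpolation. The names `combine_slots`, `combine_scns`, and `weight_a` are hypothetical.

```python
from collections import defaultdict


def combine_slots(slot_a, slot_b, weight_a=0.5):
    """Merge two aligned slots (syllable -> posterior) into one.

    Assumption: linear interpolation with weight_a; the paper's actual
    merging rule is not stated in the abstract.
    """
    merged = defaultdict(float)
    for syllable, posterior in slot_a.items():
        merged[syllable] += weight_a * posterior
    for syllable, posterior in slot_b.items():
        # A syllable present in both systems collapses into a single entry,
        # which is where cross-system redundancy disappears.
        merged[syllable] += (1.0 - weight_a) * posterior
    total = sum(merged.values())
    if total <= 0:
        return {}
    # Renormalize so the merged slot is again a probability distribution.
    return {syllable: p / total for syllable, p in merged.items()}


def combine_scns(scn_a, scn_b, weight_a=0.5):
    """Combine two SCNs that are assumed to be aligned slot by slot."""
    if len(scn_a) != len(scn_b):
        raise ValueError("this sketch assumes equal-length, pre-aligned SCNs")
    return [combine_slots(a, b, weight_a) for a, b in zip(scn_a, scn_b)]


# Toy, made-up posteriors for illustration only.
word_based = [{"zhong": 0.9, "zong": 0.1}, {"guo": 0.8, "ge": 0.2}]
syllable_based = [{"zhong": 0.7, "chong": 0.3}, {"guo": 0.6, "gou": 0.4}]
composite = combine_scns(word_based, syllable_based)
```

In this toy case each composite slot holds three entries, whereas keeping the two systems' slots side by side would store four, which illustrates why merging shared hypotheses can keep a combined index compact.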
Pages: 366-369
Number of pages: 4