EFFICIENT SYSTEM COMBINATION FOR SYLLABLE-CONFUSION-NETWORK-BASED CHINESE SPOKEN TERM DETECTION

Cited by: 0
Authors
Gao, Jie [1]
Zhao, Qingwei [1]
Yan, Yonghong [1]
Shao, Jian [2]
Affiliations
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
Keywords
syllable confusion network; Chinese spoken term detection; system combination; speech indexing
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper examines system combination for syllable-confusion-network (SCN)-based Chinese spoken term detection (STD). System combination for STD usually improves accuracy but suffers from an increased index size or a more complicated index structure. This paper explores methods for efficiently combining a word-based system and a syllable-based system while keeping the indices compact. First, a composite SCN is generated using one of two approaches: lattice combination (the SCN is generated from a combined lattice) or confusion network combination (two SCNs are combined into one). Then a simple, compact index is constructed from this composite SCN by merging information that is redundant across the two systems. Experimental results on a 60-hour corpus show a relative accuracy improvement of 14.7% over the baseline syllable-based system. Meanwhile, the proposed method reduces the index size by 22.3% compared to the commonly adopted score combination method while achieving comparable accuracy.
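The confusion network combination idea in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: it assumes the two SCNs are already aligned slot by slot, represents each slot as a hypothetical `{syllable: posterior}` dictionary, and uses an illustrative interpolation weight `weight_a`. It only shows why merging syllables shared by both systems yields a smaller index than keeping two separate indices.

```python
from collections import defaultdict


def combine_scns(scn_a, scn_b, weight_a=0.5):
    """Merge two slot-aligned SCNs into one composite SCN.

    Each SCN is a list of slots; each slot maps a syllable to its posterior.
    Assumption (not from the paper): slots are already aligned one to one,
    and posteriors are linearly interpolated with weight_a.
    """
    if len(scn_a) != len(scn_b):
        raise ValueError("this sketch assumes the two SCNs are slot-aligned")
    combined = []
    for slot_a, slot_b in zip(scn_a, scn_b):
        merged = defaultdict(float)
        # A syllable present in both systems collapses into a single entry.
        for syllable, posterior in slot_a.items():
            merged[syllable] += weight_a * posterior
        for syllable, posterior in slot_b.items():
            merged[syllable] += (1.0 - weight_a) * posterior
        combined.append(dict(merged))
    return combined


def index_size(scn):
    """Count index entries as one (slot, syllable) pair per arc."""
    return sum(len(slot) for slot in scn)


# Toy data: hypothetical SCNs from a word-based and a syllable-based system.
word_scn = [{"bei": 0.9, "pei": 0.1}, {"jing": 0.8, "qing": 0.2}]
syl_scn = [{"bei": 0.7, "mei": 0.3}, {"jing": 0.6, "jin": 0.4}]

composite = combine_scns(word_scn, syl_scn)
separate_entries = index_size(word_scn) + index_size(syl_scn)
composite_entries = index_size(composite)
```

In this toy case the two separate indices hold 8 entries in total, while the composite SCN holds 6, because "bei" and "jing" appear in both systems and are stored once.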
Pages: 366-369
Page count: 4