EFFICIENT SYSTEM COMBINATION FOR SYLLABLE-CONFUSION-NETWORK-BASED CHINESE SPOKEN TERM DETECTION

Cited by: 0
Authors
Gao, Jie [1]
Zhao, Qingwei [1]
Yan, Yonghong [1]
Shao, Jian [2]
Affiliations
[1] Chinese Acad Sci, Inst Acoust, ThinkIT Speech Lab, Beijing, Peoples R China
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
Keywords
syllable confusion network; Chinese spoken term detection; system combination; speech indexing
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper examines system combination for syllable-confusion-network (SCN)-based Chinese spoken term detection (STD). System combination for STD usually improves accuracy but comes at the cost of a larger index or a more complicated index structure. This paper explores methods for efficiently combining a word-based system and a syllable-based system while keeping the indices compact. First, a composite SCN is generated using two approaches: lattice combination (the SCN is generated from a combined lattice) and confusion network combination (two SCNs are combined into one). Then a simple, compact index is constructed from this composite SCN by merging cross-system redundant information. Experimental results on a 60-hour corpus show a relative accuracy improvement of 14.7% over the baseline syllable-based system. Meanwhile, the method reduces the index size by 22.3% compared with the commonly adopted score combination method while achieving comparable accuracy.
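The confusion network combination and index merging described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes the two SCNs are already aligned slot by slot, represents each slot as a hypothetical dictionary mapping a syllable to its posterior, and combines posteriors by linear interpolation with an assumed weight `weight_a`. The function names and the toy data are invented for illustration only.

```python
from collections import defaultdict


def combine_scns(scn_a, scn_b, weight_a=0.5):
    """Interpolate two slot-aligned confusion networks into one composite network.

    Assumption: both networks have the same number of slots and slot i of one
    network covers the same speech segment as slot i of the other.
    """
    if len(scn_a) != len(scn_b):
        raise ValueError("this sketch assumes slot-aligned networks")
    combined = []
    for slot_a, slot_b in zip(scn_a, scn_b):
        slot = {}
        # A syllable present in both systems becomes a single merged entry.
        for syllable in set(slot_a) | set(slot_b):
            slot[syllable] = (weight_a * slot_a.get(syllable, 0.0)
                              + (1.0 - weight_a) * slot_b.get(syllable, 0.0))
        combined.append(slot)
    return combined


def build_index(scn):
    """Build an inverted index: syllable -> list of (slot position, posterior)."""
    index = defaultdict(list)
    for position, slot in enumerate(scn):
        for syllable, posterior in slot.items():
            index[syllable].append((position, posterior))
    return dict(index)


def index_size(index):
    """Count the postings stored in an inverted index."""
    return sum(len(postings) for postings in index.values())


# Toy networks (hypothetical data): one derived from a word-based system,
# one from a syllable-based system.
word_scn = [{"bei": 0.9, "pei": 0.1}, {"jing": 0.8, "qing": 0.2}]
syl_scn = [{"bei": 0.7, "mei": 0.3}, {"jing": 0.6, "jin": 0.4}]

composite = combine_scns(word_scn, syl_scn)
composite_index = build_index(composite)

# Score combination would keep one index per system instead.
separate_size = index_size(build_index(word_scn)) + index_size(build_index(syl_scn))
```

In this toy case the composite index stores fewer postings than the two separate indices together, because syllables hypothesized by both systems in the same slot are stored once. The figures reported in the abstract come from the paper's own experiments and are not reproduced by this sketch.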
Pages: 366-369
Page count: 4
Related papers
50 in total
  • [31] An approach for efficient open vocabulary spoken term detection
    Norouzian, Atta
    Rose, Richard
    SPEECH COMMUNICATION, 2014, 57 : 50 - 62
  • [32] Rescoring by a Deep Neural Network for Spoken Term Detection
    Konno, Ryota
    Kojima, Kazunori
    Tanaka, Kazuyo
    Lee, Shi-wook
    Itoh, Yoshiaki
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1207 - 1211
  • [33] Fusing Multiple Confidence Measures for Chinese Spoken Term Detection
    Ma, Zejun
    Wang, Xiaorui
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1936 - 1939
  • [34] Robust Spoken Term Detection Using Combination of Phone-Based and Word-Based Recognition
    Iwata, Kenji
    Shinoda, Koichi
    Furui, Sadaoki
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2195 - 2198
  • [35] Syllable-based Chinese text/spoken document retrieval using text/speech queries
    Bai, BR
    Chen, BL
    Wang, HM
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2000, 14 (05) : 603 - 616
  • [36] Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination
    Chen, Xiao
    Ke, Dengfeng
    Xu, Bo
    PRACTICAL APPLICATIONS OF INTELLIGENT SYSTEMS, ISKE 2013, 2014, 279 : 1069 - 1078
  • [37] Model-Based Unsupervised Spoken Term Detection with Spoken Queries
    Chan, Chun-an
    Lee, Lin-shan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (07) : 1330 - 1342
  • [38] Neural Network Based End-to-End Query by Example Spoken Term Detection
    Ram, Dhananjay
    Miculicich, Lesly
    Bourlard, Herve
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1416 - 1427
  • [39] EFFICIENT SUBWORD LATTICE RETRIEVAL FOR GERMAN SPOKEN TERM DETECTION
    Mertens, Timo
    Schneider, Daniel
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 4885 - +
  • [40] The SRI/OGI 2006 Spoken Term Detection System
    Vergyri, Dimitra
    Shafran, Izhak
    Stolcke, Andreas
    Gadde, Ramana R.
    Akbacak, Murat
    Roark, Brian
    Wang, Wen
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2836 - +