Utterance verification using prosodic information for Mandarin telephone speech keyword spotting

Cited by: 2
Authors
Chen, YJ [1 ]
Wu, CH [1 ]
Yan, GL [1 ]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
DOI
10.1109/ICASSP.1999.759762
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper, prosodic information, a distinctive and important feature of Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 INITIAL's and 37 FINAL's in Mandarin speech, and one background/silence model are used as the basic recognition units. For utterance verification, 12 anti-subsyllable HMM's, 175 context-dependent prosodic HMM's, and five anti-prosodic HMM's are constructed. A keyword verification function combining phonetic-phase and prosodic-phase verification is investigated. Using a test set of 2400 conversational speech utterances from 20 speakers (12 males and 8 females), at 8.5% false rejection, the proposed verification method resulted in a 17.8% false alarm rate. Furthermore, this method was able to correctly reject 90.4% of nonkeywords. Comparison with a baseline system without prosodic-phase verification shows that the prosodic information can benefit the verification performance.
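As a rough illustration of the ideas in the abstract, the sketch below combines a phonetic-phase score and a prosodic-phase score into a single verification score, then measures false rejection and false alarm rates at a decision threshold. The abstract does not give the actual verification function, so the linear weighting, the `weight` parameter, and the function names `combined_score` and `error_rates` are all assumptions made for illustration only, not the authors' method.

```python
from typing import Sequence, Tuple


def combined_score(phonetic: float, prosodic: float, weight: float = 0.5) -> float:
    """Hypothetical linear combination of two verification scores.

    `weight` is an assumed tuning parameter; the paper's actual
    combination function is not stated in the abstract.
    """
    return weight * phonetic + (1.0 - weight) * prosodic


def error_rates(
    keyword_scores: Sequence[float],
    nonkeyword_scores: Sequence[float],
    threshold: float,
) -> Tuple[float, float]:
    """Return (false_rejection_rate, false_alarm_rate) at a threshold.

    A hypothesis is accepted when its score is >= threshold.
    False rejection: a true keyword scored below the threshold.
    False alarm: a nonkeyword scored at or above the threshold.
    """
    false_rejections = sum(s < threshold for s in keyword_scores)
    false_alarms = sum(s >= threshold for s in nonkeyword_scores)
    return (
        false_rejections / len(keyword_scores),
        false_alarms / len(nonkeyword_scores),
    )
```

Sweeping `threshold` over a range of values traces the trade-off between the two error types; an operating point such as the one reported in the abstract corresponds to a single threshold on that curve.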
Pages: 697 - 700
Number of pages: 4
Related papers
50 in total
  • [1] Utterance verification for spontaneous Mandarin speech keyword spotting
    Xin, L
    Wang, BX
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C397 - C401
  • [2] Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion
    Laxmi Pandey
    Rajesh M. Hegde
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 2767 - 2791
  • [3] Keyword Spotting in Continuous Speech Using Spectral and Prosodic Information Fusion
    Pandey, Laxmi
    Hegde, Rajesh M.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (06) : 2767 - 2791
  • [4] An utterance verification algorithm in keyword spotting system
    Dai, HS
    Zhu, XY
    Luo, YP
    Yang, SY
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 555 - 561
  • [5] Fast Keyword Spotting in Telephone Speech
    Nouza, Jan
    Silovsky, Jan
    [J]. RADIOENGINEERING, 2009, 18 (04) : 665 - 670
  • [6] A new keyword spotting approach for spontaneous Mandarin speech
    Zhang, Pengyuan
    Han, Jiang
    Shao, Jian
    Yan, Yonghong
    [J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 764 - +
  • [7] Fusion of Spectral and Prosodic Information using Combined Error Optimization for Keyword Spotting
    Pandey, Laxmi
    Chaudhary, Kuldeep
    Hegde, Rajesh M.
    [J]. 2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [8] Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech
    Wang, Yiyan
    Long, Yanhua
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 374 - 378
  • [9] Integration of phonetic and prosodic information for robust utterance verification
    Wu, CH
    Chen, YJ
    Yan, GL
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (01) : 55 - 61
  • [10] THE 2016 BBN GEORGIAN TELEPHONE SPEECH KEYWORD SPOTTING SYSTEM
    Alumae, Tanel
    Karakos, Damianos
    Hartmann, William
    Hsiao, Roger
    Zhang, Le
    Long Nguyen
    Tsakalidis, Stavros
    Schwartz, Richard
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5755 - 5759