Combining key-phrase detection and subword-based verification for flexible speech understanding

被引:0
|
作者
Kawahara, T
Lee, CH
Juang, BH
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A flexible speech understanding framework combining key-phrase detection and verification is presented. Detection of semantically-tagged key-phrases directly leads to robust understanding. In order to select reliable detection and eliminate false alarms, utterance verification technique is incorporated. A phrase verifier combines subword-based likelihood ratios of correct models and anti-subword alternate models. A confidence measure that focuses on mis-matched subwords is proposed and demonstrated as the most effective. The combined strategy drastically improves the semantic accuracy for out-of-grammar utterances, while maintaining the performance for in-grammar samples. We also found that utterance verification applied after grammar-based decoding is not so effective as the proposed detection and verification strategy.
引用
收藏
页码:1159 / 1162
页数:4
相关论文
共 45 条
  • [1] Key-phrase detection and verification for flexible speech understanding
    Kawahara, T
    Lee, CH
    Juang, BH
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 861 - 864
  • [2] Flexible speech understanding based on combined key-phrase detection and verification
    Kyoto Univ, Kyoto, Japan
    [J]. IEEE Trans Speech Audio Process, 6 (558-568):
  • [3] Flexible speech understanding based on combined key-phrase detection and verification
    Kawahara, T
    Lee, CH
    Juang, BH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (06): : 558 - 568
  • [4] Topic independent language model for key-phrase detection and verification
    Kawahara, T
    Doshita, S
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 685 - 688
  • [5] Topic independent language model for key-phrase detection and verification
    Kawahara, Tatsuya
    Doshita, Shuji
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 2 : 685 - 688
  • [6] Discrimination power weighted subword-based speaker verification
    Chan, SM
    Siu, MH
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 45 - 48
  • [7] Phrase language models for detection and verification-based speech understanding
    Kawahara, T
    Doshita, S
    Lee, CH
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 49 - 56
  • [8] SUBWORD-BASED LARGE-VOCABULARY SPEECH RECOGNITION
    LEE, CH
    GAUVAIN, JL
    PIERACCINI, R
    RABINER, LR
    [J]. AT&T TECHNICAL JOURNAL, 1993, 72 (05): : 25 - 36
  • [9] TransKP: Transformer based Key-Phrase Extraction
    Rungta, Mukund
    Kumar, Rishabh
    Dhaliwal, Mehak Preet
    Tiwari, Hemant
    Vala, Vanraj
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] PERCEPTUAL AUDIO FEATURES FOR UNSUPERVISED KEY-PHRASE DETECTION
    von Zeddelmann, Dirk
    Kurth, Frank
    Mueller, Meinard
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 257 - 260