Phrase language models for detection and verification-based speech understanding

被引:0
|
作者
Kawahara, T [1 ]
Doshita, S [1 ]
Lee, CH [1 ]
机构
[1] Kyoto Univ, Dept Informat Sci, Kyoto 606, Japan
关键词
D O I
10.1109/ASRU.1997.658977
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a phrase language model that has two key features. First, the model is oriented for robust understanding of unconstrained speech. Second, it does not need a large task-specific training corpus. The basic idea is that we focus on the stable and significant patterns of variable-length phrase expressions rather than uniformly modeling the whole utterances, and then classify them into task-dependent portions and task-independent ones. While the task-dependent key-phrases are trained with a small amount of task-specific data, the task-independent model is constructed with other large corpora that are not necessarily related to the current task. The task-independent model extracts expressions specific to the dialogue style rather than the task domain, and complements the task-dependent key-phrase model to enhance the detection and verification performance.
引用
下载
收藏
页码:49 / 56
页数:8
相关论文
共 50 条
  • [1] Flexible speech understanding based on combined key-phrase detection and verification
    Kyoto Univ, Kyoto, Japan
    IEEE Trans Speech Audio Process, 6 (558-568):
  • [2] Flexible speech understanding based on combined key-phrase detection and verification
    Kawahara, T
    Lee, CH
    Juang, BH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (06): : 558 - 568
  • [3] Key-phrase detection and verification for flexible speech understanding
    Kawahara, T
    Lee, CH
    Juang, BH
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 861 - 864
  • [4] Combining key-phrase detection and subword-based verification for flexible speech understanding
    Kawahara, T
    Lee, CH
    Juang, BH
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1159 - 1162
  • [5] Speaker Verification-Based Evaluation of Single-Channel Speech Separation
    Maciejewski, Matthew
    Watanabe, Shinji
    Khudanpur, Sanjeev
    INTERSPEECH 2021, 2021, : 3520 - 3524
  • [6] Deriving phrase-based language models
    Heeman, PA
    Damnati, G
    1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 41 - 48
  • [7] Topic independent language model for key-phrase detection and verification
    Kawahara, T
    Doshita, S
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 685 - 688
  • [8] Topic independent language model for key-phrase detection and verification
    Kawahara, Tatsuya
    Doshita, Shuji
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 2 : 685 - 688
  • [9] UAV Attack Detection and Mitigation Using a Localization Verification-Based Autoencoder
    Aladi, Ahmed
    Alsusa, Emad
    IEEE ACCESS, 2023, 11 : 117752 - 117764
  • [10] VERIFICATION-BASED PAIRWISE GAIT IDENTIFICATION
    Tong, Suibing
    Fu, Yuzhuo
    Ling, Hefei
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,