From Stochastic Speech Recognition to Understanding: An HMM-Based approach

被引:1
|
作者
Boda, PP [1 ]
机构
[1] Nokia Res Ctr, Speech & Audio Syst Lab, FIN-33721 Tampere, Finland
关键词
D O I
10.1109/ASRU.1997.658979
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents results achieved with an HMM-based Stochastic Speech Understanding approach. The applied embedded training Is directly adapted from Continuous Speech Recognition and utilises transcribed text corpus without explicit time alignments. The proposed method is tested on two databases, one in English, the other one in Finnish, from two different demonstration applications (Currency Inquiry and City Bus Timetable Inquiry systems). The results indicate the applicability of the proposed method and show that semantically relevant parts of input queries can be identified with a 5-8% error rate on the semantic unit and 13-20% error rate on the sentence level. The segmentation capability of the approach indicates that the system is capable of exploring the meaningful parts of the queries on an unsupervised fashion.
引用
收藏
页码:57 / 64
页数:8
相关论文
共 50 条
  • [1] An HMM-based speech recognition IC
    Han, W
    Hon, KW
    Chan, CF
    Lee, T
    Choy, CS
    Pun, KP
    Ching, PC
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 744 - 747
  • [2] A maximum model distance approach for HMM-based speech recognition
    Kwong, S
    He, QH
    Man, KF
    Tang, KS
    [J]. PATTERN RECOGNITION, 1998, 31 (03) : 219 - 229
  • [3] Peripheral features for HMM-based speech recognition
    Fukuda, T
    Takigawa, M
    Nitta, T
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 129 - 132
  • [4] A BAYESIAN APPROACH TO HMM-BASED SPEECH SYNTHESIS
    Hashimoto, Kei
    Zen, Heiga
    Nankaku, Yoshihiko
    Masuko, Takashi
    Tokuda, Keiichi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4029 - +
  • [5] Use of voicing features in HMM-based speech recognition
    Thomson, DL
    Chengalvarayan, R
    [J]. SPEECH COMMUNICATION, 2002, 37 (3-4) : 197 - 211
  • [6] Modified Viterbi Scoring for HMM-Based Speech Recognition
    Jo, Jihyuck
    Kim, Han-Gyu
    Park, In-Cheol
    Jung, Bang Chul
    Yoo, Hoyoung
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2019, 25 (02): : 351 - 358
  • [7] Normalized training for HMM-based visual speech recognition
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    Kitamura, Tadashi
    Kobayashi, Takao
    [J]. ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (11): : 40 - 50
  • [8] Normalized training for HMM-based visual speech recognition
    Nankaku, Y
    Tokuda, K
    Kitamura, T
    Kobayashi, T
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 234 - 237
  • [9] Simplified scoring methods for HMM-based speech recognition
    Paramonov, Pavel
    Sutula, Nadezhda
    [J]. SOFT COMPUTING, 2016, 20 (09) : 3455 - 3460
  • [10] An HMM-based method for Thai spelling speech recognition
    Pisarn, C.
    Theeramunkong, T.
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2007, 54 (01) : 76 - 95