Large Vocabulary Speech Recognition: Speaker Dependent and Speaker Independent

被引:0
|
作者
Hemakumar, G. [1 ]
Punitha, P. [2 ]
机构
[1] Govt Coll Women, Dept Comp Sci, Mandya, India
[2] PESIT, Dept MCA, Bangalore, Karnataka, India
关键词
Speaker independent; Speaker dependent; Normal fit; Baum-Welch algorithm;
D O I
10.1007/978-81-322-2250-7_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of large vocabulary isolated word and continuous Kannada speech recognition using the syllables and combination of Hidden Markov Model (HMM) and Normal fit method. The models designed for speaker dependent and speaker independent mode of working. This experiment has covered 6 million words among the 10 million words from Hampi text corpus. Here 3-state Baum-Welch algorithm is used for training. For the 2 successor outputted lambda(A, B, pi) is combined and passed into normal fit, the outputted normal fit parameter is labeled has syllable or sub-word. In terms of memory requirement and recognition rate the proposed model is compared with Gaussian Mixture Model and HMM (3-state Baum-Welch algorithm). This paper clearly shows that combination of HMM and normal fit technique will reduce the memory size while building and storing the speech models and works with excellent recognition rate. The average WRR is 91.22 % and average WER is 8.78 %. All computations are done using mat lab.
引用
下载
收藏
页码:73 / 80
页数:8
相关论文
共 50 条
  • [21] SPEAKER-CONSISTENT PARSING FOR SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION
    YAMAGUCHI, K
    SINGER, H
    MATSUNAGA, S
    SAGAYAMA, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 719 - 724
  • [22] Speaker adaptation techniques for speech recognition with a speaker-independent phonetic recognizer
    Kim, WG
    Jang, M
    COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 95 - 100
  • [23] SPEAKER-DEPENDENT LARGE VOCABULARY WORD RECOGNITION USING THE SPLIT METHOD
    SUGAMURA, N
    FURUI, S
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1986, 34 (03): : 327 - 333
  • [24] SPEAKER ADAPTATION BY VARIABLE REFERENCE MODEL SUBSPACE AND APPLICATION TO LARGE VOCABULARY SPEECH RECOGNITION
    Teng, Wen Xuan
    Gravier, Guillaume
    Bimbot, Frederic
    Soufflet, Frederic
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4381 - 4384
  • [25] Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition
    Vergin, R
    O'Shaughnessy, D
    Farhat, A
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 525 - 532
  • [26] Robust out-of-vocabulary rejection for low-complexity speaker independent speech recognition
    Broun, CC
    Campbell, WM
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1811 - 1814
  • [27] Biomimetic pattern recognition for speaker-independent speech recognition
    Qin, H
    Wang, SJ
    Sun, H
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1290 - 1294
  • [28] Across-speaker Articulatory Normalization for Speaker-independent Silent Speech Recognition
    Wang, Jun
    Samal, Ashok
    Green, Jordan R.
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1179 - 1183
  • [29] Speaker-independent speech recognition based on tree-structured speaker clustering
    Kosaka, T
    Matsunaga, S
    Sagayama, S
    COMPUTER SPEECH AND LANGUAGE, 1996, 10 (01): : 55 - 74
  • [30] Speaker Adversarial Neural Network (SANN) for Speaker-independent Speech Emotion Recognition
    Fahad, Md Shah
    Ranjan, Ashish
    Deepak, Akshay
    Pradhan, Gayadhar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (11) : 6113 - 6135