Speaker Dependent Continuous Kannada Speech Recognition Using HMM

被引:9
|
作者
Hemakumar, G. [1 ,2 ]
Punitha, P. [3 ]
机构
[1] Bharathiar Univ, Coimbatore, Tamil Nadu, India
[2] Govt Coll Women, Dept Comp Sci, Mandya, India
[3] PESIT, Dept MCA, Bangalore, Karnataka, India
关键词
Speaker Dependent; Short time energy; magnitude of signal;
D O I
10.1109/ICICA.2014.88
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of Kannada speech recognition. The designed algorithm recognizes continuous Kannada speech using HMM Method and works in the speaker dependent mode. The proposed method first preprocesses the original Kannada speech signal then framing is done for every 20 millisecond with an overlapping of 6.5 millisecond. Secondly voiced part is detected through computing dynamic threshold using short time energy and magnitude of signal. Thirdly in the voiced part of signal extracts Linear-Predictive Coding (LPC) coefficients, and converts them into Real Cepstrum Coefficients. Fourthly, Real Cepstrum Coefficients are passed into k-means clustering algorithm keeping k = 3 and then passed into Baum-Welch Algorithm, using this 3 state HMM model is designed for each syllables / subwords / sentence. In this paper for experiment used 20 unique sentences which can use has commands to simple mobile sets. Each of these sentences was recorded for 10 times for training and 3 times for testing of one male speaker. The command success rate of individually uttered of sentences in experiments is excellent and has reached accuracy rate of 87.76% and miss rate of about 12.24%, the precision of 0.56, recall rate of 0.68 and F1 measure of 0.61. Computations are done using Mat lab.
引用
收藏
页码:402 / 405
页数:4
相关论文
共 50 条
  • [1] Speaker Independent Urdu Speech Recognition Using HMM
    Ashraf, Javed
    Iqbal, Naveed
    Khattak, Naveed Sarfraz
    Zaidi, Ather Mohsin
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 6177 : 140 - 148
  • [2] Speaker Dependent, Speaker Independent and Cross Language Emotion Recognition From Speech Using GMM and HMM
    Bhaykar, Manav
    Yadav, Jainath
    Rao, K. Sreenivasa
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [3] Kannada Continuous Speech Recognition Using Deep Learning
    Paul, Shubhojeet
    Bhattacharjee, Vandana
    Saha, Sujan Kumar
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT IV, 2024, 2093 : 258 - 269
  • [4] Compensation of speaker directivity in speech recognition using HMM composition
    Giron, F
    Minami, Y
    Tanaka, M
    Furuya, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 253 - 256
  • [5] Speech/speaker recognition using a HMM/GMM hybrid model
    Rodriguez, E
    Ruiz, B
    Garcia-Crespo, A
    Garcia, F
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 227 - 234
  • [6] Continuous Kannada Noisy Speech Recognition
    Pasha, Nadeem
    Roopa, S.
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 857 - 861
  • [7] RECOGNITION OF SPEAKER-DEPENDENT CONTINUOUS SPEECH WITH KEAL
    MERCIER, G
    BIGORGNE, D
    MICLET, L
    LEGUENNEC, L
    QUERRE, M
    IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02): : 145 - 154
  • [8] Continuous Speech Recognition of Kannada Language using Triphone Modeling
    Sajjan, Sharada C.
    Vijaya, C.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 451 - 455
  • [9] Continuous Hindi Speech Recognition Using Gaussian Mixture HMM
    Kuamr, Ankit
    Dua, Mohit
    Choudhary, Tripti
    2014 IEEE STUDENTS' CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER SCIENCE (SCEECS), 2014,
  • [10] Speaker Independent Isolated Speech Recognition System for Tamil Language using HMM
    Vimala, C.
    Radha, V.
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 1097 - 1102