HMM Based Language Identification from Speech Utterances of Popular Indic Languages Using Spectral and Prosodic Features

Cited by: 1
Authors
Sadanandam, Manchala [1]
Affiliation
[1] Kakatiya Univ, Univ Engn Coll, CSE, Warangal 506009, Telangana, India
Keywords
Language Identification System (LID); acoustic features; prosodic features; HMM; Indian spoken languages; pitch and MFCC;
DOI
10.18280/ts.380232
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A language identification (LID) system automatically recognises the language of a short, unknown speech utterance. It extracts discriminative features from the utterance and uses them to determine the language to which the utterance belongs. In this paper, we build an LID system on feature vectors that concatenate Mel Frequency Cepstral Coefficients (MFCC) with pitch. We train one reference model per language as a Hidden Markov Model (HMM) over 14-dimensional feature vectors, and then evaluate each test utterance against the reference models of all listed languages. The likelihood of the test feature vectors under each model decides the language of the unknown utterance. We consider seven Indian languages in the experimental setup and evaluate the performance of the system. The average performance of the system on 3 s test utterances is 89.31% with three-state HMMs and 90.63% with four-state HMMs. The system thus gives significant results on 3 s test speech with four-state HMMs even though the procedure is simple.
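The decision rule described in the abstract (score the test utterance against one HMM per language and pick the most likely one) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the split of the 14 dimensions into 13 MFCCs plus 1 pitch value, the diagonal-Gaussian emissions, the uniform start and transition probabilities, the toy model parameters, and the labels `lang_a` and `lang_b` are all assumptions made for the example. Training (for instance with Baum-Welch) is omitted; the models are filled in by hand.

```python
import math

def concat_features(mfcc_frames, pitch_values):
    """Append one pitch value to each MFCC frame (assumed 13 + 1 = 14 dims)."""
    return [list(m) + [p] for m, p in zip(mfcc_frames, pitch_values)]

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))

def log_likelihood(frames, hmm):
    """Forward algorithm in the log domain: log P(frames | hmm)."""
    n = len(hmm["log_start"])
    emit = lambda x, s: log_gauss_diag(x, hmm["means"][s], hmm["vars"][s])
    alpha = [hmm["log_start"][s] + emit(frames[0], s) for s in range(n)]
    for x in frames[1:]:
        alpha = [logsumexp([alpha[p] + hmm["log_trans"][p][s] for p in range(n)])
                 + emit(x, s)
                 for s in range(n)]
    return logsumexp(alpha)

def identify(frames, models):
    """Return the best-scoring language label and all per-language scores."""
    scores = {lang: log_likelihood(frames, hmm) for lang, hmm in models.items()}
    return max(scores, key=scores.get), scores

def make_toy_hmm(center, n_states=3, dim=14):
    """Hand-built HMM with uniform transitions (hypothetical, untrained)."""
    log_u = math.log(1.0 / n_states)
    return {
        "log_start": [log_u] * n_states,
        "log_trans": [[log_u] * n_states for _ in range(n_states)],
        "means": [[center + 0.1 * s] * dim for s in range(n_states)],
        "vars": [[1.0] * dim for _ in range(n_states)],
    }

# Two hypothetical language models and a synthetic test utterance of 10 frames.
models = {"lang_a": make_toy_hmm(0.0), "lang_b": make_toy_hmm(5.0)}
mfcc = [[5.0] * 13 for _ in range(10)]
pitch = [5.0] * 10
frames = concat_features(mfcc, pitch)
best, scores = identify(frames, models)
print(best, scores)
```

Working in the log domain avoids numerical underflow when many frame likelihoods are multiplied, which matters even for utterances as short as 3 s. Changing `n_states` to 4 in `make_toy_hmm` gives the four-state configuration mentioned in the abstract.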
Pages: 521-528
Number of pages: 8
Related papers
  • [21] Emotion Recognition Using Prosodic and Spectral Features of Speech and Naive Bayes Classifier
    Khan, Atreyee
    Roy, Uttam Kumar
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1017 - 1021
  • [23] Dialect Identification Using Spectral and Prosodic Features on Single and Ensemble Classifiers
    Chittaragi, Nagaratna B.
    Prakash, Ambareesh
    Koolagudi, Shashidhar G.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (08) : 4289 - 4302
  • [24] Improvement of Naturalness for an HMM-based Vietnamese Speech Synthesis using the Prosodic information
    Thanh-Son Phan
    Tu-Cuong Duong
    Anh-Tuan Dinh
    Tat-Thang Vu
    Chi-Mai Luong
    PROCEEDINGS OF 2013 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES: RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2013, : 276 - 281
  • [26] Neural network classifiers for language identification using phonotactic and prosodic features
    Mary, L
    Rao, KS
    Yegnanarayana, B
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 404 - 408
  • [27] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [28] Spoken Language Identification Using Spectral Features
    Koolagudi, Shashidhar G.
    Rastogi, Deepika
    Rao, K. Sreenivasa
    CONTEMPORARY COMPUTING, 2012, 306 : 496 - +
  • [30] An Innovative Method for Speech Signal Emotion Recognition Based on Spectral Features Using GMM and HMM Techniques
    Al-Khazraji, Mohammed Jawad Al-Dujaili
    Ebrahimi-Moghadam, Abbas
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 134 (02) : 735 - 753