Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)

被引:44
|
作者
Koolagudi, Shashidhar G. [1 ]
Rastogi, Deepika [1 ]
Rao, K. Sreenivasa [2 ]
机构
[1] Graph Era Univ, Sch Comp, Dehra Dun 248002, Uttarakhand, India
[2] Indian Inst Technol, Kharagpur 721302, W Bengal, India
关键词
Gaussian Mixture Model; Language identification; Mel-frequency Cepstral Coefficient; Spectral features;
D O I
10.1016/j.proeng.2012.06.392
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper focuses on the task of identifying a language from speech signal. In this paper, we have use Mel-frequency cepstral coefficient as features. Language identification models are developed for fifteen Indian languages namely Assamese, Bangla, Guajarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Nepali, Oriya, Punjabi, Rajasthani, Tamil, Telugu and Urdu using these spectral features. The identification of above mentioned languages is carried out using Gaussian mixture model. A Semi natural read database is used for obtaining the language specific information. MFCC is obtained by using linear cosine transform of log power spectrum on a nonlinear mel-frequency scale. This paper shows that the performance of Language identification system is better when trained and tested with twenty nine features as compared to six, eight, thirteen, nineteen and twenty one MECC features. It means more the number of features we use better the result we get. The average language recognition rate over fifteen Indian languages is around 88\%. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:3391 / 3398
页数:8
相关论文
共 50 条
  • [1] Mel-frequency Cepstral Coefficients for Eye Movement Identification
    Nguyen Viet Cuong
    Vu Dinh
    Lam Si Tung Ho
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 253 - 260
  • [2] MUSICAL INSTRUMENT IDENTIFICATION USING MULTISCALE MEL-FREQUENCY CEPSTRAL COEFFICIENTS
    Sturm, Bob L.
    Morvidone, Marcela
    Daudet, Laurent
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 477 - 481
  • [3] Walk Identification using a smart carpet and Mel-Frequency Cepstral Coefficient (MFCC) features
    Muheidat, Fadi
    Tyrer, W. Harry
    Popescu, Mihail
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4249 - 4252
  • [4] Fingerprint Recognition Using Mel-Frequency Cepstral Coefficients
    Hashad F.G.
    Halim T.M.
    Diab S.M.
    Sallam B.M.
    El-Samie F.E.A.
    Pattern Recognition and Image Analysis, 2010, 20 (03) : 360 - 369
  • [5] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Zhang Jun
    Sam Kwong
    Wei Gang
    Qingyang Hong
    EURASIP Journal on Advances in Signal Processing, 2004
  • [6] Using Mel-Frequency Cepstral Coefficients in Missing Data Technique
    Jun, Z. (zhj_angun@sina.com.cn), 1600, Hindawi Publishing Corporation (2004):
  • [7] Voice Recognition and Marking Using Mel-frequency Cepstral Coefficients
    Sheu, Jia-Shing
    Chen, Ching-Wen
    SENSORS AND MATERIALS, 2020, 32 (10) : 3209 - 3220
  • [8] Using Mel-frequency cepstral coefficients in missing data technique
    Jun, Z
    Kwong, S
    Gang, W
    Hong, QY
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (03) : 340 - 346
  • [9] Computing Mel-frequency cepstral coefficients on the power spectrum
    Molau, S
    Pitz, M
    Schlüter, R
    Ney, H
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 73 - 76
  • [10] Feature Extraction of some Quranic Recitation using Mel-Frequency Cepstral Coeficients (MFCC)
    Bezoui, Mouaz
    Elmoutaouakkil, Abdelmajid
    Beni-hssane, Abderrahim
    PROCEEDINGS OF 2016 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2016, : 127 - 131