Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)

被引:44
|
作者
Koolagudi, Shashidhar G. [1 ]
Rastogi, Deepika [1 ]
Rao, K. Sreenivasa [2 ]
机构
[1] Graph Era Univ, Sch Comp, Dehra Dun 248002, Uttarakhand, India
[2] Indian Inst Technol, Kharagpur 721302, W Bengal, India
关键词
Gaussian Mixture Model; Language identification; Mel-frequency Cepstral Coefficient; Spectral features;
D O I
10.1016/j.proeng.2012.06.392
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper focuses on the task of identifying a language from speech signal. In this paper, we have use Mel-frequency cepstral coefficient as features. Language identification models are developed for fifteen Indian languages namely Assamese, Bangla, Guajarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Nepali, Oriya, Punjabi, Rajasthani, Tamil, Telugu and Urdu using these spectral features. The identification of above mentioned languages is carried out using Gaussian mixture model. A Semi natural read database is used for obtaining the language specific information. MFCC is obtained by using linear cosine transform of log power spectrum on a nonlinear mel-frequency scale. This paper shows that the performance of Language identification system is better when trained and tested with twenty nine features as compared to six, eight, thirteen, nineteen and twenty one MECC features. It means more the number of features we use better the result we get. The average language recognition rate over fifteen Indian languages is around 88\%. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
引用
收藏
页码:3391 / 3398
页数:8
相关论文
共 50 条
  • [21] Multiple time resolutions for derivatives of mel-frequency cepstral coefficients
    Stemmer, G
    Hacker, C
    Nöth, E
    Niemann, H
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 37 - 40
  • [22] How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language
    Hasan, Md. Rakibul
    Hasan, Md. Mahbub
    Hossain, Md Zakir
    JOURNAL OF ENGINEERING-JOE, 2021, 2021 (12): : 817 - 827
  • [23] Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients
    de Pinto, Marco Giuseppe
    Polignano, Marco
    Lops, Pasquale
    Semeraro, Giovanni
    2020 IEEE INTERNATIONAL CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2020,
  • [24] Modified Mel-Frequency cepstral coefficient
    Saha, G
    Yadhunandan, US
    Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 215 - 219
  • [25] Faults detection using Gaussian mixture models, mel-frequency cepstral coefficients and kurtosis
    Nelwamondo, Fulufhelo V.
    Marwala, Tshilidzi
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 290 - 295
  • [26] Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning
    Ayvaz, Ugur
    Guruler, Huseyin
    Khan, Faheem
    Ahmed, Naveed
    Whangbo, Taegkeun
    Bobomirzaevich, Abdusalomov Akmalbek
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 5511 - 5521
  • [27] Voice Control for a Gripper using Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
    Velasco-Hernandez, Gustavo
    Diaz-Toro, Andres
    2015 20TH SYMPOSIUM ON SIGNAL PROCESSING, IMAGES AND COMPUTER VISION (STSIVA), 2015,
  • [28] Pitch Prediction from Mel-frequency Cepstral Coefficients Using Sparse Spectrum Recovery
    Rao, Achuth M., V
    Ghosh, Prasanta Kumar
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [29] Mel-Frequency Cepstral Coefficients Using Formants-Based Gaussian Distribution Filterbank
    Son, Young-Woo
    Hong, Jae-Keun
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (08): : 370 - 374
  • [30] Gender of Fetus Identification Using Modified Mel-Frequency Cepstral Coefficients Based on Fractional Discrete Cosine Transform
    Azmy, Mohamed Moustafa
    IEEE ACCESS, 2024, 12 : 48158 - 48164