Identification of Language using Mel-Frequency Cepstral Coefficients (MFCC)

Cited by: 44
Authors
Koolagudi, Shashidhar G. [1]
Rastogi, Deepika [1]
Rao, K. Sreenivasa [2]
Affiliations
[1] Graph Era Univ, Sch Comp, Dehra Dun 248002, Uttarakhand, India
[2] Indian Inst Technol, Kharagpur 721302, W Bengal, India
Keywords
Gaussian Mixture Model; Language identification; Mel-frequency Cepstral Coefficient; Spectral features;
DOI
10.1016/j.proeng.2012.06.392
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper addresses the task of identifying a language from a speech signal, using Mel-frequency cepstral coefficients (MFCCs) as features. Language identification models are developed for fifteen Indian languages, namely Assamese, Bangla, Gujarati, Hindi, Kannada, Kashmiri, Malayalam, Marathi, Nepali, Oriya, Punjabi, Rajasthani, Tamil, Telugu and Urdu, using these spectral features. The languages are identified using Gaussian mixture models. A semi-natural read database is used to obtain the language-specific information. MFCCs are obtained by applying a linear cosine transform to the log power spectrum on a nonlinear mel-frequency scale. The language identification system performs better when trained and tested with twenty-nine MFCC features than with six, eight, thirteen, nineteen or twenty-one; that is, within the tested range, using more features gives better results. The average language recognition rate over the fifteen Indian languages is around 88%. (C) 2012 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of Noorul Islam Centre for Higher Education
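The MFCC computation summarized in the abstract (a cosine transform of the log power spectrum on a mel-frequency scale) can be illustrated for a single frame. This is only a minimal sketch, not the authors' implementation: the Hamming window, the 40 triangular filters, the 2595·log10(1 + f/700) mel formula, the unnormalized DCT-II, and all function names are assumptions that the record does not specify. The `n_coeffs` parameter stands in for the feature counts compared in the paper (6, 8, 13, 19, 21, 29).

```python
import numpy as np


def hz_to_mel(hz):
    # Common mel-scale approximation (an assumption; the record gives no formula).
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / sample_rate)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (freqs - left) / (center - left)
        falling = (right - freqs) / (right - center)
        # Triangle = min of the two slopes, clipped at zero outside the band.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def mfcc_frame(frame, sample_rate, n_filters=40, n_coeffs=13):
    """MFCCs of one frame: power spectrum -> mel energies -> log -> DCT-II."""
    windowed = frame * np.hamming(frame.size)
    power = np.abs(np.fft.rfft(windowed)) ** 2
    mel_energies = mel_filterbank(n_filters, frame.size, sample_rate) @ power
    log_energies = np.log(mel_energies + 1e-10)  # small offset avoids log(0)
    # Unnormalized DCT-II basis, keeping only the first n_coeffs rows.
    n = np.arange(n_filters)
    k = np.arange(n_coeffs)[:, None]
    basis = np.cos(np.pi * k * (n + 0.5) / n_filters)
    return basis @ log_energies


# Hypothetical usage: one 25 ms frame of a 440 Hz tone sampled at 16 kHz.
rate = 16000
t = np.arange(400) / rate
frame = np.sin(2 * np.pi * 440.0 * t)
coeffs = mfcc_frame(frame, rate, n_coeffs=13)
print(coeffs.shape)
```

In a system like the one the abstract describes, such per-frame vectors would be used to train one Gaussian mixture model per language, with a test utterance assigned to the language whose model scores it highest.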
Pages: 3391-3398
Page count: 8
Related papers
50 in total
  • [31] Modified Mel-frequency Cepstral Coefficients (MMFCC) in Robust Text-dependent Speaker Identification
    Islam, Md. Atiqul
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 505 - 509
  • [32] Comparison of linear prediction cepstrum coefficients and Mel-Frequency Cepstrum Coefficients for language identification
    Wong, E
    Sridharan, S
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 95 - 98
  • [33] Classification of Heart Sounds using Linear Prediction Coefficients and Mel-Frequency Cepstral Coefficients as Acoustic Features
    Narvaez, Pedro
    Vera, Katerine
    Bedoya, Nhikolas
    Percybrooks, Winston S.
    2017 IEEE COLOMBIAN CONFERENCE ON COMMUNICATIONS AND COMPUTING (COLCOM), 2017,
  • [34] A comparative between Mel Frequency Cepstral Coefficients (MFCC) and Inverse Mel Frequency Cepstral Coefficients (IMFCC) features for an Automatic Bird Species Recognition System
    Pedroza Ramirez, Angel David
    de la Rosa Vargas, Jose Ismael
    Rosas Valdez, Rogelio
    Becerra, Aldonso
    2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,
  • [35] Mel-Frequency Cepstral Coefficient (MFCC) for Music Feature Extraction for the Dancing Robot Movement Decision
    Sulistijono, Indra Adji
    Urrosyda, Renita Chulafa
    Darojah, Zaqiatud
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2016, PT II, 2016, 9835 : 283 - 294
  • [36] Predicting fundamental frequency from mel-frequency cepstral coefficients to enable speech reconstruction
    Shao, X
    Milner, B
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (02): : 1134 - 1143
  • [37] Extracting Mel-Frequency and Bark-Frequency Cepstral Coefficients from Encrypted Signals
    Thaine, Patricia
    Penn, Gerald
    INTERSPEECH 2019, 2019, : 3715 - 3719
  • [38] Indirect health monitoring of bridges using Mel-frequency cepstral coefficients and principal component analysis
    Mei, Qipei
    Gul, Mustafa
    Boay, Marcus
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 119 : 523 - 546
  • [39] Hidden Markov Model Neurons Classification based on Mel-frequency Cepstral Coefficients
    Haggag, Sherif
    Mohamed, Shady
    Haggag, Hussein
    Nahavandi, Saeid
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE 2014), 2014, : 166 - 170
  • [40] A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
    Turner, Claude
    Joseph, Anthony
    COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 416 - 421