Development of Language Identification System using MFCC and Vector Quantization

被引:0
|
作者
Gunawan, Teddy Surya [1 ]
Husain, Rashida [1 ]
Kartiwi, Mira [2 ]
机构
[1] Int Islamic Univ Malaysia, Elect & Comp Engn Dept, Kuala Lumpur, Malaysia
[2] Int Islamic Univ Malaysia, Informat Syst Dept, Kuala Lumpur, Malaysia
关键词
Keywords language identification; MFCC; VQ; recognition rate;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper investigates the development of language identification based on Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) algorithm. In this study, a total of ten speakers were chosen randomly with different languages from online language database. A total of six males and four females were selected as subjects for this research and each of them spoke different languages, including Arabic, Chinese, English, Korean and Malay. The MFCC will be extracted to derive the related feature vector. Vector Quantization (VQ) algorithm is then used as classifier. The recognition rate is then calculated for each language. Several experiments were conducted to find the optimum parameters, in which we found that sampling frequency of 16000 Hz and codebook size of 75 provided good results. On average, the recognition rate for all five languages evaluated was 78%. The experimental results show that our proposed system provides a good recognition rate.
引用
下载
收藏
页数:4
相关论文
共 50 条
  • [21] A Speaker Identification System using MFCC Features with VQ Technique
    Zulfiqar, Ali
    Muhammad, Aslam
    Enriquez A M, Martinez
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 115 - +
  • [22] Automatic spoken language identification using MFCC based time series features
    Biswas, Mainak
    Rahaman, Saif
    Ahmadian, Ali
    Subari, Kamalularifin
    Singh, Pawan Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (07) : 9565 - 9595
  • [23] Vector Quantization in Language Independent Speaker Identification Using Mel-Frequency Cepstrum Co-efficient
    Ambika, D.
    Radha, V.
    NETWORKS AND COMMUNICATIONS (NETCOM2013), 2014, 284 : 171 - 182
  • [24] On building local models for inverse system identification with vector quantization algorithms
    Souza, Luis Gustavo M.
    Barreto, Guilherme A.
    NEUROCOMPUTING, 2010, 73 (10-12) : 1993 - 2005
  • [25] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
    Barhoush, Mahdi
    Hallawa, Ahmed
    Schmeink, Anke
    2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33
  • [26] Speaker identification based on vector quantization
    Radová, V
    Svenda, Z
    TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 341 - 344
  • [27] Improved Language Identification Using Support Vector Machines for Language Modeling
    Yang, Xi
    Zhai, Lu-Feng
    Siu, Manhung
    Gish, Herbert
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 417 - +
  • [28] An MFCC-based Speaker Identification System
    Leu, Fang-Yie
    Lin, Guan-Liang
    2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 1055 - 1062
  • [29] Automatic language identification using support vector machines
    Zhang, Wenlin
    Li, Bicheng
    Qu, Dan
    Wang, Bingxi
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 728 - +
  • [30] Automatic speech recognition using Mel-frequency cepstrum coefficient (MFCC) and vector quantization (VQ) techniques for continuous speech
    Verma, Amit
    Kumar, Amit
    Kaur, Iqbaldeep
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2018, 5 (04): : 73 - 78