Speaker identification based on combination of MFCC and UMRT based features

被引:8
|
作者
Antony, Anett [1 ]
Gopikakumari, R. [1 ]
机构
[1] CUSAT, Sch Engn, Div Elect Engn, Cochin 682022, Kerala, India
关键词
Speaker identification; MFCC; UMRT; ANN;
D O I
10.1016/j.procs.2018.10.393
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper introduces an isolated word speaker identification system based on a new feature extractor and using Artificial Neural Network. The system is designed for both text independent and text dependent speaker identification system for English words. The speech is recorded using audio wave recorder. Then the preprocessing is applied for the given speech signals. UMRT is a transform which has been used for image compression. Combinations of MFCC and UMRT are taken and are used as a feature extractor. The classification of the features is done using Multi-layer perceptron with back propagation algorithm. The accuracy is taken using confusion matrix. The accuracy achieved is around 97.91% for speech dependent systems while for speech independent system the accuracy is around 94.44%. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:250 / 257
页数:8
相关论文
共 50 条
  • [31] HISTOGRAM TRANSFORM MODEL USING MFCC FEATURES FOR TEXT-INDEPENDENT SPEAKER IDENTIFICATION
    Yu, Hong
    Ma, Zhanyu
    Li, Minyue
    Guo, Jun
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 500 - 504
  • [32] Gender Identification of a Speaker Using MFCC and GMM
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
  • [33] MFCC and Similarity Measurements for Speaker Identification Systems
    Maazouzi, A.
    Aqili, N.
    Aamoud, A.
    Raji, M.
    Hammouch, A.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT 2017), 2017,
  • [34] Noise Robust Speaker Identification by Dividing MFCC
    Matsumoto, Kizuki
    Hayasaka, Noboru
    Iiguni, Youji
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 652 - 655
  • [35] Combination of pitch and MFCC GMM supervectors for speaker verification
    Huang, Wei
    Chao, Jianshu
    Zhang, Yaxin
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1335 - 1339
  • [36] Automatic spoken language identification using MFCC based time series features
    Mainak Biswas
    Saif Rahaman
    Ali Ahmadian
    Kamalularifin Subari
    Pawan Kumar Singh
    Multimedia Tools and Applications, 2023, 82 : 9565 - 9595
  • [37] Unsupervised speaker segmentation with residual phase and MFCC features
    Jothilakshmi, S.
    Ramalingam, V.
    Palanivel, S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9799 - 9804
  • [38] Comparative Analysis of Speaker Recognition System Based on Voice Activity Detection Technique, MFCC and PLP Features
    Kalia, Akanksha
    Sharma, Shikar
    Pandey, Saurabh Kumar
    Jadoun, Vinay Kumar
    Das, Madhulika
    INTELLIGENT COMPUTING TECHNIQUES FOR SMART ENERGY SYSTEMS, 2020, 607 : 781 - 787
  • [39] Automatic spoken language identification using MFCC based time series features
    Biswas, Mainak
    Rahaman, Saif
    Ahmadian, Ali
    Subari, Kamalularifin
    Singh, Pawan Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (07) : 9565 - 9595
  • [40] Fusion of TEO Phase with MFCC Features for Speaker Verification
    Agrawal, Purvi
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 161 - 166