Speaker identification based on combination of MFCC and UMRT based features

Cited by: 8
Authors
Antony, Anett [1]
Gopikakumari, R. [1]
Affiliation
[1] CUSAT, Sch Engn, Div Elect Engn, Cochin 682022, Kerala, India
Keywords
Speaker identification; MFCC; UMRT; ANN
DOI
10.1016/j.procs.2018.10.393
Chinese Library Classification (CLC) number
TP301 [Theory and methods]
Subject classification code
081202
Abstract
This paper introduces an isolated-word speaker identification system based on a new feature extractor and an Artificial Neural Network (ANN). The system is designed for both text-independent and text-dependent speaker identification of English words. Speech is recorded using an audio wave recorder, and preprocessing is then applied to the recorded speech signals. UMRT is a transform that has previously been used for image compression. Combinations of MFCC and UMRT features are used as the feature extractor. The features are classified using a multi-layer perceptron trained with the back-propagation algorithm. Accuracy is computed from the confusion matrix. The accuracy achieved is around 97.91% for speech-dependent systems and around 94.44% for speech-independent systems. © 2018 The Authors. Published by Elsevier B.V.
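As a minimal sketch of the evaluation step the abstract mentions, overall accuracy can be read off a confusion matrix as the sum of the diagonal entries divided by the total number of samples. The function name `accuracy_from_confusion` and the 3-speaker matrix below are hypothetical and purely illustrative; the counts are not taken from the paper, and the paper's exact evaluation protocol is not described in this record.

```python
def accuracy_from_confusion(matrix):
    """Overall accuracy: correctly classified samples / all samples.

    `matrix` is a square list of lists where matrix[i][j] counts samples
    of true class i that were predicted as class j.
    """
    total = sum(sum(row) for row in matrix)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")
    # Diagonal entries are the samples whose predicted class equals the true class.
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total


# Hypothetical confusion matrix for 3 speakers (rows: true, columns: predicted).
# Illustrative counts only; NOT results from the paper.
confusion = [
    [9, 1, 0],
    [0, 10, 0],
    [1, 0, 9],
]

accuracy = accuracy_from_confusion(confusion)
print(f"overall accuracy: {accuracy:.2%}")
```

The same diagonal-over-total ratio applies regardless of how many speakers are enrolled; per-speaker rates could be obtained by dividing each diagonal entry by its row sum instead.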
Pages: 250-257
Number of pages: 8