An MFCC-based Speaker Identification System

Cited by: 19
Authors
Leu, Fang-Yie [1]
Lin, Guan-Liang [1]
Affiliations
[1] Tunghai Univ, Comp Sci Dept, Taichung, Taiwan
Keywords
speaker identification; Fourier transformation; Mel-frequency cepstral coefficients; Gaussian mixture model; acoustic model; recognition
DOI
10.1109/AINA.2017.130
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Speech recognition applications are now in widespread use; typical examples are Siri on the iPhone, the Google speech recognition system, and voice-operated mobile phones. Speaker identification, by contrast, is still relatively immature. In this paper, we therefore study a speaker identification technique that first takes the original voice signals of a person, e.g., Bob, and normalizes their audio energy. The audio signals are then converted from the time domain to the frequency domain by a Fourier transformation. Next, an MFCC-based model of human auditory filtering is used to measure the energy levels at different frequencies, which serve as the quantified characteristics of Bob's voice. The probability density function of a Gaussian mixture model then describes the distribution of these quantified characteristics and serves as Bob's specific acoustic model. When the system receives the voice of an unknown person, x, it processes the voice with the same procedure and compares the result, x's acoustic model, with the acoustic models of known people collected beforehand in an acoustic-model database to identify the most likely speaker.
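The pipeline described in the abstract (energy normalization, Fourier transform, mel filtering, Gaussian mixture modelling, comparison against enrolled models) can be illustrated with a minimal standard-library Python sketch. This is not the authors' implementation. All function names, frame sizes, filter counts, and the synthetic test signals are assumptions made for illustration. A naive DFT stands in for a fast Fourier transform, and the unknown voice is scored by average log-likelihood under each enrolled model, which is a common comparison rule but may differ from the one used in the paper.

```python
import cmath
import math
import random

def normalize_energy(signal):
    """Scale the signal to unit RMS energy."""
    rms = math.sqrt(sum(s * s for s in signal) / len(signal))
    return [s / rms for s in signal] if rms > 0 else list(signal)

def power_spectrum(frame):
    """Power spectrum for bins 0..N/2 via a naive DFT (stand-in for an FFT)."""
    n = len(frame)
    return [abs(sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))) ** 2 / n for k in range(n // 2 + 1)]

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    top = hz_to_mel(sample_rate / 2)
    mels = [i * top / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int((n_fft + 1) * mel_to_hz(m) / sample_rate) for m in mels]
    bank = []
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for b in range(left, center):
            filt[b] = (b - left) / (center - left)
        for b in range(center, right):
            filt[b] = (right - b) / (right - center)
        bank.append(filt)
    return bank

def extract_features(signal, sample_rate=8000, frame_len=64, hop=32,
                     n_filters=10, n_coeffs=6):
    """One MFCC vector per Hamming-windowed frame (sizes are assumptions)."""
    signal = normalize_energy(signal)
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + t] *
                 (0.54 - 0.46 * math.cos(2 * math.pi * t / (frame_len - 1)))
                 for t in range(frame_len)]
        spec = power_spectrum(frame)
        log_e = [math.log(sum(w * p for w, p in zip(f, spec)) + 1e-10) for f in bank]
        # DCT-II of the log filterbank energies, skipping coefficient 0
        feats.append([sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                          for j, e in enumerate(log_e))
                      for c in range(1, n_coeffs + 1)])
    return feats

def component_logs(x, gmm):
    """log(weight * N(x; mean, diag var)) for each mixture component."""
    return [math.log(w) + sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
                              for xi, m, v in zip(x, mean, var))
            for w, mean, var in gmm]

def train_gmm(features, n_components=2, n_iter=10, seed=0):
    """Fit a diagonal-covariance GMM with a few EM iterations."""
    n, dim = len(features), len(features[0])
    mu = [sum(x[d] for x in features) / n for d in range(dim)]
    var0 = [max(sum((x[d] - mu[d]) ** 2 for x in features) / n, 1e-3)
            for d in range(dim)]
    starts = random.Random(seed).sample(features, n_components)
    gmm = [(1.0 / n_components, list(m), list(var0)) for m in starts]
    for _ in range(n_iter):
        resp = []  # E-step: responsibility of each component for each frame
        for x in features:
            logs = component_logs(x, gmm)
            top = max(logs)
            p = [math.exp(l - top) for l in logs]
            resp.append([pi / sum(p) for pi in p])
        new = []   # M-step: re-estimate weights, means, variances
        for k in range(n_components):
            nk = sum(r[k] for r in resp) + 1e-10
            mean = [sum(r[k] * x[d] for r, x in zip(resp, features)) / nk
                    for d in range(dim)]
            var = [max(sum(r[k] * (x[d] - mean[d]) ** 2
                           for r, x in zip(resp, features)) / nk, 1e-3)
                   for d in range(dim)]
            new.append((max(nk / n, 1e-6), mean, var))
        gmm = new
    return gmm

def avg_log_likelihood(features, gmm):
    """Average per-frame log-likelihood of the features under the GMM."""
    total = 0.0
    for x in features:
        logs = component_logs(x, gmm)
        top = max(logs)
        total += top + math.log(sum(math.exp(l - top) for l in logs))
    return total / len(features)

def identify(features, database):
    """Return the enrolled name whose model best explains the features."""
    return max(database, key=lambda name: avg_log_likelihood(features, database[name]))

def synth_voice(f0, seed, n=2000, sample_rate=8000):
    """Hypothetical stand-in for a recording: two harmonics plus noise."""
    rng = random.Random(seed)
    return [math.sin(2 * math.pi * f0 * t / sample_rate)
            + 0.5 * math.sin(4 * math.pi * f0 * t / sample_rate)
            + 0.3 * rng.gauss(0, 1) for t in range(n)]
```

To try it, one could enroll two synthetic speakers, for example `db = {"bob": train_gmm(extract_features(synth_voice(300, seed=1))), "alice": train_gmm(extract_features(synth_voice(1100, seed=2)))}`, and then call `identify(extract_features(synth_voice(300, seed=3)), db)`. A practical system would use real recordings, an FFT library, longer frames, and more mixture components.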
Pages: 1055-1062
Page count: 8