Text-Independent Speaker Identification Using Vowel Formants

被引:0
|
作者
Noor Almaadeed
Amar Aggoun
Abbes Amira
机构
[1] Qatar University,Department of Computer Science and Engineering, College of Engineering
[2] University of Bedfordshire,Department of Computer Science and Technology
[3] University of the West of Scotland,Department of Engineering and Computer Science
来源
关键词
Vowel formants; Speaker identification; Vowel recognition; Linear predictive coding; Mel-frequency Cepstral coefficients;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speaker identification has become a challenging research problem due to its wide variety of applications. Neural networks and audio-visual identification systems can be very powerful, but they have limitations related to the number of speakers. The performance drops gradually as more and more users are registered with the system. This paper proposes a scalable algorithm for real-time text-independent speaker identification based on vowel recognition. Vowel formants are unique across different speakers and reflect the vocal tract information of a particular speaker. The contribution of this paper is the design of a scalable system based on vowel formant filters and a scoring scheme for classification of an unseen instance. Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Coding (LPC) have both been analysed for comparison to extract vowel formants by windowing the given signal. All formants are filtered by known formant frequencies to separate the vowel formants for further processing. The formant frequencies of each speaker are collected during the training phase. A test signal is also processed in the same way to find vowel formants and compare them with the saved vowel formants to identify the speaker for the current signal. A score-based scheme allows the speaker with the highest matching formants to own the current signal. This model requires less than 100 bytes of data to be saved for each speaker to be identified, and can identify the speaker within a second. Tests conducted on multiple databases show that this score-based scheme outperforms the back propagation neural network and Gaussian mixture models. Usually, the longer the speech files, the more significant were the improvements in accuracy.
引用
收藏
页码:345 / 356
页数:11
相关论文
共 50 条
  • [1] Text-Independent Speaker Identification Using Vowel Formants
    Almaadeed, Noor
    Aggoun, Amar
    Amira, Abbes
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (03): : 345 - 356
  • [2] Text-Independent Speaker Identification Using Formants and Convolutional Neural Networks
    Camarena-Ibarrola, Antonio
    Reynoso, Miguel
    Figueroa, Karina
    [J]. ADVANCES IN SOFT COMPUTING (MICAI 2021), PT II, 2021, 13068 : 108 - 119
  • [3] Text-independent speaker identification
    Gish, Herbert
    Schmidt, Michael
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 1994, 11 (04) : 18 - 32
  • [4] Text-independent speaker identification using temporal patterns
    Bocklet, Tobias
    Maier, Andreas
    Noeth, Elmar
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2007, 4629 : 318 - 325
  • [5] Text-independent Speaker Identification in Birds
    Fox, E. J. S.
    Roberts, J. D.
    Bennamoun, M.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2122 - 2125
  • [6] Text-independent speaker identification using fenonic speaker Markov modeling
    Birnbaum, M
    Brown, KL
    Bardenhagen, S
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 677 - 680
  • [7] A TEXT-INDEPENDENT SPEAKER RECOGNITION SYSTEM BASED ON VOWEL SPOTTING
    FAKOTAKIS, N
    TSOPANOGLOU, A
    KOKKINAKIS, G
    [J]. SPEECH COMMUNICATION, 1993, 12 (01) : 57 - 68
  • [8] Text-independent speaker identification using robust statistics estimation
    El Ayadi, Moataz
    Hassan, Abdel-Karim S. O.
    Abdel-Naby, Ahmed
    Elgendy, Omar A.
    [J]. SPEECH COMMUNICATION, 2017, 92 : 52 - 63
  • [9] Text-independent speaker identification in noisy background
    Zhou, Y
    Xu, BL
    [J]. PROGRESS IN NATURAL SCIENCE, 2001, 11 : S384 - S387
  • [10] Text-Independent Speaker Identification Using the Histogram Transform Model
    Ma, Zhanyu
    Yu, Hong
    Tan, Zheng-Hua
    Guo, Jun
    [J]. IEEE ACCESS, 2016, 4 : 9733 - 9739