Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network

被引:0
|
作者
Kusumoputro, B [1 ]
Triyanto, A [1 ]
Fanany, MI [1 ]
Jatmiko, W [1 ]
机构
[1] Univ Indonesia, Fac Comp Sci, Computat Intelligence Res Lab, Jakarta, Indonesia
关键词
D O I
10.1109/ICCIMA.2001.970480
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The work described in this paper addresses the application of a neural processing for extracting bispectrum feature of speech data, and the use of probabilistic neural network as a classifier in an automatic speech recognition system. The usually, used feature extraction paradigm in the early, development of the speech recognition system is power spectrum analysis, however, the recognition rate of this system is not high enough, especially, when a Gaussian noise is added to the utterance speech data. In this paper, we developed a speaker identification system using bispectrum feature analysis. To analysis the distribution of the bispectrum data along its two dimensional representation, we developed an adaptive feature extraction mechanism of the bispectrum speech data based on cascade neural network. A cascade configuration of SOFM (Self-Organizing Feature Map) and LVQ (Learning Vector Quantization) is used as an adaptive codebook generation algorithm for determining the feature distribution of the bispectrum speech data. The K-L transformation (K-LT) technique is then used as a preprocessing element before the neural classifier is utilized. This K-LT has shown as an effective procedure for orthogonalization and dimensionality reduction of the codebook vectors generated from bispectrum data. Experimental results show that our system could performed with high recognition rate on the undirected utterance speech, especially, when a higher number of codebook vectors are utilized. It is also shown that the use of PNN could increase the recognition rate significantly, even using speech data with additional Gaussian noise.
引用
收藏
页码:282 / 287
页数:6
相关论文
共 50 条
  • [1] Bispectrum analysis for speaker identification in noisy environment with Karhunen-Loeve transformation technique
    Kusumoputro, B
    Fanany, I
    Indrawati, D
    [J]. HYBRID IMAGE AND SIGNAL PROCESSING VII, 2000, 4044 : 143 - 149
  • [2] Speaker Identification using Wavelet Shannon Entropy and Probabilistic Neural Network
    Lei, Lei
    She, Kun
    [J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 566 - 571
  • [3] Identification and tracking of particular speaker in noisy environment
    Sawada, H
    Ohkado, M
    [J]. MACHINE VISION AND ITS OPTOMECHATRONIC APPLICATIONS, 2004, 5603 : 138 - 145
  • [4] Speaker Recognition Based on Principal Component Analysis and Probabilistic Neural Network
    Zhou, Yan
    Shang, Li
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 708 - 715
  • [5] Effective learning in noisy environment using neural network ensemble
    Hartono, P
    Hashimoto, S
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL II, 2000, : 179 - 184
  • [6] Neural network comparison of speech recognition system using trispectrum analysis in noisy environment
    Kusumoputro, B
    Triyanto, A
    [J]. INTELLIGENT ROBOTS AND COMPUTER VISION XX: ALGORITHMS, TECHNIQUES, AND ACTIVE VISION, 2001, 4572 : 445 - 450
  • [7] Speaker identification using a hybrid neural network and conformity approach
    Ouzounov, A
    [J]. SIGNAL ANALYSIS & PREDICTION I, 1997, : 455 - 458
  • [8] Speaker Identification Using Robust Speech Detection and Neural Network
    Ouzounov, Atanas
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 48 - 54
  • [9] A real time speaker identification using artificial neural network
    Hossain, Md. Murad
    Ahmed, Boshir
    Asrafi, Mahrnuda
    [J]. PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 325 - 329
  • [10] Speaker Identification System Using Wavelet Transform and Neural Network
    Daqrouq, K.
    Abu Hilal, T.
    Sherif, M.
    El-Hajar, S.
    Al-Qawasmi, A.
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 560 - +