Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network

被引：0

作者：

Kusumoputro, B ^{[1
]}

Triyanto, A ^{[1
]}

Fanany, MI ^{[1
]}

Jatmiko, W ^{[1
]}

机构：

[1] Univ Indonesia, Fac Comp Sci, Computat Intelligence Res Lab, Jakarta, Indonesia

来源：

ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS | 2001年

关键词：

D O I：

10.1109/ICCIMA.2001.970480

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The work described in this paper addresses the application of a neural processing for extracting bispectrum feature of speech data, and the use of probabilistic neural network as a classifier in an automatic speech recognition system. The usually, used feature extraction paradigm in the early, development of the speech recognition system is power spectrum analysis, however, the recognition rate of this system is not high enough, especially, when a Gaussian noise is added to the utterance speech data. In this paper, we developed a speaker identification system using bispectrum feature analysis. To analysis the distribution of the bispectrum data along its two dimensional representation, we developed an adaptive feature extraction mechanism of the bispectrum speech data based on cascade neural network. A cascade configuration of SOFM (Self-Organizing Feature Map) and LVQ (Learning Vector Quantization) is used as an adaptive codebook generation algorithm for determining the feature distribution of the bispectrum speech data. The K-L transformation (K-LT) technique is then used as a preprocessing element before the neural classifier is utilized. This K-LT has shown as an effective procedure for orthogonalization and dimensionality reduction of the codebook vectors generated from bispectrum data. Experimental results show that our system could performed with high recognition rate on the undirected utterance speech, especially, when a higher number of codebook vectors are utilized. It is also shown that the use of PNN could increase the recognition rate significantly, even using speech data with additional Gaussian noise.

引用

页码：282 / 287

页数：6

共 50 条

[1] Bispectrum analysis for speaker identification in noisy environment with Karhunen-Loeve transformation technique
Kusumoputro, B
Fanany, I
Indrawati, D
[J]. HYBRID IMAGE AND SIGNAL PROCESSING VII, 2000, 4044 : 143 - 149
[2] Speaker Identification using Wavelet Shannon Entropy and Probabilistic Neural Network
Lei, Lei
She, Kun
[J]. 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 566 - 571
[3] Identification and tracking of particular speaker in noisy environment
Sawada, H
Ohkado, M
[J]. MACHINE VISION AND ITS OPTOMECHATRONIC APPLICATIONS, 2004, 5603 : 138 - 145
[4] Speaker Recognition Based on Principal Component Analysis and Probabilistic Neural Network
Zhou, Yan
Shang, Li
[J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 708 - 715
[5] Effective learning in noisy environment using neural network ensemble
Hartono, P
Hashimoto, S
[J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL II, 2000, : 179 - 184
[6] Neural network comparison of speech recognition system using trispectrum analysis in noisy environment
Kusumoputro, B
Triyanto, A
[J]. INTELLIGENT ROBOTS AND COMPUTER VISION XX: ALGORITHMS, TECHNIQUES, AND ACTIVE VISION, 2001, 4572 : 445 - 450
[7] Speaker identification using a hybrid neural network and conformity approach
Ouzounov, A
[J]. SIGNAL ANALYSIS & PREDICTION I, 1997, : 455 - 458
[8] Speaker Identification Using Robust Speech Detection and Neural Network
Ouzounov, Atanas
[J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2007, 7 (03) : 48 - 54
[9] A real time speaker identification using artificial neural network
Hossain, Md. Murad
Ahmed, Boshir
Asrafi, Mahrnuda
[J]. PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 325 - 329
[10] Speaker Identification System Using Wavelet Transform and Neural Network
Daqrouq, K.
Abu Hilal, T.
Sherif, M.
El-Hajar, S.
Al-Qawasmi, A.
[J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 560 - +

← 1 2 3 4 5 →