On neural networks-based preprocessing for speaker identification

被引：0

作者：

Tadj, C ^{[1
]}

机构：

[1] Ecole Technol Super, Montreal, PQ H3C 1K3, Canada

来源：

6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II | 2002年

关键词：

speaker-discriminant feature; feature extraction; nonlinear; discriminant analysis; neural networks;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we study a Nonlinear Discriminant Analysis (NLDA) technique that extracts a speaker-discriminant feature set. We present different system architectures to extract features that are more invariant to non-speakers-related conditions such as handset types and channel effects: (a) the first approach uses a Dynamic Programming Vector Quantization (DPVQ) Algorithm. (b) The second approach uses a simple Multi Layer Perceptron (MLP) network. (c) Finally the MLP uses DPVQ as an input to the network to maximize the separation between speakers by nonlinearly projecting a large set of acoustic features to a lower-dimensional feature set. The architecture proposed takes into account both the temporal changing of the speech signal and the powerful of the neural networks. The extracted features are optimized to discriminate between speakers and to be robust to mismatched training and testing conditions. The transformed features are used to train an HMM-based Speaker Identification system. The proposed architectures have been trained and tested over 60 speaker's Yoho corpus. The results have shown an average improvement of Speaker Identification performance by more than 18% compared to an HMM-Based reference system.

引用

页码：373 / 377

页数：5

共 50 条

[1] Speaker identification based on neural networks
Marhon, Sajid A.
Al-Aghar, Duaa N. Ubaid
[J]. NEURAL NETWORK WORLD, 2006, 16 (04) : 277 - 290
[2] Stability Analysis of Neural Networks-Based System Identification
Korkobi, Talel
Djemel, Mohamed
Chtourou, Andmohamed
[J]. MODELLING AND SIMULATION IN ENGINEERING, 2008, 2008
[3] Enhancing Neural Networks-based Classification of Incipient Faults in Power Transformers via Preprocessing
Rocha Reis, Agnaldo J.
Castanheira, Luciana G.
Barbosa, Ruben C.
[J]. 2013 1ST BRICS COUNTRIES CONGRESS ON COMPUTATIONAL INTELLIGENCE AND 11TH BRAZILIAN CONGRESS ON COMPUTATIONAL INTELLIGENCE (BRICS-CCI & CBIC), 2013, : 622 - 627
[4] Speaker recognition using dynamic synapse based neural networks with wavelet preprocessing
George, S
Dibazar, A
Liaw, JS
Berger, TW
[J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1122 - 1125
[5] Plant virus identification based on neural networks with evolutionary preprocessing
Glezakos, Thomas J.
Moschopoulou, Georgia
Tsiligiridis, Theodore A.
Kintzios, Spiridon
Yialouris, Constantine P.
[J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2010, 70 (02) : 263 - 275
[6] Speaker identification using neural networks
Pawar, RV
Kajave, PP
Mali, SN
[J]. ENFORMATIKA, VOL 7: IEC 2005 PROCEEDINGS, 2005, : 429 - 433
[7] Speaker Identification using Neural Networks
Pawar, R. V.
Kajave, P. P.
Mali, S. N.
[J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 7, 2005, 7 : 429 - 433
[8] Neural Networks-Based Cryptography: A Survey
Meraouche, Ishak
Dutta, Sabyasachi
Tan, Haowen
Sakurai, Kouichi
[J]. IEEE Access, 2021, 9 : 124727 - 124740
[9] Neural Networks-Based Cryptography: A Survey
Meraouche, Ishak
Dutta, Sabyasachi
Tan, Haowen
Sakurai, Kouichi
[J]. IEEE ACCESS, 2021, 9 : 124727 - 124740
[10] A Neural Networks-Based Method for Single-Phase Harmonic Content Identification
Nascimento, Claudionor F.
Oliveira, Azauri A., Jr.
Goedtel, Alessandro
Silva, Ivan N.
Suetake, Marcelo
[J]. IECON 2008: 34TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-5, PROCEEDINGS, 2008, : 2606 - +

← 1 2 3 4 5 →