On neural networks-based preprocessing for speaker identification

Cited by: 0
Authors
Tadj, C. [1]
Affiliation
[1] Ecole Technol Super, Montreal, PQ H3C 1K3, Canada
Keywords
speaker-discriminant feature; feature extraction; nonlinear; discriminant analysis; neural networks;
DOI
Not available
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
In this paper, we study a Nonlinear Discriminant Analysis (NLDA) technique that extracts a speaker-discriminant feature set. We present three system architectures for extracting features that are more invariant to non-speaker-related conditions such as handset type and channel effects: (a) the first approach uses a Dynamic Programming Vector Quantization (DPVQ) algorithm; (b) the second approach uses a simple Multi-Layer Perceptron (MLP) network; (c) in the third, the MLP takes the DPVQ output as its input and maximizes the separation between speakers by nonlinearly projecting a large set of acoustic features onto a lower-dimensional feature set. The proposed architecture exploits both the temporal variation of the speech signal and the discriminative power of neural networks. The extracted features are optimized to discriminate between speakers and to be robust to mismatched training and testing conditions. The transformed features are then used to train an HMM-based speaker identification system. The proposed architectures were trained and tested on 60 speakers from the YOHO corpus. The results show an average improvement in speaker identification performance of more than 18% compared with an HMM-based reference system.
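The MLP-based projection described in the abstract can be illustrated with a minimal sketch. It is not the paper's architecture: the DPVQ stage is omitted, and the single tanh hidden layer, the layer sizes, the learning rate, and the synthetic "acoustic" data are all assumptions made for illustration. The idea shown is only the general one: train an MLP to classify speakers, then use the activations of a narrow hidden layer as the lower-dimensional, speaker-discriminant features.

```python
import numpy as np


def train_nlda_mlp(X, y, n_speakers, bottleneck_dim=8, epochs=300, lr=0.1, seed=0):
    """Train a one-hidden-layer MLP speaker classifier with full-batch
    gradient descent. All hyperparameters are illustrative assumptions."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W1 = rng.normal(0.0, 0.1, (d, bottleneck_dim))
    b1 = np.zeros(bottleneck_dim)
    W2 = rng.normal(0.0, 0.1, (bottleneck_dim, n_speakers))
    b2 = np.zeros(n_speakers)
    Y = np.eye(n_speakers)[y]  # one-hot speaker targets
    losses = []
    for _ in range(epochs):
        # Forward pass: nonlinear projection, then softmax over speakers.
        H = np.tanh(X @ W1 + b1)
        logits = H @ W2 + b2
        logits = logits - logits.max(axis=1, keepdims=True)
        P = np.exp(logits)
        P = P / P.sum(axis=1, keepdims=True)
        losses.append(float(-np.mean(np.log(P[np.arange(n), y] + 1e-12))))
        # Backward pass for the mean cross-entropy loss.
        d_logits = (P - Y) / n
        dW2 = H.T @ d_logits
        db2 = d_logits.sum(axis=0)
        dZ = (d_logits @ W2.T) * (1.0 - H ** 2)
        dW1 = X.T @ dZ
        db1 = dZ.sum(axis=0)
        W1 -= lr * dW1
        b1 -= lr * db1
        W2 -= lr * dW2
        b2 -= lr * db2
    return (W1, b1, W2, b2), losses


def extract_features(X, params):
    """Return the hidden-layer activations as the reduced feature set."""
    W1, b1, _, _ = params
    return np.tanh(X @ W1 + b1)


# Synthetic stand-in for acoustic frames: 3 hypothetical speakers,
# 100 frames each, 40-dimensional input features.
rng = np.random.default_rng(1)
n_speakers, frames_per_speaker, input_dim = 3, 100, 40
speaker_means = rng.normal(0.0, 1.0, (n_speakers, input_dim))
y = np.repeat(np.arange(n_speakers), frames_per_speaker)
X = speaker_means[y] + rng.normal(0.0, 1.0, (y.size, input_dim))

params, losses = train_nlda_mlp(X, y, n_speakers)
features = extract_features(X, params)  # shape: (300, 8)
```

In a pipeline like the one the abstract outlines, `features` would replace the raw acoustic vectors as input to the HMM-based identification stage; that stage is not sketched here.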
Pages: 373-377
Number of pages: 5