Capture interspeaker information with a neural network for speaker identification

Cited by: 11
Authors
Wang, L [1]
Chen, K
Chi, HS
Affiliations
[1] Univ Cambridge, Dept Engn, Speech Vis & Robotics Grp, Cambridge CB2 1PZ, England
[2] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[3] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China
[4] Peking Univ, Ctr Informat Sci, Beijing 100871, Peoples R China
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002 / Vol. 13 / No. 2
Funding
National Natural Science Foundation of China;
Keywords
interspeaker information; KING speech corpus; model-based method; neural networks; query-based learning algorithm; rival penalized encoding scheme; speaker identification;
DOI
10.1109/72.991429
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The model-based approach is one of the methods widely used for speaker identification: a statistical model characterizes a specific speaker's voice, but no interspeaker information is involved in its parameter estimation. It is observed that interspeaker information is very helpful in discriminating between different speakers. In this paper, we propose a novel method that uses interspeaker information to improve the performance of a model-based speaker identification system. A neural network is employed to capture the interspeaker information from the output space of those statistical models. To make full use of interspeaker information, a rival penalized encoding rule is proposed to design supervised learning pairs. Moreover, for better generalization, a query-based learning algorithm is presented to actively select the input data of interest during training of the neural network. Comparative results on the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
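The abstract's idea of learning from the output space of the speaker models can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: diagonal Gaussians stand in for the statistical speaker models, the data are synthetic, and `rival_penalized_target` is a hypothetical encoding (+1 for the true speaker, a negative value for the best-scoring rival, 0 elsewhere) that may differ from the rule the authors actually propose.

```python
import numpy as np

def fit_gaussian(frames):
    """Diagonal Gaussian per speaker (assumed stand-in for the statistical model)."""
    return frames.mean(axis=0), frames.var(axis=0) + 1e-6

def log_likelihood(frames, model):
    """Per-frame log-likelihood under a diagonal Gaussian."""
    mean, var = model
    return -0.5 * (np.log(2 * np.pi * var) + (frames - mean) ** 2 / var).sum(axis=1)

def score_vector(frames, models):
    """Average log-likelihood of one utterance under every speaker model.

    This vector is a point in the models' output space, i.e. the kind of
    input a neural network could be trained on.
    """
    return np.array([log_likelihood(frames, m).mean() for m in models])

def rival_penalized_target(scores, true_idx, penalty=-1.0):
    """Hypothetical target encoding, NOT the paper's exact rule:
    +1 for the true speaker, `penalty` for the strongest rival, 0 elsewhere."""
    target = np.zeros(len(scores))
    rivals = scores.astype(float).copy()
    rivals[true_idx] = -np.inf           # exclude the true speaker from the rival search
    target[np.argmax(rivals)] = penalty  # penalize the most confusable rival
    target[true_idx] = 1.0
    return target

# Synthetic example: three "speakers" with well-separated feature means.
rng = np.random.default_rng(0)
speaker_means = [0.0, 3.0, 6.0]
models = [fit_gaussian(rng.normal(mu, 1.0, size=(200, 4))) for mu in speaker_means]

utterance = rng.normal(speaker_means[1], 1.0, size=(50, 4))  # spoken by speaker 1
scores = score_vector(utterance, models)
target = rival_penalized_target(scores, true_idx=1)
```

Each `(scores, target)` pair would serve as one supervised learning pair for the network; the query-based selection of training inputs described in the abstract is not sketched here.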
Pages: 436-445
Page count: 10
Related papers
50 in total
  • [1] Capture inter-speaker information with a neural network for speaker identification
    Wang, L
    Chen, K
    Chi, HH
    IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V, 2000, : 247 - 252
  • [2] A Deep Neural Network Model for Speaker Identification
    Ye, Feng
    Yang, Jun
    APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [3] A study of interspeaker variability in speaker verification
    Kenny, Patrick
    Ouellet, Pierre
    Dehak, Najim
    Gupta, Vishwa
    Dumouchel, Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 980 - 988
  • [4] Wavelet LPC with neural network for speaker identification system
    Daqrouq, Khaled
    Morfeq, Ali
    Ajour, Mohammad
    Alkhateeb, Abdulhameed
    WSEAS Transactions on Signal Processing, 2013, 9 (04): : 216 - 226
  • [5] Neural network approaches to capture temporal information
    van Veelen, M
    Nijhuis, J
    Spaanenburg, B
    COMPUTING ANTICIPATORY SYSTEMS, 2000, 517 : 361 - 371
  • [6] Speaker identification using a hybrid neural network and conformity approach
    Ouzounov, A
    SIGNAL ANALYSIS & PREDICTION I, 1997, : 455 - 458
  • [7] A real time speaker identification using artificial neural network
    Hossain, Md. Murad
    Ahmed, Boshir
    Asrafi, Mahmuda
    PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 325 - 329
  • [8] Speaker Identification System Using Wavelet Transform and Neural Network
    Daqrouq, K.
    Abu Hilal, T.
    Sherif, M.
    El-Hajar, S.
    Al-Qawasmi, A.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 560 - +
  • [9] Priority ordered BP neural network and the application for speaker identification
    Deng, HJ
    Du, LM
    Wang, SJ
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 671 - 674
  • [10] Towards Speaker Identification System based on Dynamic Neural Network
    Ivanovas, E.
    Navakauskas, D.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 18 (10) : 69 - 72