Capture interspeaker information with a neural network for speaker identification

被引:11
|
作者
Wang, L [1 ]
Chen, K
Chi, HS
机构
[1] Univ Cambridge, Dept Engn, Speech Vis & Robotocs Grp, Cambridge CA2 1PZ, England
[2] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[3] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China
[4] Peking Univ, Ctr Informat Sci, Beijing 100871, Peoples R China
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002年 / 13卷 / 02期
基金
中国国家自然科学基金;
关键词
interspeaker information; KING speech corpus; model-based method; neural networks; query-based learning algorithm; rival penalized encoding scheme; speaker identification;
D O I
10.1109/72.991429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model-based approach is one of methods widely used for speaker identification, where a statistical model is used to characterize a specific speaker's voice but no interspeaker information is involved in its parameter estimation. It is observed that interspeaker information is very helpful in discriminating between different speakers. In this paper, we propose a novel method for the use of interspeaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture the interspeaker information from the output space of those statistical models. In order to sufficiently utilize interspeaker information, a rival penalized encoding rule is proposed to design supervised learning pairs. For better generalization, moreover, a query-based learning algorithm is presented to actively select the input data of interest during training of the neural network. Comparative results on the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
引用
收藏
页码:436 / 445
页数:10
相关论文
共 50 条
  • [21] Arabic word dependent speaker identification system using artificial neural network
    Al-Qaisi A.
    International Journal of Circuits, Systems and Signal Processing, 2020, 14 : 290 - 295
  • [22] Speaker identification in noisy environment using bispectrum analysis and probabilistic neural network
    Kusumoputro, B
    Triyanto, A
    Fanany, MI
    Jatmiko, W
    ICCIMA 2001: FOURTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, 2001, : 282 - 287
  • [23] Further Results on Speaker Identification Using Robust Speech Detection and a Neural Network
    Ouzounov, Atanas
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2009, 9 (01) : 37 - 45
  • [24] Speaker identification using hybrid neural network support vector machine classifier
    Karthikeyan V.
    Priyadharsini S.S.
    Balamurugan K.
    Ramasamy M.
    International Journal of Speech Technology, 2022, 25 (4) : 1041 - 1053
  • [25] Text Dependent Speaker Identification and Speech Recognition Using Artificial Neural Network
    Swamy, Suma
    Shalini, T.
    Nagabhushan, Sindhu P.
    Nawaz, Sumaiah
    Ramakrishnan, K. V.
    GLOBAL TRENDS IN COMPUTING AND COMMUNICATION SYSTEMS, PT 1, 2012, 269 : 160 - +
  • [26] Speaker identification system using empirical mode decomposition and an artificial neural network
    Wu, Jian-Da
    Tsai, Yi-Jang
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 6112 - 6117
  • [27] Speaker Identification Based On Gammatone Cepstral Coefficients And General Regression Neural Network
    Li, Penghua
    Hu, Fangchao
    Li, Yinguo
    Qiu, Baomei
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 751 - 756
  • [28] Speaker identification using neural networks
    Pawar, RV
    Kajave, PP
    Mali, SN
    ENFORMATIKA, VOL 7: IEC 2005 PROCEEDINGS, 2005, : 429 - 433
  • [29] Speaker identification based on neural networks
    Marhon, Sajid A.
    Al-Aghar, Duaa N. Ubaid
    NEURAL NETWORK WORLD, 2006, 16 (04) : 277 - 290
  • [30] Speaker Identification using Neural Networks
    Pawar, R. V.
    Kajave, P. P.
    Mali, S. N.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 7, 2005, 7 : 429 - 433