Capture inter-speaker information with a neural network for speaker identification

被引:0
|
作者
Wang, L [1 ]
Chen, K [1 ]
Chi, HH [1 ]
机构
[1] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China
关键词
D O I
10.1109/IJCNN.2000.861465
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. In this paper, we propose a novel method for the use of inter-speaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture inter-speaker information from output space of those statistical models. In order to sufficiently utilize inter-speaker information, a rival penalized encoding rule is proposed to design supervised learning pairs for training the neural network. Comparative results in the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
引用
收藏
页码:247 / 252
页数:6
相关论文
共 50 条
  • [21] SPECTRAL DISTRIBUTION CUES - COMPARATIVE-STUDY BASED ON 2 INTRA-SPEAKER AND INTER-SPEAKER DISCRIMINATING ANALYSES
    CAELEN, G
    VIGOUROUX, N
    SPEECH COMMUNICATION, 1983, 2 (2-3) : 133 - 136
  • [22] Studies on inter-speaker variability in speech and its application in automatic speech recognition
    S UMESH
    Sadhana, 2011, 36 : 853 - 883
  • [23] Studies on inter-speaker variability in speech and its application in automatic speech recognition
    Umesh, S.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 853 - 883
  • [24] DEEP NEURAL NETWORK TRAINED WITH SPEAKER REPRESENTATION FOR SPEAKER NORMALIZATION
    Tang, Yun
    Mohan, Aanchan
    Rose, Richard C.
    Ma, Chengyuan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech
    Zajic, Zbynek
    Zelinka, Jan
    Mueller, Ludek
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 555 - 563
  • [26] Characterization of inter-speaker articulatory variability: A two-level multi-speaker modelling approach based on MRI data
    Serrurier, Antoine
    Badin, Pierre
    Lamalle, Laurent
    Neuschaefer-Rube, Christiane
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04): : 2149 - 2170
  • [27] Speaker matters: Natural inter-speaker variation affects 4-month-olds' perception of audio-visual speech
    Pejovic, Jovana
    Yee, Eiling
    Molnar, Monika
    FIRST LANGUAGE, 2020, 40 (02) : 113 - 127
  • [28] Speaker identification using a hybrid neural network and conformity approach
    Ouzounov, A
    SIGNAL ANALYSIS & PREDICTION I, 1997, : 455 - 458
  • [29] A real time speaker identification using artificial neural network
    Hossain, Md. Murad
    Ahmed, Boshir
    Asrafi, Mahrnuda
    PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 325 - 329
  • [30] Speaker Identification System Using Wavelet Transform and Neural Network
    Daqrouq, K.
    Abu Hilal, T.
    Sherif, M.
    El-Hajar, S.
    Al-Qawasmi, A.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTATIONAL TOOLS FOR ENGINEERING APPLICATIONS, 2009, : 560 - +