Capture inter-speaker information with a neural network for speaker identification

被引:0
|
作者
Wang, L [1 ]
Chen, K [1 ]
Chi, HH [1 ]
机构
[1] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China
关键词
D O I
10.1109/IJCNN.2000.861465
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. In this paper, we propose a novel method for the use of inter-speaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture inter-speaker information from output space of those statistical models. In order to sufficiently utilize inter-speaker information, a rival penalized encoding rule is proposed to design supervised learning pairs for training the neural network. Comparative results in the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
引用
收藏
页码:247 / 252
页数:6
相关论文
共 50 条
  • [1] Towards better capturing inter-speaker information by active learning for speaker identification
    Lan, W
    Ke, C
    Hui, SC
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2975 - 2980
  • [2] Capture interspeaker information with a neural network for speaker identification
    Wang, L
    Chen, K
    Chi, HS
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): : 436 - 445
  • [3] INTER-SPEAKER VARIATION IN COMPOUND PROMINENCE
    Bell, Melanie J.
    LINGUE E LINGUAGGIO, 2015, 14 (01) : 61 - 78
  • [4] Investigations on inter-speaker variability in the feature space
    Haeb-Umbach, R.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 397 - 400
  • [5] Investigations on inter-speaker variability in the feature space
    Haeb-Umbach, R
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 397 - 400
  • [6] Modeling inter-speaker variability in speech recognition
    Cloarec, Gwenael
    Jouvet, Denis
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4529 - 4532
  • [7] THE INFLUENCE OF INTER-SPEAKER AND INTRA-SPEAKER TEMPO ON FUNDAMENTAL-FREQUENCY AND PALATALIZATION
    COOPER, WE
    SOARES, C
    HAM, A
    DAMON, K
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (05): : 1723 - 1730
  • [8] Eliminating inter-speaker variability prior to discriminant transforms
    Saon, G
    Padmanabhan, M
    Gopinath, R
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 73 - 76
  • [9] Inter-speaker interaction of F0 in dialogs
    Kakita, K
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 689 - 692
  • [10] Inter-speaker variability: speaker normalisation and quantitative estimation of articulatory invariants in speech production for French
    Serrurier, Antoine
    Badin, Pierre
    Boe, Louis-Jean
    Lamalle, Laurent
    Neuschaefer-Rube, Christiane
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2272 - 2276