Capture inter-speaker information with a neural network for speaker identification

被引：0

作者：

Wang, L ^{[1
]}

Chen, K ^{[1
]}

Chi, HH ^{[1
]}

机构：

[1] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China

来源：

IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V | 2000年

关键词：

D O I：

10.1109/IJCNN.2000.861465

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many speaker identification systems are created by model-based approaches, where a statistical model is used to characterize speaker's voice and no inter-speaker information is used in parameter estimation. It is well known that inter-speaker information is very helpful in discrimination of different speakers. In this paper, we propose a novel method for the use of inter-speaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture inter-speaker information from output space of those statistical models. In order to sufficiently utilize inter-speaker information, a rival penalized encoding rule is proposed to design supervised learning pairs for training the neural network. Comparative results in the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.

引用

页码：247 / 252

页数：6

共 50 条

[1] Towards better capturing inter-speaker information by active learning for speaker identification
Lan, W
Ke, C
Hui, SC
IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2975 - 2980
[2] Capture interspeaker information with a neural network for speaker identification
Wang, L
Chen, K
Chi, HS
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02): : 436 - 445
[3] INTER-SPEAKER VARIATION IN COMPOUND PROMINENCE
Bell, Melanie J.
LINGUE E LINGUAGGIO, 2015, 14 (01) : 61 - 78
[4] Investigations on inter-speaker variability in the feature space
Haeb-Umbach, R.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 397 - 400
[5] Investigations on inter-speaker variability in the feature space
Haeb-Umbach, R
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 397 - 400
[6] Modeling inter-speaker variability in speech recognition
Cloarec, Gwenael
Jouvet, Denis
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4529 - 4532
[7] THE INFLUENCE OF INTER-SPEAKER AND INTRA-SPEAKER TEMPO ON FUNDAMENTAL-FREQUENCY AND PALATALIZATION
COOPER, WE
SOARES, C
HAM, A
DAMON, K
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (05): : 1723 - 1730
[8] Eliminating inter-speaker variability prior to discriminant transforms
Saon, G
Padmanabhan, M
Gopinath, R
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 73 - 76
[9] Inter-speaker interaction of F0 in dialogs
Kakita, K
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 689 - 692
[10] Inter-speaker variability: speaker normalisation and quantitative estimation of articulatory invariants in speech production for French
Serrurier, Antoine
Badin, Pierre
Boe, Louis-Jean
Lamalle, Laurent
Neuschaefer-Rube, Christiane
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2272 - 2276

← 1 2 3 4 5 →