A STUDY ON MINIMUM ERROR DISCRIMINATIVE TRAINING FOR SPEAKER RECOGNITION

Cited by: 24

Authors:

LIU, CS [1]

LEE, CH [1]

CHOU, W [1]

JUANG, BH [1]

ROSENBERG, AE [1]

Affiliation:

[1] AT&T BELL LABS,SPEECH RES DEPT,MURRAY HILL,NJ 07974

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 1995 / Vol. 97 / Issue 01

DOI:

10.1121/1.412286

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

The use of discriminative training to construct hidden Markov models of speakers for verification and identification is studied. As opposed to conventional maximum likelihood training which estimates a speaker's model based only on the training utterances from the same speaker, a discriminative training approach is used which takes into account the models of other competing speakers and formulates the optimization criterion such that speaker separation is enhanced and speaker recognition error rate on the training data is directly minimized. The optimization solution is obtained with a probabilistic descent algorithm. For all experiments an isolated digit database consisting of 100 speakers is used. For speaker identification, the resulting discriminative speaker models reduce the identification error rate by more than 25% over the results obtained with the conventional training algorithm. A new normalized score function is proposed which makes the verification formulation consistent with the minimum error training objective. When combining the proposed verification score function with discriminative training, an average equal error rate of 0.8% is achieved using only one-digit test utterances. This represents an error rate reduction of over 80% from an average equal error rate of 6.1% when using the conventional algorithm for training and the unnormalized score function for testing. © 1995, Acoustical Society of America. All rights reserved.
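The abstract gives no formulas, so the following is only a minimal sketch of the kind of objective it describes, written under stated assumptions. It uses the commonly cited minimum classification error form: a misclassification measure that compares the true speaker's score with a soft maximum over competing speakers, passed through a sigmoid to give a smooth error count. The function names, the parameters eta and gamma, and the particular form of the normalized verification score are hypothetical choices for illustration, not the authors' definitions.

```python
import math


def misclassification_measure(scores, target, eta=2.0):
    """Assumed MCE-style measure: negative means the target speaker wins.

    scores: dict mapping speaker id -> log-likelihood-like score.
    d = -g_target + (1/eta) * log(mean_j exp(eta * g_j)), j != target.
    """
    competitors = [g for spk, g in scores.items() if spk != target]
    m = max(competitors)
    # Shift by the maximum so the exponentials cannot overflow.
    mean_exp = sum(math.exp(eta * (g - m)) for g in competitors) / len(competitors)
    return -scores[target] + m + math.log(mean_exp) / eta


def smoothed_error(d, gamma=1.0):
    """Sigmoid of the measure: a differentiable stand-in for the 0/1 error."""
    z = gamma * d
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def normalized_score(scores, claimed, eta=2.0):
    """Hypothetical normalized verification score: the claimed speaker's
    score relative to the competing speakers (the negated measure)."""
    return -misclassification_measure(scores, claimed, eta)


# Toy scores for one test utterance against three speaker models.
toy_scores = {"spk_a": -10.0, "spk_b": -12.0, "spk_c": -14.0}
d_true = misclassification_measure(toy_scores, "spk_a")
d_wrong = misclassification_measure(toy_scores, "spk_c")
```

In a descent-based trainer, the model parameters would be nudged after each training utterance in the direction that lowers smoothed_error. For verification under this sketch, a claim would be accepted when normalized_score exceeds a threshold, so that testing uses the same quantity that training optimizes.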

Pages: 637-648

Page count: 12

50 records in total

[1] A STUDY ON SPEAKER ADAPTATION FOR MANDARINE SYLLABLE RECOGNITION WITH MINIMUM ERROR DISCRIMINATIVE TRAINING
LIN, CH
WU, CH
CHANG, PC
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 712 - 718
[2] Incremental speaker adaptation with minimum error discriminative training for speaker identification
delAlamo, CM
Alvarez, J
delaTorre, C
Poyatos, FJ
Hernandez, L
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1760 - 1763
[3] Minimum Phone Error Discriminative Training For Mandarin Chinese Speaker Adaptation
Chen, Liang-Yu
Lee, Chun-Jen
Jang, Jyh-Shing Roger
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1241 - +
[4] Minimum error discriminative training for radical-based online Chinese handwriting recognition
Zhang, Yaodong
Liu, Peng
Soong, Frank K.
ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 53 - +
[5] Discriminative training for large-vocabulary speech recognition using minimum classification error
McDermott, Erik
Hazen, Timothy J.
Le Roux, Jonathan
Nakamura, Atsushi
Katagiri, Shigeru
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01) : 203 - 223
[6] Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
Hong, Wei-Tyng
JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 729 - 742
[7] Minimum classification error interactive training for speaker identification
Kida, Y
Yamamoto, H
Miyajima, C
Tokuda, K
Kitamura, T
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 641 - 644
[8] Speaker identification using Minimum Classification Error training
Siohan, O
Rosenberg, AE
Parthasarathy, S
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 109 - 112
[9] Minimum classification error/eigenvoices training for speaker identification
Valente, F
Wellekens, C
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 213 - 216
[10] Speaker verification using minimum verification error training
Rosenberg, AE
Siohan, O
Parthasarathy, S
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 105 - 108