A STUDY ON MINIMUM ERROR DISCRIMINATIVE TRAINING FOR SPEAKER RECOGNITION

被引:24
|
作者
LIU, CS [1 ]
LEE, CH [1 ]
CHOU, W [1 ]
JUANG, BH [1 ]
ROSENBERG, AE [1 ]
机构
[1] AT&T BELL LABS,SPEECH RES DEPT,MURRAY HILL,NJ 07974
来源
关键词
D O I
10.1121/1.412286
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The use of discriminative training to construct hidden Markov models of speakers for verification and identification is studied. As opposed to conventional maximum likelihood training which estimates a speaker's model based only on the training utterances from the same speaker, a discriminative training approach is used which takes into account the models of other competing speakers and formulates the optimization criterion such that speaker separation is enhanced and speaker recognition error rate on the training data is directly minimized. The optimization solution is obtained with a probabilistic descent algorithm. For all experiments an isolated digit database consisting of 100 speakers is used. For speaker identification, the resulting discriminative speaker models reduce the identification error rate by more than 25% over the results obtained with the conventional training algorithm. A new normalized score function is proposed which makes the verification formulation consistent with the minimum error training objective. When combining the proposed verification score function with discriminative training, an average equal error rate of 0.8% is achieved using only one-digit test utterances. This represents an error rate reduction of over 80% from an average equal error rate of 6.1% when using the conventional algorithm for training and the unnormalized score function for testing. © 1995, Acoustical Society of America. All rights reserved.
引用
下载
收藏
页码:637 / 648
页数:12
相关论文
共 50 条
  • [1] A STUDY ON SPEAKER ADAPTATION FOR MANDARINE SYLLABLE RECOGNITION WITH MINIMUM ERROR DISCRIMINATIVE TRAINING
    LIN, CH
    WU, CH
    CHANG, PC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (06) : 712 - 718
  • [2] Incremental speaker adaptation with minimum error discriminative training for speaker identification
    delAlamo, CM
    Alvarez, J
    delaTorre, C
    Poyatos, FJ
    Hernandez, L
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1760 - 1763
  • [3] Minimum Phone Error Discriminative Training For Mandarin Chinese Speaker Adaptation
    Chen, Liang-Yu
    Lee, Chun-Jen
    Jang, Jyh-Shing Roger
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1241 - +
  • [4] Minimum error discriminative training for radical-based online Chinese handwriting recognition
    Zhang, Yaodong
    Liu, Peng
    Soong, Frank K.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 53 - +
  • [5] Discriminative training for large-vocabulary speech recognition using minimum classification error
    McDermott, Erik
    Hazen, Timothy J.
    Le Roux, Jonathan
    Nakamura, Atsushi
    Katagiri, Shigeru
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 203 - 223
  • [6] Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
    Hong, Wei-Tyng
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 729 - 742
  • [7] Minimum classification error interactive training for speaker identification
    Kida, Y
    Yamamoto, H
    Miyajima, C
    Tokuda, K
    Kitamura, T
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 641 - 644
  • [8] Speaker identification using Minimum Classification Error training
    Siohan, O
    Rosenberg, AE
    Parthasarathy, S
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 109 - 112
  • [9] Minimum classification error/eigenvoices training for speaker identification
    Valente, F
    Wellekens, C
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 213 - 216
  • [10] Speaker verification using minimum verification error training
    Rosenberg, AE
    Siohan, O
    Parthasarathy, S
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 105 - 108