Speaker independent acoustic modeling using speaker normalization

Authors
Ishii, J [1]
Fukada, T [1]
Affiliations
[1] ATR, Interpreting Telecommun Res Labs, Kyoto 61902, Japan
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper proposes a novel speaker-independent (SI) modeling method for spontaneous speech data from multiple speakers. To obtain a more accurate acoustic model, the SI model parameters are estimated by training separately for inter-speaker variability and for intra-speaker, phonetically related variation. A linear transformation technique is used for speaker normalization, to extract the intra-speaker phonetically related variation, and is also used to re-estimate the inter-speaker variability. The proposed modeling is evaluated on Japanese spontaneous speech data using continuous-density Gaussian mixture HMMs. Experimental results show that the proposed acoustic model reduces the word error rate relative to the standard SI model, regardless of the type of acoustic model used.
Pages: 97-100
Number of pages: 4