Speaker independent acoustic modeling using speaker normalization

被引：0

作者：

Ishii, J ^{[1
]}

Fukada, T ^{[1
]}

机构：

[1] ATR, Interpreting Telecommun Res Labs, Kyoto 61902, Japan

来源：

PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6 | 1998年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a novel speaker-independent (SI) modeling for spontaneous speech data from multiple speakers. The SI acoustic model parameters are estimated by individual training for inter-speaker variability and for intra-speaker phonetically related variation in order to obtain a more accurate acoustic model. The linear transformation technique is used for the speaker normalization to extract intra-speaker phonetically related variation and also is used for the re-estimation of inter-speaker variability. The proposed modeling is evaluated for a Japanese spontaneous speech data, using continuous density mixture Gaussian HMMs. Experimental results from the use of proposed acoustic model show that the reductions in word error rate can be achieved over the standard SI model regardless the type of acoustic model used.

引用

页码：97 / 100

页数：4

共 50 条

[1] LIKELIHOOD NORMALIZATION FOR SPEAKER VERIFICATION USING A PHONEME-INDEPENDENT AND SPEAKER-INDEPENDENT MODEL
MATSUI, T
FURUI, S
[J]. SPEECH COMMUNICATION, 1995, 17 (1-2) : 109 - 116
[2] Text-independent speaker identification using fenonic speaker Markov modeling
Birnbaum, M
Brown, KL
Bardenhagen, S
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 677 - 680
[3] A study on speaker normalization using vocal tract normalization and speaker adaptive training
Welling, L
Haeb-Umbach, R
Aubert, X
Haberland, N
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 797 - 800
[4] Speaker verification score normalization using speaker model clusters
Apsingekar, Vijendra Raj
De Leon, Phillip L.
[J]. SPEECH COMMUNICATION, 2011, 53 (01) : 110 - 118
[5] Speaker recognition and speaker normalization by projection to speaker subspace
Ariki, Y
Tagashira, S
Nishijima, M
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 319 - 322
[6] Speaker-Independent Silent Speech Recognition with Across-Speaker Articulatory Normalization and Speaker Adaptive Training
Wang, Jun
Hahm, Seongjun
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2415 - 2419
[7] Unsupervised speaker adaptation for speaker independent acoustic to articulatory speech inversion
Sivaraman, Ganesh
Mitra, Vikramjit
Nam, Hosung
Tiede, Mark
Espy-Wilson, Carol
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (01): : 316 - 329
[8] A new cohort normalization using local acoustic information for speaker verification
Isobe, T
Takahashi, J
[J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 841 - 844
[9] Across-speaker Articulatory Normalization for Speaker-independent Silent Speech Recognition
Wang, Jun
Samal, Ashok
Green, Jordan R.
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1179 - 1183
[10] Vocal tract length normalization for speaker independent acoustic-to-articulatory speech inversion
Sivaraman, Ganesh
Mitra, Vikramjit
Nam, Hosung
Tiede, Mark
Espy-Wilson, Carol
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 455 - 459

← 1 2 3 4 5 →